src/engines/TTSEngine/BaseTTSEngine.py

from abc import abstractmethod
from typing import TypedDict

import moviepy as mp
import whisper_timestamped as wt
from torch.cuda import is_available

from ..BaseEngine import BaseEngine


class Word(TypedDict):
    start: str
    end: str
    text: str


class BaseTTSEngine(BaseEngine):
    @abstractmethod
    def synthesize(self, text: str, path: str) -> float:
        pass

    def force_duration(self, duration: float, path: str):
        """
        Forces the audio clip at the given path to have the specified duration.

        Args:
            duration (float): The desired duration in seconds.
            path (str): The path to the audio clip file.

        Returns:
            None
        """
        audio_clip = mp.AudioFileClip(path)

        if audio_clip.duration > duration:
            speed_factor = audio_clip.duration / duration

            new_audio = audio_clip.fx(
                mp.vfx.speedx, speed_factor, final_duration=duration
            )

            new_audio.write_audiofile(path, codec="libmp3lame")

        audio_clip.close()
Formatting 2024-02-23 13:12:48 +01:00			`from abc import abstractmethod`
Formatting & improving imports 2024-02-23 09:50:43 +01:00			`from typing import TypedDict`

Support for upgraded moviepy 2024-03-02 15:19:30 +01:00			`import moviepy as mp`
:rocket: Maaany things 2024-02-15 14:11:16 +01:00			`import whisper_timestamped as wt`
			`from torch.cuda import is_available`

Some stuff 2024-02-13 14:15:27 +01:00			`from ..BaseEngine import BaseEngine`

Formatting 2024-02-15 17:54:13 +01:00
:rocket: Maaany things 2024-02-15 14:11:16 +01:00			`class Word(TypedDict):`
			`start: str`
			`end: str`
			`text: str`
Some stuff 2024-02-13 14:15:27 +01:00

Formatting 2024-02-15 17:54:13 +01:00			`class BaseTTSEngine(BaseEngine):`
Some stuff 2024-02-13 14:15:27 +01:00			`@abstractmethod`
:coffin: Remove unused functions 2024-04-21 21:51:05 +02:00			`def synthesize(self, text: str, path: str) -> float:`
:rocket: 2024-02-14 17:49:51 +01:00			`pass`
Formatting 2024-02-20 14:47:54 +01:00
fix(GenerationContext.py): fix typo in variable name powerfulllmengine to powerfulllmengine for better readability feat(GenerationContext.py): add setup_dir method to create a directory for output files with a timestamp feat(GenerationContext.py): call setup_dir method before generating script and synthesizing audio to ensure output directory exists feat(prompts/fix_captions.yaml): add a new prompt file to provide instructions for fixing captions fix(BaseTTSEngine.py): add force_duration method to adjust audio clip duration if it exceeds a specified duration feat(CoquiTTSEngine.py): add options for forcing duration and specifying duration in the UI feat(utils/prompting.py): add get_prompt function to load prompt files from a specified location fix(gradio_ui.py): set equal_height=True for engine_rows to ensure consistent height for engine options 2024-02-15 12:27:13 +01:00			`def force_duration(self, duration: float, path: str):`
:rocket: Maaany things 2024-02-15 14:11:16 +01:00			`"""`
			`Forces the audio clip at the given path to have the specified duration.`

			`Args:`
			`duration (float): The desired duration in seconds.`
			`path (str): The path to the audio clip file.`

			`Returns:`
			`None`
			`"""`
fix(GenerationContext.py): fix typo in variable name powerfulllmengine to powerfulllmengine for better readability feat(GenerationContext.py): add setup_dir method to create a directory for output files with a timestamp feat(GenerationContext.py): call setup_dir method before generating script and synthesizing audio to ensure output directory exists feat(prompts/fix_captions.yaml): add a new prompt file to provide instructions for fixing captions fix(BaseTTSEngine.py): add force_duration method to adjust audio clip duration if it exceeds a specified duration feat(CoquiTTSEngine.py): add options for forcing duration and specifying duration in the UI feat(utils/prompting.py): add get_prompt function to load prompt files from a specified location fix(gradio_ui.py): set equal_height=True for engine_rows to ensure consistent height for engine options 2024-02-15 12:27:13 +01:00			`audio_clip = mp.AudioFileClip(path)`
Formatting 2024-02-15 17:54:13 +01:00
fix(GenerationContext.py): fix typo in variable name powerfulllmengine to powerfulllmengine for better readability feat(GenerationContext.py): add setup_dir method to create a directory for output files with a timestamp feat(GenerationContext.py): call setup_dir method before generating script and synthesizing audio to ensure output directory exists feat(prompts/fix_captions.yaml): add a new prompt file to provide instructions for fixing captions fix(BaseTTSEngine.py): add force_duration method to adjust audio clip duration if it exceeds a specified duration feat(CoquiTTSEngine.py): add options for forcing duration and specifying duration in the UI feat(utils/prompting.py): add get_prompt function to load prompt files from a specified location fix(gradio_ui.py): set equal_height=True for engine_rows to ensure consistent height for engine options 2024-02-15 12:27:13 +01:00			`if audio_clip.duration > duration:`
			`speed_factor = audio_clip.duration / duration`
Formatting 2024-02-15 17:54:13 +01:00
			`new_audio = audio_clip.fx(`
			`mp.vfx.speedx, speed_factor, final_duration=duration`
			`)`

			`new_audio.write_audiofile(path, codec="libmp3lame")`

			`audio_clip.close()`