Download YouTube video (or supply your own) and generate dual languange subtitles with OpenAI Whisper and translation API (GPT) 下载 YouTube 视频（或提供您自己的视频）并使用 Whisper 和翻译API (GPT) 生成双语字幕

Jupyter Notebook 91 17 Updated Jun 4, 2024

Plachtaa / VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Python 4,753 718 Updated Jul 3, 2024

agermanidis / autosub

[NO LONGER MAINTAINED] Command-line utility for auto-generating subtitles for any video file

Python 4,151 1,646 Updated Mar 22, 2024

soiqualang / colab

Learn Python with Colaboratory (colab.research.google.com)

Jupyter Notebook 4 2 Updated Apr 5, 2024

MahtaFetrat / ManaTTS-Persian-Speech-Dataset

ManaTTS is the largest open Persian speech dataset with 86+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.

Jupyter Notebook 8 Updated Sep 13, 2024

lukaszliniewicz / Pandrator

Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant,…

Python 346 27 Updated Nov 12, 2024

FlorianEagox / WeeaBlind

A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!

Python 284 26 Updated Nov 21, 2024

ap-atul / Audio-Denoising

Noise removal/ reducer from the audio file in python. De-noising is done using Wavelets and thresholding is done by VISU Shrink thresholding technique

Python 174 19 Updated Apr 30, 2023

clab / fast_align

Simple, fast unsupervised word aligner

C++ 738 159 Updated Jul 19, 2022

neulab / awesome-align

A neural word aligner based on multilingual BERT

Python 328 47 Updated Mar 10, 2022

Malith-Rukshan / Auto-Reaction-Bot

A Telegram Bot that automatically reacts to posts in Telegram Channels, groups, and private messages, developed as a server-less application.✨

JavaScript 54 111 Updated Oct 23, 2024

SayaSS / vits-finetuning

Fine-Tuning your VITS model using a pre-trained model

Python 551 86 Updated May 2, 2023

Camb-ai / MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

Jupyter Notebook 2,533 208 Updated Aug 1, 2024

bakwc / JamSpell

Modern spell checking library - accurate, fast, multi-language

C++ 613 102 Updated Aug 29, 2024

rmcpantoja / Espeak-NG-Voice-creator

Create different voices for the Espeak synthesizer. New version restored and improved, but the documentation has not yet been restored.

AutoIt 5 Updated Jun 3, 2022

met4citizen / TalkingHead

Talking Head (3D): A JavaScript class for real-time lip-sync using Ready Player Me full-body 3D avatars.

JavaScript 350 108 Updated Nov 18, 2024

mittagessen / kraken

OCR engine for all the languages

Python 751 131 Updated Nov 21, 2024

uiuc-sst / g2ps

Data and code for grapheme-to-phoneme transducers in lots of languages

HTML 130 19 Updated Apr 5, 2024

KdaiP / StableTTS

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

Python 365 42 Updated Sep 13, 2024

lingjzhu / charsiu

Charsiu: A neural phonetic aligner.

Jupyter Notebook 280 35 Updated Sep 19, 2022

allensunjian / jsmind.menu.js

开源项目jsmind.js的右键扩展插件

JavaScript 31 19 Updated Dec 13, 2019

windingwind / zotero-better-notes

Everything about note management. All in Zotero.

TypeScript 5,582 188 Updated Nov 19, 2024

swz30 / Restormer

[CVPR 2022--Oral] Restormer: Efficient Transformer for High-Resolution Image Restoration. SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.

Python 1,813 241 Updated Aug 16, 2024

dali92002 / DE-GAN

Document Image Enhancement with GANs - TPAMI journal

Python 183 32 Updated Mar 24, 2023

datagym-ru / tg_tqdm

Forked from ermakovpetr/tg_tqdm

Extension for tqdm progressbar in Telegram

Python 27 6 Updated Sep 11, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

karim23657

Achievements