
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.

Python 768 99 Updated Nov 20, 2024
TypeScript 20 3 Updated Aug 10, 2024
Jupyter Notebook 7,776 547 Updated Jun 16, 2024

Repository for a research project on audio watermarking

Python 3 Updated Nov 15, 2024

A lightweight end-to-end text-to-speech model

Python 91 13 Updated Sep 17, 2024

Download a YouTube video (or supply your own) and generate dual-language subtitles with OpenAI Whisper and a translation API (GPT)

Jupyter Notebook 91 17 Updated Jun 4, 2024

This repo is a VITS fine-tuning pipeline for fast speaker-adaptation TTS and many-to-many voice conversion

Python 4,753 718 Updated Jul 3, 2024

[NO LONGER MAINTAINED] Command-line utility for auto-generating subtitles for any video file

Python 4,151 1,646 Updated Mar 22, 2024

Learn Python with Colaboratory (colab.research.google.com)

Jupyter Notebook 4 2 Updated Apr 5, 2024

ManaTTS is the largest open Persian speech dataset with 86+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.

Jupyter Notebook 8 Updated Sep 13, 2024

Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant,…

Python 346 27 Updated Nov 12, 2024

A program to dub non-English media with modern AI speech synthesis, diarization, and voice cloning!

Python 284 26 Updated Nov 21, 2024

Noise removal/reduction for audio files in Python. Denoising is done with wavelets, and thresholding uses the VisuShrink technique

Python 174 19 Updated Apr 30, 2023

Simple, fast unsupervised word aligner

C++ 738 159 Updated Jul 19, 2022

A neural word aligner based on multilingual BERT

Python 328 47 Updated Mar 10, 2022

A Telegram Bot that automatically reacts to posts in Telegram Channels, groups, and private messages, developed as a server-less application.✨

JavaScript 54 111 Updated Oct 23, 2024

Fine-Tuning your VITS model using a pre-trained model

Python 551 86 Updated May 2, 2023

MARS5 speech model (TTS) from CAMB.AI

Jupyter Notebook 2,533 208 Updated Aug 1, 2024

Modern spell checking library - accurate, fast, multi-language

C++ 613 102 Updated Aug 29, 2024

Create different voices for the Espeak synthesizer. New version restored and improved, but the documentation has not yet been restored.

AutoIt 5 Updated Jun 3, 2022

Talking Head (3D): A JavaScript class for real-time lip-sync using Ready Player Me full-body 3D avatars.

JavaScript 350 108 Updated Nov 18, 2024

OCR engine for all the languages

Python 751 131 Updated Nov 21, 2024

Data and code for grapheme-to-phoneme transducers in lots of languages

HTML 130 19 Updated Apr 5, 2024

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

Python 365 42 Updated Sep 13, 2024

Charsiu: A neural phonetic aligner.

Jupyter Notebook 280 35 Updated Sep 19, 2022

A right-click (context menu) extension plugin for the open-source project jsmind.js

JavaScript 31 19 Updated Dec 13, 2019

Everything about note management. All in Zotero.

TypeScript 5,582 188 Updated Nov 19, 2024

[CVPR 2022--Oral] Restormer: Efficient Transformer for High-Resolution Image Restoration. SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.

Python 1,813 241 Updated Aug 16, 2024

Document Image Enhancement with GANs - TPAMI journal

Python 183 32 Updated Mar 24, 2023

Extension for tqdm progressbar in Telegram

Python 27 6 Updated Sep 11, 2019