CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
-
Updated
Nov 8, 2024 - Python
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
The dataset of Speech Recognition
A python library for voice activity detection (VAD) for speech/non-speech segmentation.
Phoneme segmentation using pre-trained speech models
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
Pretrained models, tools and resources for Persian ASR
The Voxseg implementation in PyTorch. Voxseg is a python library for voice activity detection (VAD) for speech/non-speech segmentation.
A python model to detect and segment coughs, forked from coughvid's repo
Deep Learning Utilities for Audio Segmentation
Matlab scripts for segmenting mono-speaker speech recording sessions and tagging speaker turns.
Segmentation of audio for a speech pipeline
This repository explores speech processing techniques like noise cancellation and speech segmentation through Python code.(Speech recognition soon)
Simple speech segmentation in Matlab
Data and analysis scripts of an experiment on how speaking style variation impacts listeners' use of statistical regularities to segment continuous speech
Add a description, image, and links to the speech-segmentation topic page so that developers can more easily learn about it.
To associate your repository with the speech-segmentation topic, visit your repo's landing page and select "manage topics."