A Music Player that can show audio waveform
-
Updated
Jul 4, 2018 - JavaScript
A Music Player that can show audio waveform
Database of "Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model", ECCV 2020
This repository lists publicly available datasets for visual-audio, speech and audio, and biomedical signal related tasks.
哈工大视听觉信号处理实验作业Visual-auditory signal processing lab assignments
EchoSight is a tool that helps visually impaired individuals by audibly describing images taken with a Raspberry Pi Camera or inputted via image path or URL across different operating systems.
Code for "Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model", ECCV 2020
Source code for "Visually aligned sound generation via sound-producing motion parsing" (Published at Neurocomputing)
[WIP] Yet another multimodal video-audio feature extractor based on recent research
Add a description, image, and links to the visual-audio topic page so that developers can more easily learn about it.
To associate your repository with the visual-audio topic, visit your repo's landing page and select "manage topics."