-
VNPT Technology
- Ha Noi, Viet Nam
-
22:06
(UTC +07:00) - https://www.youtube.com/channel/UCzzuLS8DoFSCuJi-0eFUENw
- @Toan_Nguyen_99
Stars
A Trimap-Free Portrait Matting Solution in Real Time [AAAI 2022]
Code for ACCV 2020 "Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses"
A command-line tool for the conversion of 3D model assets on the FBX file format to the glTF file format.
A simple MFCC extractor using C++ STL and C++11
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
kaldi-asr/kaldi is the official location of the Kaldi project.
speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription
High-Resolution 3D Human Digitization from A Single Image.
A header-only C++ library for L-BFGS and L-BFGS-B algorithms
Deezer source separation library including pretrained models.
Code repository of all OpenGL chapters from the book and its accompanying website https://learnopengl.com
PyTorch Face Recognizer based on 'VGGFace2: A dataset for recognising faces across pose and age'
Code for the paper "End-to-end Learning for 3D Facial Animation from Speech"
Code and data for our paper "High-Fidelity 3D Digital Human Creation from RGB-D Selfies".
python version of deformation transfer
Demo programs for the Talking Head Anime from a Single Image 2: More Expressive project.
PU-GAN: a Point Cloud Upsampling Adversarial Network, ICCV, 2019
TensorRT-7 Network Lib 包括常用目标检测、关键点检测、人脸检测、OCR等 可训练自己数据
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
Visualizer for neural network, deep learning and machine learning models
This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character mesh.
The Munich Open-Source Large-Scale Multimedia Feature Extractor
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
练手项目, 有简单的SmartMooc刷课Python脚本, etc
Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP) | 语音情感识别
Command-line program to download videos from YouTube.com and other video sites