Underthesea - Vietnamese NLP Toolkit
-
Updated
Oct 27, 2024 - Python
Underthesea - Vietnamese NLP Toolkit
PhoGPT: Generative Pre-training for Vietnamese (2023)
PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)
A Vietnamese natural language processing toolkit (NAACL 2018)
Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for the most common Vietnamese NLP tasks.
Vietnamese NLP Toolkit for Node
PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)
A Vietnamese-English Neural Machine Translation System (INTERSPEECH 2022)
Vietnamese question answering system with BERT
VietASR - Vietnamese Automatic Speech Recognition
A Large-scale Vietnamese News Text Classification Corpus
BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022)
A Fast and Accurate Vietnamese Word Segmenter (LREC 2018)
Electra pre-trained model using Vietnamese corpus
Vietnamese Automatic Speech Recognition
COVID-19 Named Entity Recognition for Vietnamese (NAACL 2021)
Vietnamese sensitive words (including teencode) was created by ML algorithm
A Python wrapper for VnCoreNLP using a bidirectional communication channel.
Add a description, image, and links to the vietnamese-nlp topic page so that developers can more easily learn about it.
To associate your repository with the vietnamese-nlp topic, visit your repo's landing page and select "manage topics."