BNLP is a natural language processing toolkit for Bengali Language.
-
Updated
Nov 23, 2024 - Jupyter Notebook
BNLP is a natural language processing toolkit for Bengali Language.
Automatic Context Sensitive Spelling Correction for Bangla Text Using Bert and Levenstein Distance
Bangla NLP toolkit: Bangla text normalization, punctuation generation and augmentation for Bangla NLP tasks. This project is available on PyPi as well.
BNLTK(Bangla Natural Language Processing Toolkit): a python package for NLP in Bangla
Concept Level Sentiment Analysis of Bengali Text: A Relative Study to Feature Level Evaluation
This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 16 - November 20, 2020.
The official GitHub repository of the Bangla Visual Question Answering (VQA) system ChitroJera
Bangla PDF to text converter that works on Windows, macOS, and Linux without any extra downloads or configurations.
Bengali Natural Language Processing(BengaliNLP)
Bangla clickbait detection system
This project focuses on sentiment analysis of Bangla text data, aiming to classify comments as positive or negative. The model leverages deep learning techniques and advanced natural language processing (NLP) methods tailored for Bangla, including custom text preprocessing and AI models.
This is the official repository containing all codes used to generate the results reported in the paper titled "Social Bias in Large Language Models For Bangla: An Empirical Study on Gender and Religious Bias"
This is the official repository containing all codes used to generate the results reported in the paper titled "An Empirical Study on the Characteristics of Bias upon Context Length Variation for Bangla" accepted in Findings of the Association for Computational Linguistics: ACL 2024
Bengali Hate Speech Detection using Bag of Words, Binary Bag of Words and TF_IDF. Accuracy for Bag of Words 56.67% , Binary Bag of Words 57.22% , TF-IDF 56.89%.
বাংলায় ন্যাচারাল ল্যাঙ্গুয়েজ প্রসেসিং এর উপর লেখা সিরিজের জন্য কোড রিপোজিটরি
Bengali/Bangla Fake Review Detection Dataset
This is the official repository of the paper titled "BnPC: A Gold Standard Corpus for Paraphrase Detection in Bangla, and its Evaluation", accepted in The 17th Workshop on Building and Using Comparable Corpora (BUCC 2024) co-located with LREC-COLING 2024. It contains the codes and the dataset.
A bangla chatbot using bidirectional lstm
✍️ Bengali Alphabet (বাংলা বর্ণমালা)
Nirmol is an open-source dataset and API for detecting Bangla slang words. Detect offensive/bad/slang words in Bangla/Bengali/Banglish sentences. A helpful API and dataset for developers and researchers.
Add a description, image, and links to the bangla-nlp topic page so that developers can more easily learn about it.
To associate your repository with the bangla-nlp topic, visit your repo's landing page and select "manage topics."