Persian NLP Toolkit
-
Updated
Jul 16, 2024 - Python
Persian NLP Toolkit
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
A Python wrapper of the Yandex Mystem 3.1 morphological analyzer (http://api.yandex.ru/mystem). The original tool is shipped as a binary and this library makes it easy to integrate it in Python projects. Let us know in the issues if you would like to be involved into the developments or maintenance of this project. If you have any fix or suggest…
A python module for English lemmatization and inflection.
HuSpaCy: industrial-strength Hungarian natural language processing
Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
Qutuf (قُطُوْف): An Arabic Morphological analyzer and Part-Of-Speech tagger as an Expert System.
🍊 📄 Text Mining add-on for Orange3
[LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweebank-NER dataset
[GSOC] Greek language support for spacy.io python NLP software
📂 Additional lookup tables and data resources for spaCy
Lemmatization for Turkish Language
A lemmatizer for German language text
Python morphological analyzer for Turkish language. Partial port of ZemberekNLP.
Лемматизатор для русскоязычных текстов
Babel Street Analytics Client Library for Python
A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, Arabic, etc.)
The uploaded codes help to classify emails into spam and non spam classes by using Support Vector Machine classifier.
A language-independent post-correction app for POS-tagging and lemmatization
Add a description, image, and links to the lemmatization topic page so that developers can more easily learn about it.
To associate your repository with the lemmatization topic, visit your repo's landing page and select "manage topics."