text-augmentation

Here are 28 public repositories matching this topic...

automation-bijoy-to-avro

ANSI and Unicode are encoding standards used around the world by writers and everyday users. ANSI is the older encoding, used in operating systems such as Windows 95/98 and earlier systems. Unicode is the newer encoding, used in present-day operating systems.

  • Updated Jun 25, 2022
  • Jupyter Notebook
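To make the ANSI versus Unicode distinction above concrete, here is a minimal sketch. It assumes Windows-1252 (Python's `cp1252` codec) as a stand-in for "ANSI" and a made-up sample string; it does not reproduce this repository's Bijoy-to-Avro mapping, which is not described on this page.

```python
# Illustrative only: Windows-1252 is assumed here as the "ANSI" code page.
text = "café"

ansi_bytes = text.encode("cp1252")  # one byte per character in this code page
utf8_bytes = text.encode("utf-8")   # 'é' takes two bytes in UTF-8

# Converting legacy bytes to Unicode text means decoding with the right code page.
recovered = ansi_bytes.decode("cp1252")

# Decoding the same bytes with the wrong codec fails instead of round-tripping.
try:
    ansi_bytes.decode("utf-8")
    wrong_codec_ok = True
except UnicodeDecodeError:
    wrong_codec_ok = False
```

The same four characters occupy different byte sequences under the two encodings, which is why a converter has to know the source encoding before it can produce Unicode text.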

This repo offers a Python script that uses the NLPAug library and round-trip translation (RTT) to augment text datasets. It processes TXT files in the "data/" folder, translating the text and creating augmented versions. The augmented data supports NLP tasks such as chatbot training and text classification. The repo also includes an overview of techniques, applications, and implementation.

  • Updated Oct 7, 2024
  • Python
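The folder-based workflow described above could look roughly like the sketch below. This is not the repository's code: `augment_folder` and `swap_words` are hypothetical names, and a simple random word swap stands in for the NLPAug and translation steps so that the example runs with the standard library alone.

```python
import random
import tempfile
from pathlib import Path


def swap_words(text, rng):
    """Stand-in augmenter: swap two randomly chosen word positions.

    Whitespace is normalized to single spaces as a side effect.
    """
    words = text.split()
    if len(words) < 2:
        return text
    i, j = rng.sample(range(len(words)), 2)
    words[i], words[j] = words[j], words[i]
    return " ".join(words)


def augment_folder(data_dir, out_dir, augment, copies=2, seed=0):
    """Write `copies` augmented versions of every .txt file in data_dir."""
    rng = random.Random(seed)
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for src in sorted(Path(data_dir).glob("*.txt")):
        text = src.read_text(encoding="utf-8")
        for n in range(copies):
            dst = out_dir / f"{src.stem}_aug{n}.txt"
            dst.write_text(augment(text, rng), encoding="utf-8")
            written.append(dst)
    return written


if __name__ == "__main__":
    # Demo on a temporary folder with one made-up sample file.
    with tempfile.TemporaryDirectory() as tmp:
        data = Path(tmp) / "data"
        data.mkdir()
        (data / "sample.txt").write_text("the quick brown fox jumps", encoding="utf-8")
        for path in augment_folder(data, Path(tmp) / "augmented", swap_words):
            print(path.name, "->", path.read_text(encoding="utf-8"))
```

Because the augmenter is passed in as a function, a real NLPAug augmenter or a translation round trip could be swapped in without changing the file-handling loop.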
