Skip to content

orhanf/zemberekMorphTR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 

Repository files navigation

zemberekMorphTR

This repo provides a way for morhological segmentation of Turkish corpora by using Zemberek.

The segmentations are done as a preprocessing step for Neural Machine Translation, Eng-Tr and Tr-Eng.

There is also a jar provided for direct usage ZemberekJar.jar, only works with Java>=7. Usage of the jar is as follows: java -jar zemberekJar -i <input_file> -o <output_file> -d -s -d : disambiguate -s : short suffix list

About

Wrapper for zemberek morphology tool

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages