About the current version
Please note that KLPT is under development and some functionality will only appear in future versions. You can follow the progress of each task in the Projects section. The current version includes the following tasks:
| Modules | Tasks | Sorani (ckb) | Kurmanji (kmr) |
|---|---|---|---|
| preprocess | normalization | ✓ (v0.1.0) | ✓ (v0.1.0) |
| | standardization | ✓ (v0.1.0) | ✓ (v0.1.0) |
| | unification of numerals | ✓ (v0.1.0) | ✓ (v0.1.0) |
| tokenize | word tokenization | ✓ (v0.1.0) | ✓ (v0.1.0) |
| | MWE tokenization | ✓ (v0.1.0) | ✓ (v0.1.0) |
| | sentence tokenization | ✓ (v0.1.0) | ✓ (v0.1.0) |
| transliterate | Arabic to Latin | ✓ (v0.1.0) | ✓ (v0.1.0) |
| | Latin to Arabic | ✓ (v0.1.0) | ✓ (v0.1.0) |
| | Detection of u/w and î/y | ✓ (v0.1.0) | ✓ (v0.1.0) |
| | Detection of Bizroke (i) | ✗ | ✗ |
| stem | morphological analysis | ✓ (v0.1.0) | ✗ |
| | morphological generation | ✓ (v0.1.0) | ✗ |
| | stemming | ✗ | ✗ |
| | lemmatization | ✗ | ✗ |
| | spell error detection and correction | ✓ (v0.1.0) | ✗ |