Current projects

George Mason NLP
Oct. 2022-
I am very lucky to be working at GMNLP on various topics in language technology in low-resource setups. Looking forward to a rewarding experience in research, supervision and teaching.
Kurdish Language Processing Toolkit
KLPT (Kurdish Language Processing Toolkit) is a natural language processing (NLP) toolkit in Python for the Kurdish language. It comes with core modules for language processing, such as preprocessing, stemming, transliteration, tokenization, spell-checking and morphological analysis of various dialects of Kurdish.

Past projects

ELEXIS - European Lexicographic Infrastructure

Linking lexical resources is a challenge that requires an impractical amount of human effort, yet is still not easy to solve automatically. We develop methods that make the linking problem viable for large resources by using state-of-the-art semantic and natural language processing techniques. Our approaches are integrated into a single automated linking tool called NAISC /naʃk/, which handles linking problems for any type of input, from tabular to ontological.

Thanks to ELEXIS and all the colleagues who accomplished such an amazing project!

ELEXIS last meeting, Florence 2022

NexusLinguarum
I was a member of WG1, which focuses on cross-lingual data interlinking, access and retrieval in Linguistic Linked Open Data (LLOD).
XƎLATEX for Kurdish
This started as a weekend project. Could you believe it was not possible to write a proper document in Kurdish in any of the TeX engines without using the Persian or Arabic packages? Well, I spent quite a while adding Kurdish to Polyglossia and Babel. Find a bunch of sample documents here.
Kurdish Basic Language Resource Kit (BLARK)
The Kurdish BLARK (Basic Language Resource Kit) aims to provide resources for the Kurdish language. As project leader, I was involved in the development of resources and technology for Kurdish language processing, including electronic lexicons based on Ontolex-Lemon, a corpus of oral material and parallel corpora. Two particular examples are the Kurdish Folkloric Lyrics Corpus and the Kurdish parallel corpora.
Kurdish Language Processing Project (KLPP)
As a bachelor's student, I volunteered in this project, mostly providing linguistic support on the Sorani dialect of Kurdish for the implementation of fundamental Kurdish processing tools and resources such as KurdNet, the Kurdish WordNet. For the latter, I translated the Princeton WordNet synsets and definitions into Sorani. I believe this was a remarkable project, a pioneer in Kurdish language processing since 2009.