Projects

Current projects

ELEXIS logo ELEXIS - European Lexicographic Infrastructure
2018-2022
Linking lexical resources is a challenge that requires impractical amounts of human efforts, but is still not easy to solve automatically. We develop methods that will make the linking problem viable for large resources, by using state-of-the-art semantic and natural language processing techniques. Our approaches are integrated as a single automated linking tool called NAISC /naʃk/ which facilitates any linking problem for any type of input, from tabular to ontological.
Kurdish Language Processing Toolkit Kurdish Langauge Processing Toolkit
2020-
KLPT-Kurdish Language Processing Toolkit is a natural language processing (NLP) toolkit in Python for the Kurdish language. It comes with core modules for language processing, such as preprocessing, stemming, transliteration and tokenizion, spell-checking and morphological analysis of various dialects of Kurdish.

Past projects

Kurdish XeLaTeX logo XƎLATEX for Kurdish
2019-2020
This project started as a weekend project. Could you believe it was not possible to write a proper document in Kurdish in any of TeX engines without using the Persian or Arabic packages? Well, I spent quite a while adding Kurdish to Polyglossia and Babel. Find a bunch of sample documents here.
Kurdish-BLARK logo Kurdish Basic Language Resource Kit (BLARK)
2017-2020
The Kurdish BLARK (Basic Language Resource Kit) aims at providing resources for the Kurdish language. As a project leader, I was involved in the development of resources and technology for Kurdish language processing, including electronic lexicons based on Ontolex-Lemon, corpus of oral material and parallel corpora. A few particular examples are the Kurdish Folkloric Lyrics Corpus and Kurdish parallel corpora.
KLPP logo Kurdish Language Processing Project (KLPP)
2013
As a bachelor's student, I was a volunteer in this project mostly by providing linguistic support regarding the Sorani dialect of Kurdish for implementing fundamental tools for Kurdish processing and resources such as KurdNet–the Kurdish WordNet. For the latter, I translated the Princeton WordNet synsets and definitions into Sorani. I believe this was a remarkable project as a pioneer in Kurdish language processing from 2009.