Publications

Publications in reversed chronological order in two categories:

  1. Peer-reviewed Publications
  2. Preprints

* indicates equal contribution.


Peer-reviewed Publications
2025
PARME: Parallel Corpora for Low-Resourced Middle Eastern Languages
Sina Ahmadi / Rico Sennrich / Erfan Karami / Ako Marani / Parviz Fekrazad / Gholamreza Akbarzadeh Baghban / Hanah Hadi / Semko Heidari / Mahîr Dogan / Pedram Asadi / Dashne Bashir / Mohammad Amin Ghodrati / Kourosh Amini / Zeynab Ashourinezhad / Mana Baladi / Farshid Ezzati / Alireza Ghasemifar / Daryoush Hosseinpour / Behrooz Abbaszadeh / Amin Hassanpour / Bahaddin Jalal Hamaamin / Saya Kamal Hama / Ardeshir Mousavi / Sarko Nazir Hussein / Isar Nejadgholi / Mehmet Ölmez / Horam Osmanpour / Rashid Roshan Ramezani / Aryan Sediq Aziz / Ali Salehi / Mohammadreza Yadegari / Kewyar Yadegari / Sedighe Zamani Roodsari
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)
PARME: Parallel Corpora for Low-Resourced Middle Eastern Languages
Loading...
ConLoan: A Contrastive Multilingual Dataset for Evaluating Loanwords
Sina Ahmadi / Micha David Hess / Elena Álvarez-Mellado / Alessia Battisti / Cui Ding / Anne Göhring / Yingqiang Gao / Zifan Jiang / Andrianos Michail / Peshmerge Morad / Joel Niklaus / Maria Christina Panagiotopoulou / Stefano Perrella / Juri Opitz / Anastassia Shaitarova / Rico Sennrich
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)
Loading...
Automatic Speech Recognition for Low-Resourced Middle Eastern Languages
Razhan Hameed / Sina Ahmadi / Hanah Hadi / Rico Sennrich
Proceedings of Interspeech 2025
Loading...
Literary Translations and Synthetic Data for Machine Translation of Low-resourced Middle Eastern Languages
Sina Ahmadi / Razhan Hameed / Rico Sennrich
Proceedings of the 22st International Conference on Spoken Language Translation (IWSLT 2025)
Literary Translations and Synthetic Data for Machine Translation of Low-resourced Middle Eastern Languages
Loading...
Dia-Lingle: A Gamified Interface for Dialectal Data Collection
Jiugeng Sun / Rita Sevastjanova / Sina Ahmadi / Rico Sennrich / Mennatallah El-Assady
Proceedings of the 63nd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations - ACL 2025)
Dia-Lingle: A Gamified Interface for Dialectal Data Collection
Loading...
SwiLTra-Bench: The Swiss Legal Translation Benchmark
Jiugeng Sun / Joel Niklaus / Jakob Merane / Luka Nenadic / Sina Ahmadi / Yingqiang Gao / Cyrill A. H. Chevalley / Claude Humbel / Christophe Gösken / Lorenzo Tanzi / Thomas Lüthi / Stefan Palombo / Spencer Poff / Boling Yang / Nan Wu / Matthew Guillod / Robin Mamié / Daniel Brunner / Julio Pereyra / Niko Grupen
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)
Conversational Lexicography: Querying Lexicographic Data on Knowledge Graphs with SPARQL through Natural Language
Kilian Sennrich / Sina Ahmadi
Proceedings of the 5th Conference on Language, Data and Knowledge (LDK 2025)
Conversational Lexicography: Querying Lexicographic Data on Knowledge Graphs with SPARQL through Natural Language
Loading...
2024
CODET: A Benchmark for Contrastive Dialectal Evaluation of Machine Translation
Md Mahfuz Ibn Alam / Sina Ahmadi / Antonios Anastasopoulos
17th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024)
CODET: A Benchmark for Contrastive Dialectal Evaluation of Machine Translation
Loading...
Language and Speech Technology for Central Kurdish Varieties
Sina Ahmadi / Daban Q Jaff / Md Mahfuz Ibn Alam / Antonios Anastasopoulos
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Language and Speech Technology for Central Kurdish Varieties
Loading...
Part-of-Speech Tagging for Northern Kurdish
Peshmerge Morad / Sina Ahmadi / Lorenzo Gatti
Proceedings of the Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD 2024) @LREC-COLING-2024
Loading...
2023
Script Normalization for Unconventional Writing of Under-Resourced Languages in Bilingual Communities
Sina Ahmadi / Antonios Anastasopoulos
The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023)
Script Normalization for Unconventional Writing of Under-Resourced Languages in Bilingual Communities
Loading...
PALI: A Language Identification Benchmark for Perso-Arabic Scripts
Sina Ahmadi / Milind Agarwal / Antonios Anastasopoulos
Proceedings of the 10th Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial at EACL 2023)
PALI: A Language Identification Benchmark for Perso-Arabic Scripts
Loading...
When Ontolex Meets Wikibase: Remodeling Use Cases
David Lindemann / Sina Ahmadi / Fahad Khan / Francesco Mambrini
Wikidata'23: Wikidata workshop at ISWC 2023
Loading...
Approaches to Corpus Creation for Low-Resource Language Technology: the Case of Southern Kurdish and Laki
Sina Ahmadi / Zahra Azin / Sara Belelli / Antonios Anastasopoulos
Proceedings of the second workshop on NLP applications to field linguistics at EACL 2023
Approaches to Corpus Creation for Low-Resource Language Technology: the Case of Southern Kurdish and Laki
Loading...
Revisiting and Amending Central Kurdish Data on UniMorph 4.0
Sina Ahmadi / Aso Mahmudi
The 20th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology at ACL 2023
Loading...
A Corpus-based Study of Endoclitic =îş in Kurdish
Sina Ahmadi / Antonios Anastasopoulos / Géraldine Walther
Book of abstracts of the the 56th Annual Meeting of the Societas Linguistica Europaea (SLE 2023)
Loading...
2022
Monolingual Alignment of Word Senses and Definitions in Lexicographical Resources
Sina Ahmadi
National University of Ireland Galway (Thesis)
Loading...
Cross-Lingual Link Discovery for Under-Resourced Languages
Michael Rosner / Sina Ahmadi / Elena-Simona Apostol / Julia Bosque-Gil / Christian Chiarcos / Milan Dojchinovski / Katerina Gkirtzou / Jorge Gracia / Dagmar Gromann / Chaya Liebeskind / Giedrė Valūnaitė Oleškevičienė̇ / Gilles Sérasset / Ciprian-Octavian Truică
The 13th International Conference on Language Resources and Evaluation (LREC 2022)
CoFiF Plus: A French Financial Narrative Summarisation Corpus
Nadhem Zmandar / Tobias Daudert / Sina Ahmadi / Mahmoud El-Haj / Paul Rayson
The 13th International Conference on Language Resources and Evaluation (LREC 2022)
Loading...
Towards an Integrative Approach for Making Sense Distinctions
John P. McCrae / Theodorus Fransen / Sina Ahmadi / Paul Buitelaar / Koustava Goswami
Frontiers in Artificial Intelligence
Towards an Integrative Approach for Making Sense Distinctions
Loading...
Leveraging Multilingual News Websites for Building a Kurdish Parallel Corpus
Sina Ahmadi / Hossein Hassani / Daban Q. Jaff
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP)
Leveraging Multilingual News Websites for Building a Kurdish Parallel Corpus
Loading...
2021
Convertir le Trésor de la Langue Française en Ontolex-Lemon : un zeste de données liées
Sina Ahmadi / Mathieu Constant / Karën Fort / Bruno Guillaume / John P. McCrae
LIFT 2021 : Journées scientifiques "Linguistique informatique, formelle & de terrain"
Loading...
NUIG at TIAD 2021: Cross-lingual Word Embeddings for Translation Inference
Sina Ahmadi / Atul Kr. Ojha / Shubhanker Banerjee / John P. McCrae
Proceedings of the Translation Inference Across Dictionaries Workshop (TIAD 2021)
NUIG at TIAD 2021: Cross-lingual Word Embeddings for Translation Inference
Loading...
The ELEXIS system for monolingual sense linking in dictionaries
John P. McCrae / Sina Ahmadi / Seung-bin Yim / Lenka Bajčetić
Proceedings of the Seventh Biennial Conference on Electronic Lexicography (eLex 2021)
Loading...
An Evaluation of Definition Paradigms in Lexicography for Word Sense Alignment
Sina Ahmadi / John P. McCrae
Proceedings of the Seventh Biennial Conference on Electronic Lexicography (eLex 2021)
Loading...
Word Sense Alignment as a Classification Problem
Sina Ahmadi / John P. McCrae
The 11th International Global Wordnet Conference (GWC2021)
Loading...
Creating an Electronic Lexicon for the Under-resourced Southern Varieties of Kurdish Language
Zahra Azin / Sina Ahmadi
Proceedings of the Seventh Biennial Conference on Electronic Lexicography (eLex 2021)
Loading...
On the Current State of Kurdish Language Processing
Sina Ahmadi
Proceedings of the 5th International Conference on Kurdish Linguistics Conference (ICKL-5)
Loading...
2020
A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment
Sina Ahmadi / John P. McCrae / Sanni Nimb / Fahad Khan / Monica Monachini / Bolette S. Pedersen / Thierry Declerck / Tanja Wissik / Andrea Bellandi / Irene Pisani / Thomas Troelsgård / Sussi Olsen / Simon Krek / Veronika Lipp / Tamás Váradi / László Simon / András Győrffy / Carole Tiberius / Tanneke Schoonheim / Yifat Ben Moshe / Maya Rudich / Raya Abu Ahmad / Dorielle Lonke / Kira Kovalenko / Margit Langemets / Jelena Kallas / Oksana Dereza / Theodorus Fransen / David Cillessen / David Lindemann / Mikel Alonso / Ana Salgado / José Luis Sancho / Rafael-J. Ureña-Ruiz / Kiril Simov / Petya Osenova / Zara Kancheva / Ivaylo Radev / Ranka Stanković / Andrej Perdih / Dejan Gabrovšek
The 12th International Conference on Language Resources and Evaluation (LREC 2020)
Loading...
Towards Automatic Linking of Lexicographic Data: the case of a historical and a modern Danish dictionary
Sina Ahmadi / Sanni Nimb / Thomas Troelsgård / John P. McCrae / Nicolai H. Sørensen
The XIX EURALEX International Congress
Loading...
Challenges of Word Sense Alignment: Portuguese Language Resources
Ana Salgado / Sina Ahmadi / Alberto Simões / John McCrae / Rute Costa
Proceedings of the 7th Workshop on Linked Data in Linguistics: Building tools and infrastructure (LREC 2020)
Loading...
Defying Wikidata: Validation of Terminological Relations in the Web of Data
Patricia Martín-Chozas / Sina Ahmadi / Elena Montiel-Ponsoda
The 12th International Conference on Language Resources and Evaluation (LREC 2020)
Loading...
KLPT - Kurdish Language Processing Toolkit
Sina Ahmadi
Proceedings of Second Workshop for NLP Open Source Software (NLP-OSS at EMNLP 2020)
Loading...
Building a Corpus for the Zaza–Gorani Language Family
Sina Ahmadi
Proceedings of the Seventh Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2020)
Loading...
A Tokenization System for the Kurdish Language
Sina Ahmadi
Proceedings of the Seventh Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2020)
Loading...
Towards Machine Translation for the Kurdish Language
Sina Ahmadi / Mariam Masoud
Proceedings of the 3rd Workshop on Technologies for MT of Low Resource Languages (LoResMT 2020 at AACL-IJCNLP)
Loading...
A Corpus of the Sorani Kurdish Folkloric Lyrics
Sina Ahmadi / Hossein Hassani / Kamaladdin Abedi
Proceedings of the 1st Joint Spoken Language Technologies for Under-resourced languages (SLTU) and Collaboration and Computing for Under-Resourced Languages (CCURL) Workshop (LREC 2020)
Loading...
2019
Creating a Multilingual Terminological Resource using Linked Data: the case of archaeological domain in the Italian language
Speranza Giulia / Carola Carlino / Sina Ahmadi
The Sixth Italian Conference on Computational Linguistics (CLiC-it 2019)
Loading...
The ELEXIS Interface for Interoperable Lexical Resources
John P. McCrae / Carole Tiberius / Anas Fahad Khan / Ilan Kernerman / Thierry Declerck / Simon Krek / Monica Monachini / Sina Ahmadi
Proceedings of the Sixth Biennial Conference on Electronic Lexicography (eLex 2019)
Loading...
CoFiF: A Corpus of Financial Reports in French Language
Sina Ahmadi / Tobias Daudert
Proceedings of the First Workshop on Financial Technology and Natural Language Processing (IJCAI 2019)
Loading...
NUIG at the FinSBD Task: Sentence Boundary Detection for Noisy Financial PDFs in English and French
Tobias Daudert / Sina Ahmadi
Proceedings of the First Workshop on Financial Technology and Natural Language Processing (IJCAI 2019)
Loading...
Inferring translation candidates for multilingual dictionary generation with multi-way neural machine translation
Mihael Arcan / Daniel Torregrosa / Sina Ahmadi / John P. McCrae
Proceedings of the Translation Inference Across Dictionaries Workshop (TIAD 2019)
Loading...
TIAD 2019 Shared Task: Leveraging knowledge graphs with neural machine translation for automatic multilingual dictionary generation
Mihael Arcan / Daniel Torregrosa / Sina Ahmadi / John P. McCrae
Shared Task on Translation Inference Across Dictionaries (LDK 2019)
Loading...
Lexical sense alignment using weighted bipartite b-matching
Sina Ahmadi / Mihael Arcan / John P. McCrae
Proceedings of the LDK 2019 Workshops
Lexical sense alignment using weighted bipartite b-matching
Loading...
Towards Electronic Lexicography for the Kurdish Language
Sina Ahmadi / Hossein Hassani / John P. McCrae
Proceedings of Sixth Biennial Conference on Electronic Lexicography (eLex 2019)
Loading...
A Rule-Based Kurdish Text Transliteration System
Sina Ahmadi
ACM Transactions on Asian and Low-resource Language Information Processing (TALLIP)
A Rule-Based Kurdish Text Transliteration System
Loading...
Developing a Fine-grained Corpus for a Less-resourced Language: the case of Kurdish
Roshna Abdulrahman / Hossein Hassani / Sina Ahmadi
Widening Natural Language Processing (ACL 2019)
Loading...
2018
On lexicographical networks
Sina Ahmadi / Mihael Arcan / John McCrae
In Workshop on eLexicography: Between Digital Humanities and Artificial Intelligence
Loading...
2017
Building a Lemmatizer and a Spell-checker for Sorani Kurdish
Shahin Salavati / Sina Ahmadi
Proceedings of the 8th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics
Loading...
Attention-based encoder-decoder networks for spelling and grammatical error correction
Sina Ahmadi
Master's thesis
Loading...
2014
Towards building Kurdnet, the Kurdish Wordnet
Purya Aliabadi / Sina Ahmadi / Shahin Salavati / Kyumars Sheykh Esmaili
Proceedings of the Seventh Global Wordnet Conference
Loading...
Preprints
2024
A Morphologically-Aware Dictionary-based Data Augmentation Technique for Machine Translation of Under-Represented Languages
Md Mahfuz Ibn Alam / Sina Ahmadi / Antonios Anastasopoulos
A Morphologically-Aware Dictionary-based Data Augmentation Technique for Machine Translation of Under-Represented Languages
Loading...
Transfer Learning for Low-Resource Sentiment Analysis
Razhan Hameed / Sina Ahmadi / Fatemeh Daneshfar
Loading...
2021
Hunspell for Sorani Kurdish Spell Checking and Morphological Analysis
Sina Ahmadi
arXiv preprint arXiv:2109.06374
Loading...
A Formal Description of Sorani Kurdish Morphology
Sina Ahmadi
arXiv preprint arXiv:2109.03942
Loading...
2020
Towards Finite-State Morphology of Kurdish
Sina Ahmadi / Hossein Hassani
arXiv preprint arXiv:2005.10652
Loading...
2018
Learning Noun Cases Using Sequential Neural Networks
Sina Ahmadi
arXiv preprint arXiv:1810.03996
Loading...