Publications

Publications by categories in reversed chronological order.

2026

  1. MorphBPE: A Morpho-Aware Tokenizer Bridging Linguistic Complexity for Efficient LLM Training Across Morphologies
    Ehsaneddin Asgari, Yassine El Kheir, Mohammad Ali Sadraei Javaheri, and 1 more author
    In Findings of the Association for Computational Linguistics (ACL), 2026
  2. BloomBench: A Bilingual Multimodal Benchmark for Cognitively Informed Evaluation of Vision-Language Models
    Mohammad Mahdi Abootorabi, Omid Ghahroodi, Marzia Nouri, and 3 more authors
    In Findings of the Association for Computational Linguistics (ACL), 2026
  3. HarfoSokhan: A Comprehensive Parallel Dataset for Transitions between Persian Colloquial and Formal Variations
    Jahad Sarvestani,  Hamid, Vida Ramezanian, and 6 more authors
    In European Chapter of the Association for Computational Linguistics (EACL) (Main), 2026
  4. Detecting Subtle Biases: An Ethical Lens on Underexplored Areas in AI Language Models Biases
    Shayan Bali, Farhan Farsi, Mohammad Hosseini, and 2 more authors
    In European Chapter of the Association for Computational Linguistics (EACL) (Main), 2026
  5. MEENA (PersianMMU): Multimodal-Multilingual Educational Exams for N-level Assessment
    Omid Ghahroodi, Arshia Hemmat, Marzia Nouri, and 8 more authors
    In European Chapter of the Association for Computational Linguistics (EACL) (Findings), 2026
  6. Eye-Q: A Multilingual Benchmark for Visual Word Puzzle Solving and Image-to-Phrase Reasoning
    Ali Najar, Alireza Mirrokni, Arshia Izadyari, and 5 more authors
    arXiv preprint, 2026

2025

  1. Chunlan Ma, Ayyoob ImaniGooghari, Haotian Ye, and 3 more authors
    In NAACL, 2025
  2. ACL
    Ask in Any Modality: A Comprehensive Survey on Multimodal Retrieval-Augmented Generation
    Mohammad Mahdi Abootorabi, Amirhosein Zobeiri, Mahdi Dehghani, and 6 more authors
    In Meeting of the Association for Computational Linguistics (ACL), 2025
  3. Fanar: An Arabic-Centric Multimodal Generative AI Platform
    Fanar Team
    arXiv preprint arXiv:2501.13944, 2025
  4. ChemLM: Domain adaptable language modeling of chemical compounds identifies potent pathoblockers for Pseudomonas aeruginosa
    Georgios Kallergis, Ehsaneddin Asgari, Behrooz Azarkhalili, and 3 more authors
    Communications Chemistry, 2025
  5. CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval
    Mohammad-Mahdi Abootorabi, and Ehsaneddin Asgari
    In The European Conference on Information Retrieval (ECIR), 2025
  6. ACL
    Emo3D: Metric and Benchmarking Dataset for 3D Facial Expression Generation from Emotion Description
    Mahshid Dehghani, Amirahmad Shafiee, Ali Shafiei, and 6 more authors
    In Conference of the North American Chapter of the Association for Computational Linguistics (NAACL) Findings, 2025
  7. Context-Aware Extraction of Quranic References: A Hybrid Language Model- and Rule-Based Approach
    Alireza Sahebi, Mohammadmahdi Hemmatyar, and Ehsaneddin Asgari
    In Muslims in ML Workshop at NeurIPS, 2025
  8. GeoPolRAG: Retrieval Augmented Generation for Contextually Grounded QA on Complex Geopolitical Matters
    Anas Madkoor, Hamza Aljaji, Talha Shahid, and 5 more authors
    In Muslims in ML Workshop at NeurIPS, 2025
  9. Generative AI and its Benchmarking for Quranic Question Answering
    Hamza Aljaji, Rawan Mohamed, Roaa Ibrahim, and 4 more authors
    In Muslims in ML Workshop at NeurIPS, 2025
  10. I Am Aligned, But With Whom? MENA Values Benchmark for Evaluating Cultural Alignment and Multilingual Bias in LLMs
    Pardis Sadat Zahraei, and Ehsaneddin Asgari
    arXiv preprint, 2025
  11. PahGen: Generating Ancient Pahlavi Text via Grammar-guided Zero-shot Translation
    Farhan Farsi, Parnian Fazel, Farzaneh Goshtasb, and 4 more authors
    In LoResMT (Workshop on Low-Resource Machine Translation), 2025
  12. ParsiPy: NLP Toolkit for Historical Persian Texts in Python
    Farhan Farsi, Parnian Fazel, Sepand Haghighi, and 5 more authors
    In Workshop on Ancient Language Processing, 2025
  13. ImageEval 2025: The First Arabic Image Captioning Shared Task
    Ahlam Bashiti, Alaa Aljabari, Hadi Hamoud, and 2 more authors
    In Arabic NLP Conference (Shared Tasks), 2025

2024

  1. Benno Stein Nailia Mirzakhmedova, Johannes Kiesel, Milad Alshomary, and 10 more authors
    In Proceedings of the Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), 2024
  2. Omid Ghahroodi, Marzia Nouri, Mohammad Vali Sanian, and 5 more authors
    In COLM, 2024
  3. Kaixin Hu, Fernando Meyer, Zhi-Luo Deng, and 4 more authors
    Briefings in Bioinformatics, 2024
  4. Mohammad Mahdi Abootorabi, Nona Ghazizadeh, Seyed Arshan Dalili, and 3 more authors
    SemEval, 2024
  5. Farhan Farsi, Sadra Sabouri, Kian Kashfipour, and 3 more authors
    2024
  6. TuringQ: Benchmarking AI Comprehension in Theory of Computation
    P. Zahraee, and E. Asgari
    In Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
  7. SuperPos-Prompt: Enhancing Soft Prompt Tuning of Language Models with Superposition of Multi Token Embeddings
    M.A. Sadraei, E. Asgari, A. McHardy, and 1 more author
    In Efficient Natural Language & Speech Processing at NeurIPS, 2024
  8. Transformers for Bridging Persian Dialects: Transliteration Model for Tajiki and Iranian Scripts
    MohammadAli SadraeiJavaheri, Ehsaneddin Asgari, and Hamid Reza Rabiee
    In Proceedings of the Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), 2024
  9. Patent
    Morphologically-Aware Tokenizer
    E. Asgari, and Y. Elkheir
    2024
    U.S. Provisional Patent Application No. 63/679,403
  10. AIMA at SemEval-2024 Task 3: Simple Yet Powerful Emotion Cause Pair Analysis
    Alireza Ghahramali Kure, Mahshid Dehghani, Mohammad Mahdi Abootorabi, and 3 more authors
    In Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval), 2024
  11. M3Face: A Unified Multi-Modal Multilingual Framework for Human Face Generation and Editing
    Mohammadreza Mofayezi, Reza Alipour, Mohammad Ali Kakavand, and 1 more author
    arXiv preprint, 2024

2023

  1. Zeinab Taghavi, Parsa Haghighi Naeini, Mohammad Ali Sadraei Javaheri, and 4 more authors
    In SemEval, 2023
  2. Aryan Sadeghi, Reza Alipour, Kamyar Taeb, and 3 more authors
    In SemEval, 2023
  3. Reihaneh Zohrabi, Mostafa Masumi, Omid Ghahroodi, and 4 more authors
    In Proceedings of the Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), 2023
  4. Omid Ghahroodi, Seyed Arshan Dalili, Sahel Mesforoush, and 1 more author
    In SemEval, 2023
  5. NLE
    XPASC: Measuring Generalization in Weak Supervision
    L März, E Asgari, F Braune, and 2 more authors
    Natural Language Engineering Journal, 2023
  6. KhabarChin: Automatic Detection of Important News in the Persian Language
    Hamed Hemati, Arash Lagzian, Moein Salimi Sartakhti, and 2 more authors
    arXiv preprint, 2023
  7. Sina at SemEval-2023 Task 4: A Class-Token Attention-based Model for Human Value Detection
    Omid Ghahroodi, Mohammad Ali Sadraei Javaheri, Doratossadat Dastgheib, and 2 more authors
    In Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval), 2023
  8. A platform for deep learning on (meta) genomic sequences
    Philipp C Münch, R Mreches, XY To, and 12 more authors
    Preprint: Europe PMC, 2023

2022

  1. Sven-Kevin Hotop, Susanne Reimering, Aditya Shekhar, and 13 more authors
    Emerging microbes & infections, 2022
  2. Doratossadat Dastgheib, and Ehsaneddin Asgari
    In Proceedings of TextGraphs-16: Graph-based Methods for Natural Language Processing, 2022
  3. *Dickson Andrew M, *Asgari Ehsaneddin, McHardy Alice C, and 1 more author
    Bioinformatics Journal, 2022
  4. ACL
    Hengam: An Adversarially Trained Transformer for Persian Temporal Tagging
    S Mirzababaei, AH Kargaran, H Schütze, and 1 more author
    In Proc. of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (AACL), 2022
  5. Docalog: Multi-document Dialogue System using Transformer-based Span Retrieval
    SH Alavian, A Satvaty, S Sabouri, and 2 more authors
    In Proc. of the Second DialDoc Workshop on Document-grounded Dialogue and Conversational Question Answering, 2022

2021

  1. Akash Bahai, Ehsaneddin Asgari, Mohammad RK Mofrad, and 2 more authors
    Bioinformatics, 2021
  2. Esmaeil Nourani, Ehsaneddin Asgari, Alice C McHardy, and 1 more author
    IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2021
  3. Luisa März, Ehsaneddin Asgari, Fabienne Braune, and 2 more authors
    Empirical Methods in Natural Language Processing (EMNLP), 2021
  4. Patent
    Method, Computer Program and Apparatus for Relating Text Units
    Ehsaneddin Asgari
    2021
    European Patent 20206180.0.6 - 1231, January 2021

2020

  1. Ariane Khaledi, Aaron Weimann, Monika Schniederjans, and 12 more authors
    EMBO molecular medicine, 2020
  2. Ehsaneddin Asgari, Christoph Ringlstetter, and Hinrich Schütze
    In SemEval, 2020
  3. Ehsaneddin Asgari, Fabienne Braune, Benjamin Roth, and 2 more authors
    Proceedings of The 12th Language Resources and Evaluation Conference (LREC), 2020
  4. Ehsaneddin Asgari, Masoud Jalili Sabet, Philipp Dufter, and 2 more authors
    arXiv preprint arXiv:2012.11657, 2020
  5. Story Fragment Stitching: The Case of the Story of Moses
    M Aldawsari, E Asgari, and MA Finlayson
    In 1st Workshop on Artificial Intelligence for Narratives (AI4N) at the International Conference on Artificial Intelligence (IJCAI), 2020
  6. Data-driven Variable-length Segmentation of Biological Sequences: Applications in Metagenomics and Proteomics
    E Asgari, PC Münch, TR Lesker, and 2 more authors
    In NeurIPS - Computational Biology Workshop, 2020
  7. Patent
    Method, Computer Program and Apparatus for Detecting a Semantic Change of a Word between Domains
    Ehsaneddin Asgari
    2020
    European Patent 20190280.6 - 1231, November 2020

2019

  1. Naihui Zhou, Yuxiang Jiang, Timothy R Bergquist, and 147 more authors
    Genome biology, 2019
  2. Ehsaneddin Asgari, Alice C McHardy, and Mohammad RK Mofrad
    Scientific reports, 2019
  3. Ehsaneddin Asgari, Nina Poerner, Alice C McHardy, and 1 more author
    BioRxiv, 2019
  4. Ehsaneddin Asgari, Philipp C Münch, Till R Lesker, and 2 more authors
    Bioinformatics, 2019
  5. Ehsaneddin Asgari
    2019
  6. Ehsaneddin Asgari, and Mohammad RK Mofrad
    In Academic Press, 2019

2018

  1. Ehsaneddin Asgari, Kiavash Garakani, Alice Carolyn McHardy, and 1 more author
    Bioinformatics, 2018
  2. Zeinab Jahed, Darya Fadavi, Uyen T Vu, and 3 more authors
    Biophysical journal, 2018

2017

  1. ACL
    Ehsaneddin Asgari, and Hinrich Schütze
    In Association for Computational Linguistics, 2017
  2. Heike Adel, Ehsaneddin Asgari, and Hinrich Schütze
    In Springer International Publishing, 2017
  3. Ehsaneddin Asgari, and Ali Sanaei
    In https://ssrn.com/abstract=3029031, 2017

2016

  1. ACL
    Ehsaneddin Asgari, and Mohammad R.K. Mofrad
    In Association for Computational Linguistics, 2016
  2. Hinrich Schuetze, Heike Adel, and Ehsaneddin Asgari
    arXiv preprint arXiv:1610.00479, 2016
  3. ACL
    Ehsaneddin Asgaria, Soroush Nasiriany, and Mohammad Mofrad
    In Association for Computational Linguistics, 2016

2015

  1. Ehsaneddin Asgari, and Mohammad RK Mofrad
    PloS one, 2015
  2. A New Approach for Scalable Analysis of Microbial Communities
    Ehsaneddin Asgari, Kiavash Garakani, and Mohammad RK Mofrad
    arXiv preprint, 2015

2014

  1. Mahmood Neshati, Djoerd Hiemstra, Ehsaneddin Asgari, and 1 more author
    World wide web, 2014

2013

  1. ACL
    Ehsaneddin Asgari, and Jean-Cédric Chappelier
    In Association for Computational Linguistics, 2013
  2. Ehsaneddin Asgari, Marzyeh Ghassemi, and Mark Alan Finlayson
    In NeurIPS Workshop on Topic Models, 2013
  3. Mahmood Neshati, Ehsaneddin Asgari, Djoerd Hiemstra, and 1 more author
    In Springer Berlin Heidelberg, 2013