LLM-Lab

Arabic Language Technologies - QCRI

easgari[at]hbku[dot]edu.qa

LLM-Lab is a research group led by Ehsaneddin Asgari at the Arabic Language Technologies group, Qatar Computing Research Institute (QCRI). We develop multimodal, multilingual language technologies with a focus on Arabic and low-resource languages.


Research Focus

Our work is organized around two pillars:

1. Multimodal Document Understanding

  • Optical character recognition (OCR) for diverse scripts and document types
  • Document layout analysis and reasoning over text, tables, and figures
  • Retrieval-augmented generation (RAG) — retrieval, reasoning, and generation components
  • Domain-specific RAG systems (e.g., Quranic studies, legal texts, historical archives)

2. Multimodal Multidialectal Arabic Language Technologies

  • Arabic natural language processing (NLP) across dialects and registers
  • Language resources for the dialects and cultures of the MENA region
  • Language technologies for digital humanities
  • NLP for digital health and well-being

Other Areas of Interest

  • AI for multimedia art and creative applications
  • Bioinformatics and computational biology
  • NLP for MENA region languages beyond Arabic

Collaborations & Opportunities

We collaborate with a global network of researchers and welcome undergraduate and graduate students in Computer Science and Linguistics for research internships (remote or on-site).

Interested? Send your CV to the email address above to discuss potential opportunities.

news

Mar 29, 2026 SilkRoadNLP workshop held at EACL 2026 in Rabat, Morocco — our initiative on NLP for the Iranian family of languages and cultures along the historical Silk Road.
Mar 27, 2026 We are honored to receive the Best Resource Paper Award at EACL 2026 in Rabat, Morocco!
Mar 24, 2026 Arabic NLP School held at EACL 2026 in Rabat, Morocco — a full-day school on foundational and advanced topics in Arabic language technologies, with over 120 participants selected from 300+ applications.
Jan 03, 2026 Three papers accepted at EACL 2026 — two in the main conference and one in Findings. Congratulations to all co-authors!
