Esam Ghaleb

EsamProfile2022_edt.JPG

In brief: I develop computational models of multimodal language and behaviours: how speech, text, gesture, sign, facial expressions, and whole-body movement jointly encode meaning in interaction. Previously, I worked at the University of Amsterdam and Maastricht University on linguistic–gestural alignment and explainable multimodal modelling of human behaviour.

I am research staff in the Multimodal Language Department at the Max Planck Institute for Psycholinguistics (Nijmegen), and I lead the department’s Multimodal Modelling Cluster. My work builds and studies machine-learning methods for the segmentation, coding, and representation of visual communicative signals from motion-capture and video data, and uses learned representations as testbeds for theories of multimodal language across languages and interactional settings. I also study how large language and multimodal language models integrate and generate multimodal behaviour, and develop generative models of gesture for virtual agents. Recent work includes an NWO XS-funded project on grounded, object- and interaction-aware gesture generation in context.

Selected Publications

  1. The Visual Iconicity Challenge: Evaluating Vision-Language Models on Sign Language Form-Meaning Mapping
    Onur Keleş, Aslı Özyürek, Gerardo Ortega, and 2 more authors
    In Pre-print at arXiv Oct 2025
  2. SemGes: Semantics-aware Co-Speech Gesture Generation using Semantic Coherence and Relevance Learning
    Lanmiao Liu, Esam Ghaleb, Aslı Özyürek, and 1 more author
    In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Oct 2025
  3. Llms instead of human judges? a large scale empirical study across 20 nlp evaluation tasks
    Anna Bavaresco, Raffaella Bernardi, Leonardo Bertolazzi, and 8 more authors
    In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL) Jul 2025
  4. I see what you mean: Co-Speech Gestures for Reference Resolution in Multimodal Dialogue
    Esam Ghaleb, Bulat Khaertdinov, Aslı Özyürek, and 1 more author
    In Proceedings of the of the 63rd Conference of the Association for Computational Linguistics (ACL Findings) Jul 2025
  5. Analysing Cross-Speaker Convergence in Face-to-Face Dialogue through the Lens of Automatically Detected Shared Linguistic Constructions
    Esam Ghaleb, Marlou Rasenberg, Wim Pouw, and 4 more authors
    In Proceedings of the Annual Meeting of the Cognitive Science Society Jul 2024