Assistant Professor @mcgillu, Core Academic Member @Mila_Quebec, Canada CIFAR AI Chair @CIFAR_News | interested in multilingual NLP | Disciple of Jesus

Joined September 2017
170 Photos and videos
Hallelujah! Iโ€™m excited to share that Iโ€™ve been selected as a 2025 AI2050 Early Career Fellow by @Schmidtsciences This yearโ€™s fellows represent 42 institutions across eight countries, working to ensure AI benefits humankind. Learn more at: lnkd.in/eZA5FHci
We're excited to welcome 28 new AI2050 Fellows! This 4th cohort of researchers are pursuing projects that include building AI scientists, designing trustworthy models, and improving biological and medical research, among other areas. buff.ly/riGLyyj
43
16
306
17,677
David Ifeoluwa Adelani ๐Ÿ‡ณ๐Ÿ‡ฌ retweeted
How do we ensure safety in a world with millions of interacting agents, built and deployed by many different actors? Learn more about our new research fund in partnership with @GoogleDeepMind @Googleorg @coop_ai @ARIA_research: schmidtsciences.org/multi-agโ€ฆ
1
6
21
5,706
David Ifeoluwa Adelani ๐Ÿ‡ณ๐Ÿ‡ฌ retweeted
After the enthusiasm of the shared task last year, we are running another shared task to create a community-made, culturally relevant multilingual benchmark! The deadline to contribute is August 1 AoE. See more details below.
3
17
30
6,652
David Ifeoluwa Adelani ๐Ÿ‡ณ๐Ÿ‡ฌ retweeted
I donโ€™t like self promotion but I really recommend this review paper on synthetic speech evaluation, mostly written by @erica_cooper (I contributed partly as well) jstage.jst.go.jp/article/astโ€ฆ And really hope people understand there is just no good evaluation metric like human ears

5
13
57
3,758
David Ifeoluwa Adelani ๐Ÿ‡ณ๐Ÿ‡ฌ retweeted
Virginia Ceccatelli, Yejin Jeon, David Ifeoluwa Adelani, "SpeechJBB: Probing Safety Alignment and Comprehension in Large Audio Language Models under Code-Switched Speech," arxiv.org/abs/2606.06037
1
3
553
David Ifeoluwa Adelani ๐Ÿ‡ณ๐Ÿ‡ฌ retweeted
Good voice agents need speech that sounds human, keeps up with a real conversation, and works in your language, on affordable hardware. Today @boson_ai and the @lmsysorg SGLang team release Higgs Audio v3, an open 4B text-to-speech model that hits all three.
3
3
11
1,301
David Ifeoluwa Adelani ๐Ÿ‡ณ๐Ÿ‡ฌ retweeted
Meet Gemma 4 12B! A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license. Bridging the gap between edge efficiency and advanced reasoning. Here is whatโ€™s new with Gemma 4 12B: ๐Ÿ‘‡
404
1,789
12,364
3,174,474
David Ifeoluwa Adelani ๐Ÿ‡ณ๐Ÿ‡ฌ retweeted
David Adelani talking about Multicultural LLMs for low resources languages at our @maps_cvpr workshop @CVPR in room 113.
2
10
527
David Ifeoluwa Adelani ๐Ÿ‡ณ๐Ÿ‡ฌ retweeted
โฐ Just 5 days to go! The Multimodal Alignment for a Pluralistic Society workshop @CVPR is happening June 3, 2026. Check out the full schedule below ๐Ÿ‘‡ ๐Ÿ”— sites.google.com/view/maps-cโ€ฆ #CVPR2026
5
6
5,830
David Ifeoluwa Adelani ๐Ÿ‡ณ๐Ÿ‡ฌ retweeted
This is happening tomorrow morning at CVPR (Room 113)! Do stop-by to gather insights from an amazing line-up of speakers!
โฐ Just 5 days to go! The Multimodal Alignment for a Pluralistic Society workshop @CVPR is happening June 3, 2026. Check out the full schedule below ๐Ÿ‘‡ ๐Ÿ”— sites.google.com/view/maps-cโ€ฆ #CVPR2026
3
2
503
David Ifeoluwa Adelani ๐Ÿ‡ณ๐Ÿ‡ฌ retweeted
Releasing a new image editing benchmark -- TECCI: Tricky Edits of Collected and Curated Images. Paper: arxiv.org/abs/2606.01213 Project website: google-deepmind.github.io/teโ€ฆ
1
5
13
1,888
David Ifeoluwa Adelani ๐Ÿ‡ณ๐Ÿ‡ฌ retweeted
If you missed out on contributing to Global PIQA and want to get involved, we will be announcing a new project soon. Fill in this form to be the first to hear about it: forms.gle/Rw1pL5mr4DZgpPbK6
1
1
2
336
Updated GlobalPIQA, now covers 141 languages, great participatory research work at global scale. Paper: arxiv.org/abs/2510.24081 Thank you to all the contributors especially @tylerachang and @linguist_cat for leading this. Another shared task is coming this year.
We are releasing an expanded version of Global PIQA! It now covers 141 language varieties and includes parallel and non-parallel splits. We are also releasing an updated preprint.
13
631
David Ifeoluwa Adelani ๐Ÿ‡ณ๐Ÿ‡ฌ retweeted
๐Ÿ“ข Call for Papers: 6th Multilingual Representation Learning Workshop at EMNLP in Budapest, Hungary! Join us and submit your works relating to multilingual NLP Speakers to be announced, so stay tuned! ๐Ÿ‘€ More info in the CFP: ๐Ÿ”— sigtyp.github.io/ws2026-mrl.โ€ฆ
1
8
14
909
David Ifeoluwa Adelani ๐Ÿ‡ณ๐Ÿ‡ฌ retweeted
Fresh on arXiv! ๐Ÿ˜ Our new paper reformulates tokenisation as a linear program (LP), which we solve to get SOTA tokenisers! As a bonus, this LP allows us to know how close to optimal any tokeniser is! Check it out! ๐Ÿ‘‡
In our new paper, we reinterpret tokenisation as a problem in high-dimensional geometry (100M dims to be precise!), which we can solve efficiently to get a globally near-optimal tokeniser! Our method consistently improves language models over BPE. See ๐Ÿงตfor details.
2
9
111
15,353
David Ifeoluwa Adelani ๐Ÿ‡ณ๐Ÿ‡ฌ retweeted
Some new results I found surprising that Iโ€™m tweeting for Chris (who isnt on here). With enough compute, the best data filter for LMs (on DCLM) might be no filter. Why? Large models can tolerate a surprising amount of nominally 'low quality' data, and can sometimes even benefit.
33
154
1,230
221,958
David Ifeoluwa Adelani ๐Ÿ‡ณ๐Ÿ‡ฌ retweeted
๐Ÿ”ฅThe VoiceMOS Challenge 2026 kicks off today! ๐Ÿ”ฅ Please register using the link below: forms.gle/L6YdkUf1PJdSSwLU7 We will send you the challenge information afterwards! Friday, July 31: Evaluation dataset release. Friday, August 7: Predicted scores submission deadline.
5
19
1,467
David Ifeoluwa Adelani ๐Ÿ‡ณ๐Ÿ‡ฌ retweeted
๐Ÿค– AI text detectors are widely deployed in education and integrity workflows, but what are they actually tracking? We report a surprising finding: text from base models is overwhelmingly judged as human by GPTZero and Pangram. ๐Ÿ‘‡ (1/6)
3
13
62
12,837
David Ifeoluwa Adelani ๐Ÿ‡ณ๐Ÿ‡ฌ retweeted
#NLProc Slides for our LREC tutorial are now available online: Multilingual and Multimodal LLMs in the Wild: Building for Low-Resource Languages Slides: mm-llms-in-the-wild.github.iโ€ฆ Reading list: mm-llms-in-the-wild.github.iโ€ฆ Includes resources on multilingual, multimodal, and low-resource language technologies. W/ @shammur_absar Enamul Haque #LREC2026 #NLProc #MultimodalAI #LLMs
7
21
1,824
David Ifeoluwa Adelani ๐Ÿ‡ณ๐Ÿ‡ฌ retweeted
๐ŸŒ Introducing #GenAI4World Workshop Generative AI for the World: The First Workshop on Globalizing Tasks, Evaluations, and Systems at #COLM2026 ๐Ÿ“… October 9, 2026 ๐Ÿ“ San Francisco, Co-located with @COLM_conf ๐Ÿ”— sites.google.com/view/genai4โ€ฆ Stay tuned for the updates!
1
5
11
6,867
David Ifeoluwa Adelani ๐Ÿ‡ณ๐Ÿ‡ฌ retweeted
๐Ÿšจ ๐—–๐—ฎ๐—น๐—น ๐—ณ๐—ผ๐—ฟ ๐—ฃ๐—ฎ๐—ฝ๐—ฒ๐—ฟ๐˜€ โ€“ #GenAI4World Workshop @COLM_conf #COLM2026 We invite papers on multilingual, multimodal, and culturally grounded LM ๐Ÿ› ๏ธ Two non-archival tracks: 1๏ธโƒฃ Workshop Track 2๏ธโƒฃ Conference Track ๐Ÿ”— Submit: openreview.net/group?id=colmโ€ฆ Details below โฌ‡๏ธ
1
5
11
1,225