François Chollet

François Chollet

30 Photos and videos

Tweets

Ehud Reiter retweeted

François Chollet

@fchollet

Jun 6

Code volume does not represent productivity.

Jen Zhu

@jenzhuscott

Jun 5

Massive output uptick due to agentic AI. Complete flat adoption.

712

65,612

Ehud Reiter

Ehud Reiter @EhudReiter

Jun 8

New blog: I am worried by NLP research culture NLG and NLP are mostly much better in 2026 than when I got my PhD in 1990. Unfortunately research culture has gotten *worse” in this period, which really worries me as I retire. ehudreiter.com/2026/06/08/nl…

I am worried by NLP research culture

In most ways NLG and NLP are much better in 2026 than when I got my PhD in 1990. Unfortunately research culture has gotten *worse” in this period, which really worries me as I retire. We have…

ehudreiter.com

7,345

Ehud Reiter

Ehud Reiter @EhudReiter

Jun 4

I felt honoured and was very happy to get an Outstanding Contribution in NLG award from @siggen_acl !

1,356

Ehud Reiter

Ehud Reiter @EhudReiter

Jun 3

Took lots of noce pictures at my Retroeval retirement symposium. One of my favourites was me with my first and last PhD students, Sandra Williams and Yujun Wang!

545

Ehud Reiter

Ehud Reiter @EhudReiter

May 29

I'm looking forward to my retirement workshop, which starts on Monday 1 June!! Will be great to catch up with former students and colleagues, and also discuss NLG evaluation. retroeval.github.io/

Symposium on Natural Language Generation Evaluations

RetroEval 2026 Aberdeen, United Kingdom, 1-2 June, 2026

retroeval.github.io

227

Ehud Reiter

Ehud Reiter @EhudReiter

May 27

I remember seeing very dubious advice from OpenAI a few years ago on evaluation. So I was happy to see quite sensible recent advice from Anthropic on evaluation anthropic.com/engineering/de…

Demystifying evals for AI agents

anthropic.com

384

Ehud Reiter

Ehud Reiter @EhudReiter

May 26

Really interesting scoping review that points out numerous flaws in LLM-as-Judge evaluation in healthcare, including minimal human oversight, absent bias testing, model monoculture, ignore implicit eval components, no check for consistency over time (etc) arxiv.org/abs/2604.25933

A Scoping Review of LLM-as-a-Judge in Healthcare and the MedJUDGE Framework

As large language models (LLMs) increasingly generate and process clinical text, scalable evaluation has become critical. LLM-as-a-Judge (LaaJ), which uses LLMs to evaluate model outputs, offers a...

arxiv.org

355

Ehud Reiter

Ehud Reiter @EhudReiter

May 26

Someone asked me what were the highlights of my career, I responded with a list of papers which I was proud of. I did not mention grants, awards, jobs, etc. I know some people are proudest of their grants (etc), but for me it was always scientific outputs.

1,219

Ehud Reiter

Ehud Reiter @EhudReiter

May 25

I wrote paper on "NLG Evaluation: Past, Present, Future" for Retroeval. Eval has changed enornously over my career! In future, I expect more on stuff relevant to real-world usage, including impact, qualitative studies, safety in worst/adversarial case arxiv.org/abs/2605.23715

NLG Evaluation: Past, Present, Future

Natural Language Generation (NLG) evaluation has changed dramatically since 1990, and will continue to evolve in the future. In 1990, when NLG had close ties to linguistics, there was very little...

arxiv.org

2,047

Ehud Reiter

Ehud Reiter @EhudReiter

May 20

New blog: Software engineering of prompts Creating complex prompts for LLMs faces similar software engineering challenges as conventional software (requirements, design, testing, maintenance). We need to understand good software engineering for prompts. ehudreiter.com/2026/05/20/so…

Software engineering of prompts

When we create complex prompts for LLMs, we face similar software engineering challenges as conventional software development (requirements, design, implementation and debugging, testing, maintenan…

ehudreiter.com

4,097

Ehud Reiter

Ehud Reiter @EhudReiter

May 19

Congrats to my student Jawwad Baig for passing his PhD viva! Topic was “Data-to-Text NLG Feedback for Safer Driving”. Jawwad did his PhD part-time (ie, evenings and weekends while he worked fulltime) and remote (lives in England), which is very tough, but he still completed

582

INLG 2026

Ehud Reiter retweeted

INLG 2026 @inlgmeeting

May 17

The Call for Papers for #INLG2026 is out! 🗓️ Submit by July 15 (AoE) 💍 ARR commit by August 5 🆕 Squibs welcomed (raising an issue without needing to solve it) 🆕 Non-archival track for WIP 📍Utrecht, NL — Oct 17–21, just before EMNLP 2026.inlgmeeting.org/calls.h… #NLProc #INLG

723

Ehud Reiter

Ehud Reiter @EhudReiter

May 15

I think we need meaningful penalties on fraudulent academic papers. Glad to see Arxiv is taking action!

Thomas G. Dietterich @tdietterich

May 14

Replying to @tdietterich

The penalty is a 1-year ban from arXiv followed by the requirement that subsequent arXiv submissions must first be accepted at a reputable peer-reviewed venue. 4/

2,359

Ehud Reiter

Ehud Reiter @EhudReiter

May 13

Have now resigned as ARR (meta-)reviewer. I will continue to do some reviewing after retirement, but not ARR. I dont think mega-conf are the right way to present important research findings, and ARR reviewing is not enjoyable, eg I have no control over what I am asked to review

1,141

Ehud Reiter

Ehud Reiter @EhudReiter

May 10

Asked about students cheating in CS using AI. Said I was not concerned about cheating distorting marks, but was very concerned that it demotivated students from learning. I gave assess which AI cannot do, failure rate skyrocketed compared to prev year ehudreiter.com/2026/05/05/ai…

AI and CS Teaching

I am often asked how AI will impact Computer Science teaching. The biggest challenge is adapting what we teach so that it is relevant to a world where AI assistants are heavily used in software dev…

ehudreiter.com

323

Ehud Reiter

Ehud Reiter @EhudReiter

May 9

Visiting the old Roman temple (now a museum) beneath Bloomberg's London office, with my wife Ann. Very impressive!

238

Tech At Bloomberg

Ehud Reiter retweeted

Tech At Bloomberg @TechAtBloomberg

May 6

Our CTO #DataScience Speaker Series welcomes Prof. Ehud Reiter (@EhudReiter) of @aberdeenuni to our London office today to talk with our #AI researchers about designing protocols for high quality human evaluation of generated texts bloom.bg/4cOpT1A #NLProc #LLMs

ALT Ehud Reiter is a Professor of Computing Science at the University of Aberdeen.

297

Ehud Reiter

Ehud Reiter @EhudReiter

May 5

New blog: AI and CS Teaching How will AI impact CS teaching? Biggest challenge is adapting what we teach to a world where AI assistants are heavily used. We should also use AI tutors. Least important is making assessments more resistant to AI cheating ehudreiter.com/2026/05/05/ai…

AI and CS Teaching

ehudreiter.com

408

Computational Linguistics Journal

Ehud Reiter retweeted

Computational Linguistics Journal @CompLingJournal

May 2

What % of the NLP papers measure their impact in the real world? This paper proposes an "impact evaluation" of NLP models or systems for real-world usage, changing the research culture of NLP to focus more on real-world impact and less on SOTA-chasing: doi.org/10.1162/COLI.a.18

5,510

Ehud Reiter

Ehud Reiter @EhudReiter

May 1

Our final year UG students turn in their honours projects today. Supervising projects is the nicest part of teaching for me - always learning something, and great to supervise students 1-1. Really nice projects this year on evaluating LLM in real-world, and digital humanities.

230