Attention #NLProc researchers, the EACL 2027 website is officially LIVE: 2027.eacl.org/! ๐
๐ฌ๐ท Join us in Athens, Greece (Mar 9-13, 2027) at #EACL2027
๐ ARR submission deadline: Aug 6, 2026. Open to all areas of CL/NLP related fields. Stay tuned for the detailed CfP!
โข Adaptive surveys LLMs for measuring extreme psychological constructs
โข Planning-based view evaluation of LLM web agents
โข BET: detecting behavioral & emotional themes in narratives
โข Lilo: multi-agent system for childrenโs digital communication
[1/7] Why do frontier LLMs make factual errors?
Is it because they never learned the factโฆ
or because they canโt access knowledge they already encoded?
In our new paper, we show:
The bottleneck is not encoding; it is recall. ๐งต๐
Paper: arxiv.org/abs/2602.14080
Many thanks to @_galyo@bd_eyal@zorikgekhman@eran_ofek59358@GoogleResearch
A strange trend I've noticed at #ACL2025 is that people are hesitant to reach out to papers/"academic products" authors.
This is unfortunate for both parties! A simple email can save a lot of time to the sender, but is also one of my favorite kind of email as the receiver!
Tomorrow morning (9AM๐ ) I'll be giving a keynote talk at LAW (linguistic annotation workshop) at #ACL2025 on annotations in the era of LLMs - see you there!!๐๐
The Alternative Annotator Test (alt-test) is a new statistical procedure proposed in our ACL 2025 paper! ๐ฆ๐น๐ฆ๐น @DrorRotem@roireichart
The goal? To help justify using LLMs over humans. If the LLM passes the test, its annotations can be trusted ๐
arxiv.org/abs/2501.10970
Everyone uses LLMs to annotate data or evaluate models in their research.
But how can we convince others (readers, collaborators, reviewers!!!) that LLMs are reliable? ๐ค
Hereโs a simple (and low-effort) solution: show the LLM is a *comparable alternative annotator* โ
Preferences drive modern LLM research and development: from model alignment to evaluation.
But how well do we understand them?
Excited to share our new preprint:
Multi-domain Explainability of Preferences
arxiv.org/abs/2505.20088@roireichart@LiatEinDor
๐งต๐
1/11
Join us at the University of Haifa for the HiAI Conference, a dynamic event at the forefront of Artificial Intelligence.
Immerse yourself in groundbreaking discoveries, gain exclusive insights into the latest advancements, and navigate the future of AI with leading experts.
Keynote speakers:
. Hod Lipson, Mechanical Engineering, Columbia University, USA
Prof. Mor Naaman, Information, Cornell Tech, USA
Prof. Tanya Berger-Wolf, Computer Science & Engineering, Ohio State University, USA
This is your chance to uncover revolutionary research, forge invaluable connections, and become part of the AI innovation wave.
Conference dates:ย May 25-27, 2025 ๐ทย Information and registration: lnkd.in/dKqAtP6p
In our new preprint, @roireichart ,@DrorRotem and I propose a new statistical procedure:
The Alternative Annotator Test (alt-test)
The goal? To help researchers justify using LLMs over humansโif the LLM passes, its annotations can be confidently trusted๐
arxiv.org/abs/2501.10970
In our new preprint, @roireichart ,@DrorRotem and I propose a new statistical procedure:
The Alternative Annotator Test (alt-test)
The goal? To help researchers justify using LLMs over humansโif the LLM passes, its annotations can be confidently trusted๐
arxiv.org/abs/2501.10970
A few months ago I told you that I'm working on something awesome and I need some datasets...well here is the first output of that effort and I sincerely think it's amazing ๐คฉ check it out! @NitCal@roireichart#llm-as-a-judge, #nlpevaluation
Do you use LLM-as-a-judge or LLM annotations in your research?
Thereโs a growing trend of replacing human annotators with LLMs in researchโthey're fast, cheap, and require less effort.
But can we trust them?๐ค
Well, we need a rigorous procedure to answer this.
๐จNew preprint๐
Do you think LLMs could win a Nobel Prize one day? ๐ค
Can NLP predict heroin addiction outcomes, uncover suicide risks, or simulate brain activity? ๐ง
@lotemi_peled, @roireichart, and I wrote a blog post about NLP for Human-Centric Sciences ๐ค๐ฉโ๐ฌ
๐๐๐
nitaytech.github.io/blog/202โฆ