We are pleased to announce that, based on the rigorous review process used for ICDAR, your submission listed below has been accepted for presentation:
"JaWildText: A Benchmark for Vision-Language Models on Japanese Scene Text Understanding"
See you in Vienna!! 🇦🇹
#ICDAR2026
This is the first time preference learning actually respects how LLMs generate — step by step.
ADPO isn’t just a tweak to DPO, it’s a shift from outcome supervision to process-level alignment.
We propose Autoregressive Direct Preference Optimization (ADPO), a new formulation of DPO that explicitly incorporates autoregressive modeling.
ADPO revisits the foundations of DPO and leads to a more principled objective.
📚️arxiv.org/pdf/2602.09533
Our paper accepted to #ICML2026 🇰🇷(first author)!
This paper is on budget-aligned test-time scaling of LLMs.
It is my first ML conference paper!
Huge thanks to my co-authors ! @dai0NLP@chokkanorg
Preprint: arxiv.org/abs/2602.09574
More details soon!
We propose Autoregressive Direct Preference Optimization (ADPO), a new formulation of DPO that explicitly incorporates autoregressive modeling.
ADPO revisits the foundations of DPO and leads to a more principled objective.
📚️arxiv.org/pdf/2602.09533
We propose HATCH🐣, a human-inspired training framework for multi-image spatial reasoning in VLMs 🐤
HATCH improves multi-image spatial reasoning ability while preserving single-image reasoning capabilities 🐓
📚️arxiv.org/abs/2602.08735
Our paper accepted to #ICML2026 🇰🇷(first author)!
This paper is on budget-aligned test-time scaling of LLMs.
It is my first ML conference paper!
Huge thanks to my co-authors ! @dai0NLP@chokkanorg
Preprint: arxiv.org/abs/2602.09574
More details soon!
Rewriting Pre-Training Data Boosts LLM Performance in Math and Code #ICLR2026
Sat, Apr 25, 10:30 AM – 1:00 PM
See you in Rio. I’d be glad to talk in person about open LLM development, training libraries, and distributed training.
arxiv.org/abs/2505.02881openreview.net/forum?id=45bt…