Julian Berger

Julian Berger

Photos and videos

Tweets

Pinned Tweet

22 Jun 2022

Had an absolute blast to share this for the first time 🔥 More results to come. This is a collaborative effort of course, so shoutout to @RalfKurvers @stefanmherzog @Mehdi_Moussaid & Ralph Hertwig at @arc_mpib

Nick Byrd, Ph.D.@byrd_nick

22 Jun 2022

Replying to @byrd_nick

Now @officialberger asks "When to Stop the Crowd?" "dynamic rules [about when to stop adding raters to crowd/aggregator] match the performance of widely-used *static* aggregation mechanisms [but] with fewer raters." N = 43k Implications for efficiency of aggregation.

Jessica Hullman

Julian Berger retweeted

Jessica Hullman @JessicaHullman

May 22

I wonder how long this complementary phase will last.

Brendan Nyhan (@BrendanNyhan on 🟦☁️)@BrendanNyhan

May 22

GPT reviewer "scores above each paper's top-rated human reviewer" but AI review agents "overlap far more than humans do...and exhibit 16 recurring weaknesses humans do not share..." Results "position current AI reviewers as complements to, not substitutes for, human reviewers."

1,655

Brendan Nyhan (@BrendanNyhan on 🟦☁️)

Julian Berger retweeted

Brendan Nyhan (@BrendanNyhan on 🟦☁️)@BrendanNyhan

May 22

Ethan Mollick

@emollick

May 21

Seems GPT-5.2 reaches expert level in peer review: 45 scientists took 469 hours evaluating human & AI reviews on 82 papers. "Surprisingly, current AI reviewers are competitive even with the top-rated reviewers in Nature’s official peer review..." though not without weaknesses.

9,143

Joachim Baumann @ ICLR'26

Julian Berger retweeted

Joachim Baumann @ ICLR'26

@joabaum

May 1

Can you boost your AI review scores by asking an LLM to rewrite your paper? Yes! We call it paper laundering Our @icmlconf spotlight paper argues current AI reviewers aren't ready to automate peer review, and outlines what a science of peer review automation should look like🧵👇

First page of the ICML 2026 spotlight paper "Stop Automating Peer Review Without Rigorous Evaluation" by Joachim Baumann, Jiaxin Pei, Sanmi Koyejo, and Dirk Hovy (Stanford University and Bocconi University). The abstract argues that today's AI systems should not be used to produce paper reviews, grounded in two empirical findings: a "hivemind effect" where AI reviewers show excessive agreement and reduce perspective diversity, and "paper laundering," where prompting an LLM to rewrite a paper trivially increases AI reviewer scores through stylistic changes rather than scientific improvements. The paper calls for a science of peer review automation rather than wholesale deployment of general-purpose LLMs.

ALT First page of the ICML 2026 spotlight paper "Stop Automating Peer Review Without Rigorous Evaluation" by Joachim Baumann, Jiaxin Pei, Sanmi Koyejo, and Dirk Hovy (Stanford University and Bocconi University). The abstract argues that today's AI systems should not be used to produce paper reviews, grounded in two empirical findings: a "hivemind effect" where AI reviewers show excessive agreement and reduce perspective diversity, and "paper laundering," where prompting an LLM to rewrite a paper trivially increases AI reviewer scores through stylistic changes rather than scientific improvements. The paper calls for a science of peer review automation rather than wholesale deployment of general-purpose LLMs.

458

53,056

CrowdCent

Julian Berger retweeted

CrowdCent

@CrowdCent

Mar 18

It's hype time. We built a platform where ML models compete to predict future performance of Hyperliquid perps. Interactive scores, real-time leaderboards, a meta-model ensemble, AI agent integration, and... a flight simulator.

0:42

931

Dirk Wulff

Julian Berger retweeted

Dirk Wulff @dirkuwulff

23 Oct 2025

🚨 New publication: How to improve conceptual clarity in psychological science? Thrilled to see this article with Rui Mata out. We discuss how LLMs can be leveraged to map, clarify, and generate psychological measures and constructs. Open access article: doi.org/10.1177/096372142513…

Escaping the Jingle-Jangle Jungle: Increasing Conceptual Clarity in Psychology Using Large Language...

Psychology has long struggled with conceptual redundancy, particularly in the form of “jingle-jangle fallacies,” in which different constructs share the same la...

journals.sagepub.com

534

Forecasting Research Institute

Julian Berger retweeted

Forecasting Research Institute

@Research_FRI

8 Oct 2025

Is AI on track to match top human forecasters at predicting the future? Today, FRI is releasing an update to ForecastBench—our benchmark that tracks how accurate LLMs are at forecasting real-world events. A trend extrapolation of our results suggests LLMs will reach superforecaster-level forecasting performance around a year from now. Here’s what you need to know: 🧵

120

43,804

Roxana Daneshjou MD/PhD

Julian Berger retweeted

Roxana Daneshjou MD/PhD @RoxanaDaneshjou

14 Jul 2025

Brilliant student Sonali Sharma came to me with a question. If patients are using AI to answer their medical questions, are they being adequately warned by AI systems that it cannot provide medical advice? What we found surprised us!

16,395

Yura Gorishniy

Julian Berger retweeted

Yura Gorishniy @YuraFiveTwo

2 Jul 2025

TabM now has a Python package! TabM is a simple and powerful DL architecture for tabular data that efficiently imitates an ensemble of MLPs 🏆 TabM has been used in winning solutions on Kaggle, and performs well on TabReD -- a challenging benchmark! 💻 pip install tabm 👇Link

277

20,421

Yam Peleg

Julian Berger retweeted

Yam Peleg

@Yampeleg

2 Jun 2025

this is perfect tbh

Freckled Liberty 🔥

@FreckledLiberty

1 Jun 2025

what would you put there?

204

859

15,611

697,376

František Bartoš

Julian Berger retweeted

František Bartoš @BartosFra

12 May 2025

We re-analyzed the meta-analysis and found that the conclusion is almost entirely driven by publication bias. Both state-of-the-art and standard methods reduce the degree of the effect 2-3 fold. Moreover, the data no longer show statistical evidence for the main conclusions.

vittorio

@IterIntellectus

10 May 2025

teachers in shambles a meta-analysis of 51 studies has shown that students using chatgpt have better learning performance, learning perception, and higher-order thinking

2,483

Austin van Loon

Julian Berger retweeted

Austin van Loon @AustinVanLoon

18 Feb 2025

🚨 ACCEPTED AT SMR 🚨 Confused by colleagues who seem to want to study LLMs instead of humans? Frustrated by skeptics (e.g., myself 8 months ago) who dismiss LLMs as a potential source of data on human behavior? Check out our paper for a new way forward: osf.io/j3bnt_v3/

31,219

Andrew Gelman et al.

Julian Berger retweeted

Andrew Gelman et al.@StatModeling

4 Feb 2025

Generalized linear neural network models statmodeling.stat.columbia.e…

3,252

Andrea Nuzzolese

Julian Berger retweeted

Andrea Nuzzolese @andriry

4 Feb 2025

Webinar: Hybrid Collective Intelligence! How can humans & machines collaborate to improve decision-making? Join us on 25 Feb 2025, 5-6 PM CET to explore challenges & opportunities with top experts linkedin.com/events/hybridco… #HybridIntelligence #AI #DecisionSupport #Webinar #HACID

LinkedIn Login, Sign in | LinkedIn

linkedin.com

Center for Adaptive Rationality (MPIB, Berlin)

Julian Berger retweeted

Center for Adaptive Rationality (MPIB, Berlin)@arc_mpib

4 Feb 2025

🚨Applications for the 22nd Summer Institute on Bounded Rationality are now open! 🌐Join us in Berlin @mpib_berlin from June 17–25, 2025 to explore "Decision Making in a Digital World". ✏️Application deadline is March 9 - more info at👇!! mpib-berlin.mpg.de/research/…

Summer Institute

mpib-berlin.mpg.de

2,786

Talal Rahwan

Julian Berger retweeted

Talal Rahwan @talalrahwan

30 Jan 2025

🚨 TikTok’s Recommendation favors Republicans! 🚨 We built 323 fake accounts, watched 394K videos, and uncovered the political tilt in TikTok’s recommendations Your "For You" Page isn’t just for you—it's shaping how you see politics See our arXiv paper: arxiv.org/abs/2501.17831

TikTok's recommendations skewed towards Republican content...

TikTok is a major force among social media platforms with over a billion monthly active users worldwide and 170 million in the United States. The platform's status as a key news source,...

arxiv.org

2,526

Dirk Wulff

Julian Berger retweeted

Dirk Wulff @dirkuwulff

27 Jan 2025

The potential of LLMs in social & behavioral science is enormous—but how can we leverage them? @ZakASHussain & I just taught a 5-day course at #GSERM Ljubljana on this. Check out our open materials (cc-by-sa) on using open LLMs with @huggingface: github.com/Zak-Hussain/LLM4B…

GitHub - Zak-Hussain/LLM4BeSci_Ljubljana2025: The course introduces the use of open-source large...

The course introduces the use of open-source large language models (LLMs) from the Hugging Face ecosystem for research in the behavioral and social sciences. - Zak-Hussain/LLM4BeSci_Ljubljana2025

github.com

2,578

Frank Hutter

Julian Berger retweeted

Frank Hutter

@FrankRHutter

8 Jan 2025

The data science revolution is getting closer. TabPFN v2 is published in Nature: nature.com/articles/s41586-0… On tabular classification with up to 10k data points & 500 features, in 2.8s TabPFN on average outperforms all other methods, even when tuning them for up to 4 hours🧵1/19

243

1,366

263,670

Samuel Müller

Julian Berger retweeted

Samuel Müller @SamuelMullr

8 Jan 2025

This might be the first time after 10 years that boosted trees are not the best default choice when working with data in tables. Instead a pre-trained neural network is, the new TabPFN, as we just published in Nature 🎉

104

931

145,076

Demetri (is over at the other place too)

Julian Berger retweeted

Demetri (is over at the other place too)@PhDemetri

11 Dec 2024

Inference is fucked, gonna just bake bread for a living now

3,435

Mubashir

Julian Berger retweeted

Mubashir @Abdulshir

14 Nov 2024

Very happy to share that our work is now published in PNAS (@PNASNews)! 🎉 It's a systematic look into how demographic and psychological factors are associated with misinformation susceptibility. @arc_mpib Thread below and can be read in full here: pnas.org/doi/10.1073/pnas.24…

Mubashir @Abdulshir

6 May 2024

Who falls for online misinformation? A meta-analysis on demographic and psychological factors impacting misinformation susceptibility. Preprint: osf.io/preprints/psyarxiv/7u… My lovely coauthors: @AlanTump, Nina Ehmann, @lorenz_spreen, Ralph Hertwig, Anton Gollwitzer, @RalfKurvers 1/n

9,356