Max Idahl

Tweets

Max Idahl

@maxidahl

Jun 12

Base checkpoint 10,000 lands. Within minutes, forward passes stream the Tier-0 suite; per-domain fitted curves update; the slope monitor flags that the recovery-margin trajectory on the agentic precursor benchmark family is diverging from the sibling run with the alternate code mix. The dashboard doesn't say "MMLU 83.1"; it says "predicted post-probe SWE-style outcome: 41% ± 4.1, up 2.6 from last week, driven by the world-model axis". The data mixing agent picks up on the result. It submits a new sibling run to the queue, eager to continue the hillclimb.
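A minimal sketch of what a slope monitor like the one described above might do, assuming it simply fits a straight line to each run's metric over recent checkpoints and compares the slopes. The function names, the tolerance, and all numbers below are hypothetical and only illustrate the idea; nothing here is taken from an actual dashboard.

```python
from statistics import mean


def fitted_slope(steps, values):
    """Ordinary least-squares slope of values against steps."""
    if len(steps) != len(values) or len(steps) < 2:
        raise ValueError("need at least two (step, value) pairs of equal length")
    step_mean, value_mean = mean(steps), mean(values)
    covariance = sum((s - step_mean) * (v - value_mean) for s, v in zip(steps, values))
    variance = sum((s - step_mean) ** 2 for s in steps)
    if variance == 0:
        raise ValueError("steps must not all be identical")
    return covariance / variance


def slopes_diverge(steps, run_values, sibling_values, tolerance):
    """Return (flag, gap): flag is True when the fitted slopes differ by more than tolerance."""
    gap = fitted_slope(steps, run_values) - fitted_slope(steps, sibling_values)
    return abs(gap) > tolerance, gap


# Hypothetical metric values at four checkpoints (made-up numbers).
steps = [7000, 8000, 9000, 10000]
run = [0.30, 0.33, 0.37, 0.41]
sibling = [0.30, 0.31, 0.32, 0.33]

# tolerance is an assumed threshold in metric units per training step.
flagged, gap = slopes_diverge(steps, run, sibling, tolerance=1e-5)
print(f"diverging: {flagged}, slope gap per step: {gap:.2e}")
```

A real monitor would presumably fit something richer than a line and account for noise across checkpoints; the linear fit is only the simplest stand-in for "per-domain fitted curves".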

Max Idahl

@maxidahl

Jun 1

Great to see a tech report with some details on data. Even better to see propella being put to good use

Poolside

@poolsideai

May 26

Today we’re publishing the technical report behind Laguna M.1 and Laguna XS.2. This report opens up more of what went into them: Model Factory, pre-training data, distributed training, post-training, agent RL, quantization, and evaluation. poolside.ai/assets/laguna/la…

Max Idahl

@maxidahl

May 11

three /goal per day

Max Idahl retweeted

Frank Hutter

@FrankRHutter

May 4

Huge news: @prior_labs has signed a definitive agreement to be acquired by @SAP. €1B invested over four years to build a globally-leading frontier AI lab for structured data — in Europe, in the open. Independent entity. Same team, same mission, same open models. A massive boost to what we can do. The mission just got accelerated. Founders’ statement: priorlabs.ai/blog-posts/prio… (Deal subject to regulatory approval; terms not disclosed.)

Max Idahl

@maxidahl

Apr 26

a tiny bug snug between my mac's display glass and the panel. my inner monk is gonna have a very rough time

Max Idahl

@maxidahl

Apr 24

managed to print two posters last minute today in rio. was less of a hassle than expected

Max Idahl

@maxidahl

Apr 22

Going up the stairs to Cristo Redentor the day before @iclr_conf, everyone talks about GPU clusters

Max Idahl

@maxidahl

Apr 19

Just landed in Rio. A few days early to clear the head before @iclr_conf. Ping me if you want to chat about agents, evals, data, or to get out for some hiking/sightseeing

Max Idahl

@maxidahl

Apr 10

Quality-guided crawling. The easter hack project is turning out too good.

Max Idahl

@maxidahl

Apr 10

deep crawling 150 domains doubles the number of publicly available German HQ tokens?

Max Idahl

@maxidahl

Apr 9

Find me here today.

PyTorch

@PyTorch

Mar 26

GPU MODE (@GPU_MODE) & PyTorch Foundation are organizing an ML systems hackathon in Paris on April 9, immediately following PyTorch Conference Europe 2026. Researchers and engineers will compete across two tracks:
- Distributed training (LLM speedrun) and inference optimization (leaderboard)
- Access to a B300 cluster from Verda and H200 instances from Sesterce
- Cloud credits as prizes, including 48-hour access to a GB300 NVL72 rack
- Talks from PyTorch (Helion), vLLM, Prime Intellect, and more
- Food and refreshments

Doors open at 9:30, with the closing ceremony at 20:00. Attendees can join with a pre-formed team or match on-site. Location details are shared upon registration. Spots are limited. Register: luma.com/gpu-mode-paris-2026 #OpenSourceAI #PyTorchCon #PyTorch #vLLM #Helion

Max Idahl

@maxidahl

Mar 30

This cluster is great. Loads 1T Kimi-K2 from disk to GPU in 108 secs. 512 GPU job, no crash ~5 days in.

Lisan al Gaib

@scaling01

4 Nov 2025

Telekom and NVIDIA building a $1.1B datacenter in Munich with 10k GPUs including DGX B200 and RTX PRO Servers

Max Idahl

@maxidahl

Mar 29

Joe Nemotron also has a pretty good compute multiplier it seems.

Luca Soldaini 🎀

@soldni

Mar 5

This plot undersells how much of a compute multiplier Olmo Hybrid is: 2x compute multiplier on many downstream tasks (and solid LC performance!!!)

Max Idahl

@maxidahl

Mar 29

Note that these are not official Nemotron checkpoints, but an independent reproduction. More on that later. Maybe we could get some official base model checkpoints? Even just a few would be great for comparison @llm_wizard

Max Idahl

@maxidahl

Mar 27

propella got an oral at the DATA-FM workshop at ICLR 2026. See you there!

Max Idahl

@maxidahl

Mar 27

-> data-fm-iclr2026.github.io/

Max Idahl

@maxidahl

Mar 23

Looking at Olmo 3 midtraining. Within 100B tokens, roughly the same gains on downstream evals as within the last 5T tokens. Time to make midtraining alltraining. First 5T token open midtraining dataset when?

Max Idahl

@maxidahl

Mar 23

If only they had avoided the drop at stage change

Max Idahl retweeted

Alexander Doria

@Dorialexander

Mar 20

Actually we have started to use Propella internally to curate common corpus subcollections and run comparisons with other pretraining datasets. Really filling a missing piece of training/synthetic infra in Europe. huggingface.co/ellamind/prop…

Alexander Doria

@Dorialexander

Mar 20

Great annotation work from @ellamindAI / OpenEuroLLM on French-Science-Commons less than 24 hours after release!

Max Idahl

@maxidahl

Mar 20

Annotations are already available. Looks to be very good data. Now go ahead and curate the best seed docs for synth data.

Alexander Doria

@Dorialexander

Mar 19

And new data release: French-Science-Commons, the largest scientific corpus in French in open access including 1.25 million documents/42 million pages re-digitized with VLM (dots ocr).

Max Idahl

@maxidahl

Mar 20

Annotations on @huggingface: hf.co/datasets/openeurollm/p… Could maybe be useful for the exploration tool at french-science-commons.pleia… @Dorialexander ?

openeurollm/propella-annotations · Datasets at Hugging Face