Machine Translation & LLM i18n @Apple  | Previously: @IBM, @UNI_TUE, @MPI_IS | Catch me doing street photography or coding in your local coffee shop ☕️

Joined May 2017
8 Photos and videos
Pinned Tweet
New Apple models are here, super exciting stuff that we built with the team! Especially for the AFM 3 Core Advanced architecture it’s worth a read. Technical report is coming later this summer! machinelearning.apple.com/re…
1
3
18
867
Robin M. Schmidt retweeted
We did Shampoo so we kill all the variants of Adam. Now we call Shampoo b2=0.0 as Muon, then created 10s of variants thats marginally better.
Moratorium on new optimizers until we figure out whats going on
9
4
158
18,884
Robin M. Schmidt retweeted
men in their 40s used to have cool midlife crisis but now they just have agentic workflows
158
675
7,599
447,118
Heading to #EMNLP2025 in Suzhou, reach out if you want to grab a drink!
2
303
If you were impacted by the Meta layoffs yesterday and wan't to work on Multilingual LLMs / Machine Translation @Apple, feel free to send me a DM or apply: jobs.apple.com/en-us/details…

23 Oct 2025
Several of my team members myself are impacted by this layoff today. Welcome to connect :)
2
1,082
New on-device improvements!
At WWDC we introduce a new generation of LLMs developed to enhance the Apple Intelligence features. We also introduce the new Foundation Models framework, which gives app developers direct access to the on-device foundation language model. machinelearning.apple.com/re…
1
386
It’s extremely upsetting when I write an extensive review for a paper submission and the authors just withdraw the paper without any rebuttal whatsoever
2
397
Exciting times to join Apple for AI/ML! machinelearning.apple.com/re…

2
1
11
1,356
Check out #WWDC24 for more info what we do in ML @ Apple!
1
260
you get priority boarding because you fly business, I get it because of extra security screening, we’re not the same
1
275
take a shot each time Jensen says accelerate during the GTC keynote
1
233
the Starbucks honor pickup system is the only thing holding our society together at this point
1
177
why have a blog post when you can just drop a torrent link to the model weights without any context
8 Dec 2023
magnet:?xt=urn:btih:5546272da9065eddeb6fcd7ffddeef5b75be79a7&dn=mixtral-8x7b-32kseqlen&tr=udp:/%2Fopentracker.i2p.rocks:6969/announce&tr=http:/%https://t.co/g0m9cEUz0T:80/announce RELEASE a6bbd9affe0c2725c1b7410d66833e24
387
Show everyone that your optimizer is truly better than AdamW and claim the price money 👇🏻💰
After 3 years of hard work, our unprecedented neural network training algorithm competition is finally open! The exciting part starts now, seeing what the community can create. 🏆Submit, become the next Adam, and bag $50,000 in prizes! mlcommons.org/2023/11/mlc-al…
1
3
525
my landlord is trying to increase my rent by 26.7% yoy 🤡
2
1
287
I knew nyc was bad but come on now
1
1
191
Is it bad that the grid size of 1 (as is standard) used for hyperparameter tuning is reminding me of optimizer papers 🤔
Found a kindred spirit at #icml2023 #feelthelearn
1
2
444
Presenting this tomorrow 9:00 — 10:30am in Frontenac Ballroom / Queen’s Quay, drop by everyone and say hi !! #ACL2023NLP
5 May 2023
Learning Language-Specific Layers for Multilingual Machine Translation abs: arxiv.org/abs/2305.02665 paper page: huggingface.co/papers/2305.0…
1
1
8
1,953
It’s on Poster board 64!
103
well seems like the air space above nyc is closed and getting to #ACL2023NLP is going to be tougher than expected… 3 hours waiting in the plane and now we are heading back to the gate lol
2
2
366
aaaand they cancelled all flights, hopefully I can get there tomorrow 🤞🏻
1
170