@ReflectionAI. Prev: @GoogleDeepMind: Gemini Pretraining

Joined June 2010
5 Photos and videos
Anudhyan Boral retweeted
Here you go, sir. Muon is a good optimizer. I think Keller's attempt at implementing it is great -- this primarily helped me look at why his effort produced much worse looking curve than what I could get. This ended up being a nerdsnipe into hyper parameters, grafting, and eigh calls in the distributed shampoo package from Meta in my car ride home. The main delta's are below:
I've added two optimizers to the public benchmark: (1) Shampoo (with its original 1/4 power). (2) Spectral descent, which is equivalent to both Muon(mu=0) and Shampoo(b1=b2=0). Result: Shampoo falls halfway between Muon & Adam; Spectral descent is ~2x slower. Thread below 1/6
21
39
446
142,565
Anudhyan Boral retweeted
25 Nov 2025
they're all good models, sir! 🫡
25 Nov 2025
hmmm 🤔
7
9
337
60,667
Anudhyan Boral retweeted
Huge congrats to the team on an amazing model. Thousands of contributors. Billions of use cases. Feeling the AGI!
26
49
1,605
113,655
Anudhyan Boral retweeted
MathArena results for gemini-2.5-pro
11
49
603
71,403
Super proud to have co-led the development of new training algorithms that were critical for pre-training of the awesome Gemini 2.0 models. Looking forward to users and devs unleashing their creativity and driving innovations using these models!🚀♊️⚡️
Gemini 2.0 is now available to everyone. ✨ ⚡ Start using an updated 2.0 Flash in @Google AI Studio, @GoogleCloud’s #VertexAI and in @GeminiApp. We’re also introducing: 🔵 2.0 Pro Experimental, which excels at coding. 🔵 2.0 Flash-Lite, our most cost-efficient model yet. 🔵 2.0 Flash Thinking Experimental, in @GeminiApp. 🧵 goo.gle/40OsRfj
3
184
Anudhyan Boral retweeted
Gemini-2.0 makes a huge leap on our MEGA-Bench leaderboard to beat all the competitors! With the other benchmarks being either overfitted or leaked, I believe MEGA-Bench serves a more reliable indicator to show the multimodal models' true performance to generalize to 505 real-world tasks. Leaderboard Link: huggingface.co/spaces/TIGER-… Congrats to Gemini team @OfficialLoganK @JeffDean
9
29
247
28,274
Anudhyan Boral retweeted
Me (and everyone else TBH)
1
15
325
17,101
Anudhyan Boral retweeted
I am in awe of Gemini's ability to solve JEE level maths questions with stepwise reasoning as well as with the option elimination method.
8
9
200
15,464
Anudhyan Boral retweeted
Short Stories: "Rational Approximation" (by Lloyd N. Trefethen, in Notices of the @amermathsoc): ams.org/journals/notices/202…

3
1
799
Just planned a Seattle getaway with airial.travel - it's an absolute game changer! Offering everything from handpicked destinations to perfectly timed itineraries and seamless booking. It's one of the most unique and well-put-together AI products I've seen recently.
We are announcing the launch of Airial Travel’s open-to-all beta version for desktop today. Airial is your personal travel agent with AI superpowers which makes planning and booking trips as easy as dreaming them up. airial.travel Me and Sanjeev co-founded Airial Travel a year ago to solve a problem we faced repeatedly. Being avid travelers living in the US with our families in India, we were traveling for several months a year and spending multiple days planning and booking each trip. Hours and hours of research, browsing, watching videos, form-filling, spreadsheets, refinements etc. At the end of the process in most cases, we just booked because we were exhausted and wanted to get it over with. As we talked to people and read up about this, we realized that the scale and the intensity of this problem is stunning - hundreds of millions of trips are booked online every year and planning each of them takes over five hours on average. Our vision “Just imagine your trip, and Airial it!” stems from our ultimate wish as travelers - AI that can figure out all the intricacies of trip planning for you - hotels, activities, flights, trains, transits, deals, date options, restaurants, interests, research, travel videos and everything else. Our defining features originate from the core beliefs that Airial is built on: 📅 Detailed intricately crafted plans: Attention to detail makes trip plans incredible. Airial plans trips in a level of detail that is simply unmatched, taking care of hundreds of common sense constraints across thousands of variables in seconds. 🚀 From Reels to Itineraries: TikTok / IG reels to Trips is work that millions do manually. We now enable all this in a click. This is the intersection of the two big trends in travel - AI and Socials. 🏖️ Personalized Planning: Travel portals today are one-size-fits-all. We plan trips tailored to your specific interests - architecture tours, scenic hikes or samurai sword fighting lessons! 👆 Actionable Itineraries: Just having a chatbot pick out one combination for you isn’t practically useful. Which was the last trip you planned that didn’t need any refinement? Every decision Airial makes for you is changeable via chat or UI controls. 🌎 Discovery in context: As you plan your trip, Airial gives you the tools to discover incredible ideas and expert advice specific to your itinerary and interests which can be instantly imported into your trip. Now you can “Just imagine your trip and Airial it!”. Try it out now on your laptop at airial.travel!
1
2
235
Anudhyan Boral retweeted
6 Dec 2024
What a way to celebrate one year of incredible Gemini progress -- #1🥇across the board on overall ranking, as well as on hard prompts, coding, math, instruction following, and more, including with style control on. Thanks to the hard work of everyone in the Gemini team and elsewhere at Google! 🎊
6 Dec 2024
Big news on Chatbot Arena 🔥 The new @GoogleDeepMind model gemini-exp-1206 is crushing it, and the race is heating up. Google is back in the #1 spot 🏆overall and tied with O1 for the top coding model! Highlights (improvement since gemini-exp-1121 in parentheses) - First place overall (2->1) - Tied with GPT-4o-1120 after style control (4->1) - Tied with O1 on coding leaderboard (3->1) - First place on hard prompts (2->1) Keep it up @GoogleDeepMind! The rate of progress is crazy. For analysis and to test the model, see below 👇
88
311
1,583
782,998
Thrilled to have helped train this awesome model. Algorithmic efficiency FTW!⚡️⚡️⚡️
3 Oct 2024
Flash8B General Availability: We originally trained Flash 8B giving it all our algorithmic efficiency improvements to pack as much as possible in a small form factor which then was scaled up to Flash On benchmarks, it is closely matching Flash announced during May at I/O
33
4,046
Anudhyan Boral retweeted
Excited to share that the Machine Learning and Optimization team at @GoogleDeepMind India is hiring Research Scientists and Research Engineers! If you're passionate about cutting-edge AI research and building efficient, elastic, and safe LLMs, we'd love to hear from you. Check out our open roles and apply: boards.greenhouse.io/deepmin… boards.greenhouse.io/deepmin… cc: @ManishGuptaMG1 @PNetrapalli @adityakusupati @divy93t @partha_p_t

10
59
469
136,046
Anudhyan Boral retweeted
Gemini Flash is now tied with gpt-4o for #2 on the lmsys *vision* leaderboard! combine that with a 1M context length and you can do some seriously cool multimodal work for super cheap ⚡️⚡️⚡️
7
19
195
62,936
Anudhyan Boral retweeted
A modest 59% win-rate over the original GPT4 too. I remember when that model was considered halfway to AGI. Now the price of that quality has dropped by almost 100x in 15 months.
28 May 2024
These win rates are quite interesting too Flash has a win rate of 55% over gpt-4-1106 while loses to Yi-Large at 49% which is ranked lower. Use the model that fits you!
1
4
45
9,404
Anudhyan Boral retweeted
28 May 2024
Big news – Gemini 1.5 Flash, Pro and Advanced results are out!🔥 - Gemini 1.5 Pro/Advanced at #2, closing in on GPT-4o - Gemini 1.5 Flash at #9, outperforming Llama-3-70b and nearly reaching GPT-4-0125 (!) Pro is significantly stronger than its April version. Flash’s cost, capabilities, and unmatched context length make it a market game-changer! Huge congrats to @GoogleDeepMind on the incredible Gemini launches! Can't wait to see what new applications Gemini unlocks! More breakdown analysis below👇
35
242
1,134
394,124
Anudhyan Boral retweeted
20 May 2024
Just finished reading the Gemini 1.5 report and I'm blown away by the depth of information shared in such a competitive environment! 🤯 Most surprising was the revelation about their optimizer - they didn't just use Adam! Optimization is still alive and kicking! Kudos to the team for their great work! 👏
6
41
347
73,727
If you're at NeurIPS this year, come check out our work on modeling turbulence with probabilistic generative models! (Find us at the poster session today 12/14 at 5pm) A short thread đź§µ(1/N):
1
1
1,218
...and let SGD and empirical risk minimization discover the latent low-dimensional dynamics! The latent dim is low enough that we can solve the SDE much faster than a numerical solver. We get stable rollouts across 1000s of steps while maintaining high fidelity throughout. (8/N)
1
183
For more, come visit our poster #603 at 5pm at #NeurIPS2023 or check out our paper (arxiv.org/abs/2306.01174). Joint work with my awesome colleagues @GoogleAI Zhong Yi Wan, Leonardo Zepeda-Núñez, James Lottes, Qing Wang, Yi-fan Chen, John Anderson, @feishaAI (9/N; N=9)
137