rohan anil

Anudhyan Boral retweeted

rohan anil

@_arohan_

Jun 9

Here you go, sir. Muon is a good optimizer. I think Keller's attempt at implementing it is great -- this primarily helped me look at why his effort produced a much worse-looking curve than what I could get. This ended up being a nerdsnipe into hyperparameters, grafting, and eigh calls in the distributed shampoo package from Meta on my car ride home. The main deltas are below:

Keller Jordan

@kellerjordan0

Jun 8

I've added two optimizers to the public benchmark: (1) Shampoo (with its original 1/4 power). (2) Spectral descent, which is equivalent to both Muon(mu=0) and Shampoo(b1=b2=0). Result: Shampoo falls halfway between Muon & Adam; Spectral descent is ~2x slower. Thread below 1/6
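The equivalence claimed in the post above can be checked numerically. The following is a minimal sketch, not the benchmark's code: it assumes a square, full-rank gradient matrix, takes "spectral descent" to mean the direction U V^T from the SVD G = U S V^T, and takes Shampoo with no accumulation to mean (G G^T)^(-1/4) G (G^T G)^(-1/4) built from the current gradient only. All function names are hypothetical. Muon with zero momentum is usually described as approximating the same U V^T factor with a Newton-Schulz iteration; that approximation is not reproduced here.

```python
import numpy as np


def spectral_descent_direction(grad):
    """Orthogonal factor U V^T from the reduced SVD of the gradient."""
    u, _, vt = np.linalg.svd(grad, full_matrices=False)
    return u @ vt


def inverse_fourth_root(mat, eps=1e-12):
    """Raise a symmetric PSD matrix to the power -1/4 via eigh."""
    eigvals, eigvecs = np.linalg.eigh(mat)
    eigvals = np.maximum(eigvals, eps)  # guard against tiny negative values
    return (eigvecs * eigvals ** -0.25) @ eigvecs.T


def shampoo_direction_no_accumulation(grad):
    """Shampoo-style update using only the current gradient's statistics."""
    left = grad @ grad.T    # left preconditioner statistic
    right = grad.T @ grad   # right preconditioner statistic
    return inverse_fourth_root(left) @ grad @ inverse_fourth_root(right)


# Build a well-conditioned test gradient with known singular values.
rng = np.random.default_rng(0)
q_left, _ = np.linalg.qr(rng.standard_normal((4, 4)))
q_right, _ = np.linalg.qr(rng.standard_normal((4, 4)))
grad = q_left @ np.diag([3.0, 2.0, 1.0, 0.5]) @ q_right.T

spectral = spectral_descent_direction(grad)
shampoo = shampoo_direction_no_accumulation(grad)
max_gap = float(np.max(np.abs(spectral - shampoo)))
print("max |spectral - shampoo| =", max_gap)
```

The two directions agree because (G G^T)^(-1/4) = U S^(-1/2) U^T and (G^T G)^(-1/4) = V S^(-1/2) V^T, so the singular values cancel and only U V^T remains.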

Susan Zhang

Anudhyan Boral retweeted

Susan Zhang

@suchenzang

25 Nov 2025

they're all good models, sir! 🫡

Susan Zhang

@suchenzang

25 Nov 2025

hmmm 🤔

Noam Shazeer

Anudhyan Boral retweeted

Noam Shazeer

@NoamShazeer

19 Nov 2025

Huge congrats to the team on an amazing model. Thousands of contributors. Billions of use cases. Feeling the AGI!

Ankesh Anand

Anudhyan Boral retweeted

Ankesh Anand

@ankesh_anand

30 Mar 2025

MathArena results for gemini-2.5-pro

Anudhyan Boral

Anudhyan Boral @bloopsie

10 Feb 2025

Super proud to have co-led the development of new training algorithms that were critical for pre-training of the awesome Gemini 2.0 models. Looking forward to users and devs unleashing their creativity and driving innovations using these models!🚀♊️⚡️

Google DeepMind

@GoogleDeepMind

5 Feb 2025

Gemini 2.0 is now available to everyone. ✨ ⚡

Start using an updated 2.0 Flash in @Google AI Studio, @GoogleCloud’s #VertexAI and in @GeminiApp.

We’re also introducing:
🔵 2.0 Pro Experimental, which excels at coding.
🔵 2.0 Flash-Lite, our most cost-efficient model yet.
🔵 2.0 Flash Thinking Experimental, in @GeminiApp.

🧵 goo.gle/40OsRfj

Wenhu Chen

Anudhyan Boral retweeted

Wenhu Chen @WenhuChen

6 Jan 2025

Gemini-2.0 makes a huge leap on our MEGA-Bench leaderboard to beat all the competitors! With the other benchmarks being either overfitted or leaked, I believe MEGA-Bench serves as a more reliable indicator of multimodal models' true ability to generalize to 505 real-world tasks. Leaderboard Link: huggingface.co/spaces/TIGER-… Congrats to Gemini team @OfficialLoganK @JeffDean

François Fleuret

Anudhyan Boral retweeted

François Fleuret

@francoisfleuret

1 Jan 2025

Me (and everyone else TBH)

Ashish Tendulkar

Anudhyan Boral retweeted

Ashish Tendulkar @ashish_vt

20 Dec 2024

I am in awe of Gemini's ability to solve JEE level maths questions with stepwise reasoning as well as with the option elimination method.

SIAM Activity Group on Dynamical Systems

Anudhyan Boral retweeted

SIAM Activity Group on Dynamical Systems @DynamicsSIAM

18 Dec 2024

Short Stories: "Rational Approximation" (by Lloyd N. Trefethen, in Notices of the @amermathsoc): ams.org/journals/notices/202…

Anudhyan Boral

Anudhyan Boral @bloopsie

11 Dec 2024

Just planned a Seattle getaway with airial.travel - it's an absolute game changer! Offering everything from handpicked destinations to perfectly timed itineraries and seamless booking. It's one of the most unique and well-put-together AI products I've seen recently.

Archit Karandikar

@KarchitK

10 Dec 2024

We are announcing the launch of Airial Travel’s open-to-all beta version for desktop today. Airial is your personal travel agent with AI superpowers which makes planning and booking trips as easy as dreaming them up. airial.travel

Me and Sanjeev co-founded Airial Travel a year ago to solve a problem we faced repeatedly. Being avid travelers living in the US with our families in India, we were traveling for several months a year and spending multiple days planning and booking each trip. Hours and hours of research, browsing, watching videos, form-filling, spreadsheets, refinements etc. At the end of the process in most cases, we just booked because we were exhausted and wanted to get it over with.

As we talked to people and read up about this, we realized that the scale and the intensity of this problem is stunning - hundreds of millions of trips are booked online every year and planning each of them takes over five hours on average.

Our vision “Just imagine your trip, and Airial it!” stems from our ultimate wish as travelers - AI that can figure out all the intricacies of trip planning for you - hotels, activities, flights, trains, transits, deals, date options, restaurants, interests, research, travel videos and everything else.

Our defining features originate from the core beliefs that Airial is built on:

📅 Detailed intricately crafted plans: Attention to detail makes trip plans incredible. Airial plans trips in a level of detail that is simply unmatched, taking care of hundreds of common sense constraints across thousands of variables in seconds.

🚀 From Reels to Itineraries: TikTok / IG reels to Trips is work that millions do manually. We now enable all this in a click. This is the intersection of the two big trends in travel - AI and Socials.

🏖️ Personalized Planning: Travel portals today are one-size-fits-all. We plan trips tailored to your specific interests - architecture tours, scenic hikes or samurai sword fighting lessons!

👆 Actionable Itineraries: Just having a chatbot pick out one combination for you isn’t practically useful. Which was the last trip you planned that didn’t need any refinement? Every decision Airial makes for you is changeable via chat or UI controls.

🌎 Discovery in context: As you plan your trip, Airial gives you the tools to discover incredible ideas and expert advice specific to your itinerary and interests which can be instantly imported into your trip.

Now you can “Just imagine your trip and Airial it!”. Try it out now on your laptop at airial.travel!

Jeff Dean

Anudhyan Boral retweeted

Jeff Dean

@JeffDean

6 Dec 2024

What a way to celebrate one year of incredible Gemini progress -- #1🥇across the board on overall ranking, as well as on hard prompts, coding, math, instruction following, and more, including with style control on. Thanks to the hard work of everyone in the Gemini team and elsewhere at Google! 🎊

Arena.ai

@arena

6 Dec 2024

Big news on Chatbot Arena 🔥 The new @GoogleDeepMind model gemini-exp-1206 is crushing it, and the race is heating up. Google is back in the #1 spot 🏆overall and tied with O1 for the top coding model!

Highlights (improvement since gemini-exp-1121 in parentheses)
- First place overall (2->1)
- Tied with GPT-4o-1120 after style control (4->1)
- Tied with O1 on coding leaderboard (3->1)
- First place on hard prompts (2->1)

Keep it up @GoogleDeepMind! The rate of progress is crazy. For analysis and to test the model, see below 👇

Anudhyan Boral

Anudhyan Boral @bloopsie

5 Oct 2024

Thrilled to have helped train this awesome model. Algorithmic efficiency FTW!⚡️⚡️⚡️

rohan anil

@_arohan_

3 Oct 2024

Flash8B General Availability: We originally trained Flash 8B giving it all our algorithmic efficiency improvements to pack as much as possible in a small form factor, which then was scaled up to Flash. On benchmarks, it is closely matching the Flash announced during May at I/O.

Prateek Jain

Anudhyan Boral retweeted

Prateek Jain

@jainprateek_

4 Oct 2024

Excited to share that the Machine Learning and Optimization team at @GoogleDeepMind India is hiring Research Scientists and Research Engineers! If you're passionate about cutting-edge AI research and building efficient, elastic, and safe LLMs, we'd love to hear from you. Check out our open roles and apply: boards.greenhouse.io/deepmin… boards.greenhouse.io/deepmin… cc: @ManishGuptaMG1 @PNetrapalli @adityakusupati @divy93t @partha_p_t

Zachary Nado

Anudhyan Boral retweeted

Zachary Nado @zacharynado

5 Sep 2024

Gemini Flash is now tied with gpt-4o for #2 on the lmsys *vision* leaderboard! combine that with a 1M context length and you can do some seriously cool multimodal work for super cheap ⚡️⚡️⚡️

Jonas Adler

Anudhyan Boral retweeted

Jonas Adler

@JonasAAdler

28 May 2024

A modest 59% win-rate over the original GPT4 too. I remember when that model was considered halfway to AGI. Now the price of that quality has dropped by almost 100x in 15 months.

rohan anil

@_arohan_

28 May 2024

These win rates are quite interesting too. Flash has a win rate of 55% over gpt-4-1106, while it loses to Yi-Large (49% win rate), which is ranked lower. Use the model that fits you!

Arena.ai

Anudhyan Boral retweeted

Arena.ai

@arena

28 May 2024

Big news – Gemini 1.5 Flash, Pro and Advanced results are out!🔥
- Gemini 1.5 Pro/Advanced at #2, closing in on GPT-4o
- Gemini 1.5 Flash at #9, outperforming Llama-3-70b and nearly reaching GPT-4-0125 (!)

Pro is significantly stronger than its April version. Flash’s cost, capabilities, and unmatched context length make it a market game-changer!

Huge congrats to @GoogleDeepMind on the incredible Gemini launches! Can't wait to see what new applications Gemini unlocks! More breakdown analysis below👇

Tianle Cai

Anudhyan Boral retweeted

Tianle Cai

@tianle_cai

20 May 2024

Just finished reading the Gemini 1.5 report and I'm blown away by the depth of information shared in such a competitive environment! 🤯 Most surprising was the revelation about their optimizer - they didn't just use Adam! Optimization is still alive and kicking! Kudos to the team for their great work! 👏

Anudhyan Boral

Anudhyan Boral @bloopsie

14 Dec 2023

If you're at NeurIPS this year, come check out our work on modeling turbulence with probabilistic generative models! (Find us at the poster session today 12/14 at 5pm) A short thread 🧵(1/N):

Anudhyan Boral

Anudhyan Boral @bloopsie

14 Dec 2023

...and let SGD and empirical risk minimization discover the latent low-dimensional dynamics! The latent dim is low enough that we can solve the SDE much faster than a numerical solver. We get stable rollouts across 1000s of steps while maintaining high fidelity throughout. (8/N)
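To illustrate why a low-dimensional latent SDE is cheap to roll out for thousands of steps, here is a minimal Euler-Maruyama sketch. It is not the paper's model: the drift and diffusion below are hypothetical toy functions (a mean-reverting drift with constant noise) standing in for learned networks, and the latent dimension, step size, and step count are arbitrary illustrative choices.

```python
import numpy as np


def euler_maruyama_rollout(z0, drift, diffusion, dt, num_steps, rng):
    """Integrate dz = drift(z) dt + diffusion(z) dW with Euler-Maruyama."""
    z = np.asarray(z0, dtype=float)
    path = np.empty((num_steps + 1, z.size))
    path[0] = z
    for step in range(num_steps):
        # Brownian increment over one step has standard deviation sqrt(dt).
        noise = rng.standard_normal(z.size) * np.sqrt(dt)
        z = z + drift(z) * dt + diffusion(z) * noise
        path[step + 1] = z
    return path


def toy_drift(z):
    """Hypothetical stand-in for a learned drift: pulls the state toward 0."""
    return -z


def toy_diffusion(z):
    """Hypothetical stand-in for a learned diffusion: constant noise scale."""
    return 0.1 * np.ones_like(z)


rng = np.random.default_rng(0)
path = euler_maruyama_rollout(
    z0=np.ones(8),        # assumed 8-dimensional latent state
    drift=toy_drift,
    diffusion=toy_diffusion,
    dt=0.01,
    num_steps=2000,
    rng=rng,
)
print(path.shape)
```

Each step costs only a few vector operations in the latent dimension, which is why the rollout stays inexpensive even over thousands of steps.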

Anudhyan Boral

Anudhyan Boral @bloopsie

14 Dec 2023

For more, come visit our poster #603 at 5pm at #NeurIPS2023 or check out our paper (arxiv.org/abs/2306.01174). Joint work with my awesome colleagues @GoogleAI Zhong Yi Wan, Leonardo Zepeda-Núñez, James Lottes, Qing Wang, Yi-fan Chen, John Anderson, @feishaAI (9/N; N=9)

Neural Ideal Large Eddy Simulation: Modeling Turbulence with...

We introduce a data-driven learning framework that assimilates two powerful ideas: ideal large eddy simulation (LES) from turbulence closure modeling and neural stochastic differential equations...

arxiv.org
