Saurabh Bhatnagar

Saurabh Bhatnagar

966 Photos and videos

Tweets

Pinned Tweet

Saurabh Bhatnagar @analyticsaurabh

25 Apr 2018

I just published “How I scaled Machine Learning to a Billion dollars: Strategy” medium.com/p/how-i-scaled-ma…

How I scaled Machine Learning to a Billion dollars: Strategy

Rent The Runway is valued at $700 million and is well on its way to be a unicorn. I joined in 2012 to bring data science to then, a small…

medium.com

Saurabh Bhatnagar

Saurabh Bhatnagar @analyticsaurabh

The good news is that in 5-10 years we will have British Spring

Jack

@tracewoodgrains

it really sucks to be small and powerless and to have a bunch of distant suits decide that cutting you off from the people and places you appreciate is the right move for your own good

Saurabh Bhatnagar

Saurabh Bhatnagar @analyticsaurabh

Jun 14

Amazing And the same core idea as 200M funded startup but on commodity hardware Expect more innovation here

Fabio Guzman

@FGuzmanAI

Jun 13

56,000 tokens/sec at just 80 MHz. 🤯 I burned a full Transformer with KV cache into a custom chip. Designed gate by gate as a 100% digital integrated circuit. Prototyped on a FPGA. (No GPU. No CPU) Just pure digital silicon running @karpathy microGPT, spelling out names on a tiny LCD. This is GateGPT 👇

0:24

141

Saurabh Bhatnagar

Saurabh Bhatnagar @analyticsaurabh

Jun 14

Dimes square

0:09

Saurabh Bhatnagar

Saurabh Bhatnagar @analyticsaurabh

Jun 14

Saurabh Bhatnagar

Saurabh Bhatnagar @analyticsaurabh

Jun 14

You’re being made a fool But why? It’s Amazon, right! Don’t have have investments in Ant?

Matthew Berman

@MatthewBerman

Jun 13

Wait the jailbreak technique is…asking the model to review a codebase???

Saurabh Bhatnagar

Saurabh Bhatnagar @analyticsaurabh

Jun 13

It’s more than that If they can’t sell the improving model outputs then they are forced to compete (harder) on non ai things Popping the bubble and making them more entrenched. But don’t think for a second Ant didn’t anticipate this

Gergely Orosz

@GergelyOrosz

Jun 13

Well this is massive: - Every non-US company sees how they can be cut off from US vendors with a snap of a finger. Non-US vendors getting free advertisement - What does this mean for US AI labs’ offices and employees outside the US? Reads pretty draconian What a mess

Saurabh Bhatnagar

Saurabh Bhatnagar @analyticsaurabh

Jun 12

A bit more than observability Also all execution All thinking tokens Building something here

John Suh

@john_ssuh

Jun 11

Increasingly, I believe companies may need to be rebuilt from the ground up, where you have a single timeline of all observability product metrics file changes laid out in a retrievable system, like Datadog Posthog Google Drive Slack (really unified filesystem of Claude Code chats Codex chats). This might be the new data foundation for any and all companies to maximize AI. Needs to be rebuilt because keeping track of diffs on existing system basically impossible to produce longitudinal information on decisions and rollbacks, something coding agent storage companies are actively trying to figure out, but this should extend to businesses as a whole. Highly skeptical existing businesses will adopt this though because it means overhauling everything about their instrumentation and business data, but I think businesses built on this foundation probably can execute 100x better and faster

Ben Cohen

Saurabh Bhatnagar retweeted

Ben Cohen

@blc_16

Jun 11

Resisting the urge to: > Start AI for Enterprise Co > Name it Aurelion Labs > Get two LOIs from my Uncle Larry’s old fraternity brothers > Raise $50M seed pre-revenue > Hire an FDE army of recent CS grads who can’t get software engineer jobs > Have them install OpenClaw to automate email-to-JIRA-ticket workflows > Add OpenAI and Anthropic logos to our partner wall because we use their models > Steal an unknown researcher’s paper and release it as a blog from our internal AGI lab > Sign my Uncle’s friends’ enterprises to 3-year ramp-up contracts with 12-month opt-outs and recognize the terminal year as ARR > Vaguepost about our success on LinkedIn until we get 5 more ramp-up contracts > Start having FDEs label their own computer-use data > Bribe a Hayes Valley realtor for Anthropic and OpenAI researchers’ contacts > Start selling them our FDEs’ computer-use data to triple revenue > Raise $100M Series A > Immediately sell secondaries > Beg Sam and Dario to acquire the company to “expand their Forward Deployed Operations” > Get acquired and escape the permanent underclass Alas, I will resist the urge and remain overseeing my 100 agents that are one loop away from discovering AGI

394

41,975

Saurabh Bhatnagar

Saurabh Bhatnagar @analyticsaurabh

Jun 11

We will have an AI VC when it autonomously funds Fabrice Bellard’s next side project

Saurabh Bhatnagar

Saurabh Bhatnagar @analyticsaurabh

Jun 11

It’s important to understand that @ShaneLegg likely means rollouts here not RAG

Dwarkesh Patel

@dwarkesh_sp

Jun 10

DeepMind cofounder Shane Legg thinks that search is essential for a model to be genuinely creative. Pre-trained base models can do incredible things. But Shane thinks this is just a matter of them mixing together existing concepts from their training data. If he's right, coming up with genuinely novel ideas always involves searching a large space for "hidden gems".

0:52

Nick Frosst

Saurabh Bhatnagar retweeted

Nick Frosst

@nickfrosst

Jun 9

this model is the opposite of mythos. Its small, cost effective, apache 2.0, and locally deployable. This is the way LLMs should go. small, open source, transparent and sovereign vs large, expensive, proprietary and hegemonic

Cohere

@cohere

Jun 9

Introducing Cohere's first open-source coding model: North Mini Code Small & efficient, designed for agentic performance and built for community input.

6:10

1,332

163,103

Saurabh Bhatnagar

Saurabh Bhatnagar @analyticsaurabh

Jun 10

One weird behavior It really likes going upstream and checking other repos Like ~/work/…/another_repo Need to think more seriously about sandboxing

Saurabh Bhatnagar

Saurabh Bhatnagar @analyticsaurabh

Jun 10

" - Edge case handled: if the duplicate arrives while the original is still mid-pipeline, the linked asset mirrors its status and flips to ready automatically when the original finishes" the thing is, I never asked about edge case, or sampled fingerprint First model I respect

Saurabh Bhatnagar

Saurabh Bhatnagar @analyticsaurabh

Jun 10

Ha This is lost of most VCs because they are chasing API wrapper not ML people

🧙🏼‍♂️Dustin Freeman @dustinfreeman

Jun 9

I’ve gotten “Won’t AI solve [what you just pitched me]?” And I’m like, yeah, I’m describing how I’ll be the one to do that.

Jeremy Howard

Saurabh Bhatnagar retweeted

Jeremy Howard

@jeremyphoward

Jun 8

Also, those who focus on using AI to help improve the skills of themselves and their teams will be diamonds in great demand, since they will be the rare A players in a sea of mediocrity.

martin_casado

@martin_casado

Jun 8

There will be an extreme irony if these models really are bound by human generated training data. RL doesn't generalize and is only useful in a handful of areas. And we all loose our skills to something that'll forever be a B player.

195

25,483

Luca Soldaini 🎀

Saurabh Bhatnagar retweeted

Luca Soldaini 🎀

@soldni

Jun 7

the most talented people I ever worked with OBSESSIVELY look at the data. literally #1 skill. steer clear of those who don’t…

Gergely Orosz

@GergelyOrosz

Jun 6

Just learned: Software engineers used to do manual data labeling at Scale AI while Alex Wang was CEO. After he left, new leadership joined, and were HORRIFIED to learn this. Stopped it ASAP Now at Meta, software engineers are assigned manual data labeling... see the pattern?

916

121,872

Saurabh Bhatnagar

Saurabh Bhatnagar @analyticsaurabh

Jun 7

RT @suchenzang: if your bread-and-butter consists solely of: - tuning hyperparams/config files - fitting points on a log-log plot - twea…

Saurabh Bhatnagar

Saurabh Bhatnagar @analyticsaurabh

Jun 6

Absolutely hate entitled engineers who don’t look at the data and think they can just slap a model run on top The data is the work

Gergely Orosz

@GergelyOrosz

Jun 6

125

Saurabh Bhatnagar

Saurabh Bhatnagar @analyticsaurabh

May 29

If you are a @polynoamial follower This should disgust you

Mitchell Hashimoto

Saurabh Bhatnagar retweeted

Mitchell Hashimoto

@mitchellh

May 28

I've got an agent in a loop optimizing a renderer with the goal to minimize frame times (and tests to measure). It got times down from 88ms to 2ms and allocations down from ~150K to 500. Sounds good, right? Wrong. This is exactly why agent psychosis is a big fucking problem. As an experiment, I rewrote the Ghostty core render state in Go, with access to identically laid out data structures as Ghostty and the exact same validation tests. I made a purposely naive renderer (simple, correct, but slow). 88ms per frame with 150,000 allocations (horrendous, lol)! I then kickstarted a Ralph loop to bring the frame times down. I told it it can't modify input data structures or the public API or tests (they're correct), but it can do anything else it wants. It got to work. It has worked for about 4 hours. I've spent around $350 on this experiment so far. The results? 88ms => 1.5ms 150K allocs => ~500 allocs Incredible right? Nope. My hand-written renderer I ported has frame times (same benchmark) of ~20us (0.020ms) and 0 allocations in the update path. This is the problem with psychosis and lacking systems understanding. If you don't understand the system, you're going to accept that this is an incredible result. If you understand the system, you'll see better solutions immediately and can do roughly 75x better on throughput. The people who blindly trust agent output are in the former camp. They're sheeple, overdrinking from a fountain of mediocrity. Standard disclaimer: I use AI all the time. I like AI. The point I'm making is to not blindly accept results. Think. Analyze. Learn.

308

980

8,938

791,636