Div Garg

Div Garg

248 Photos and videos

Tweets

Pinned Tweet

Div Garg

@divgarg

Jan 1

x.com/i/article/200678461737…

493

230,854

Div Garg

Div Garg

@divgarg

Jun 12

Up to 96% lower AI inference latency on @Samsung's and @SKhynix's chips, with zero hardware changes. All of it came from the compiler. That's new research from @AdamA7741, who built it during his PhD with Prof. Aviral Shrivastava at MPS Lab at @ASU and is now on our on-device team. AI chips spend a surprising amount of their time and power moving data around, not doing compute. The industry's most promising fix is processing-near-memory (PNM): compute right where the data sits. @Qualcomm, @Samsung, @AnkerOfficial and @SKhynix are all building these chips. First ones ships this year. But hardware is only half the story. You also need a compiler, the software that translates an AI model into instructions the chip can run. Today every PNM compiler is tuned by hand for one chip and one task. Change either, and performance collapses. Adam's compiler, DPC, goes after exactly that: one compiler that works across any of these chips. It beat both vendors' own hand-tuned compilers on the architectures they were built for. We’re building off this approach at @agi_inc. Our agents run on phones, cars, and wearables, and on small devices memory and moving data are the main constraints. So we optimize the model's workload and memory use first, because those gains transfer from chip to chip. Cost models decide how the model maps to each device. Hardware-specific tuning comes last. Same playbook as DPC, running on today's silicon. It's how our models run well across very different chips, and they can run on a phone locally handling tasks most people assume would need a desktop. The silicon is coming. The software is the race.

3,008

Div Garg

Div Garg

@divgarg

Jun 12

Full writeup, with the benchmarks and where we think edge compute goes from here: theagi.company/blog/processi…

Up to 96% Faster Processing-Near-Memory, Without Touching the Hardware

theagi.company

1,299

Div Garg

Div Garg

@divgarg

May 21

Spent Tuesday watching @Google I/O thinking about a question. If you build on @Android, what just changed for you? Short answer: your screen belongs to Gemini now. After this week, Google owns the AI, the operating system, the browser, the app store, and a phone that competes with every device you ship. The data your users generate trains the same model. If you build hardware on that stack, your device starts to look like a user acquisition channel for Gemini. The screen you fought to win belongs to whoever Gemini decides to surface. Your users belong to whoever trains on their data. Android used to be open source. That was the antitrust defense. In practice, the GMS and Play Store is the lock-in. Ship Google's version of Android or lose the Play Store, and without it you do not have a viable phone. We have seen this play before: @Microsoft and Internet Explorer on @Windows, Google and @googlechrome on Android. Each time, the platform's interface layer became its distribution moat and the pattern repeats. The stakes are bigger now, because an assistant is a deeper relationship to the user than the browser. Every OEM and silicon partner we talk to wants the ability to choose what AI runs on their device, and the right to keep user data on the device. None of that exists by default in what shipped this week. So they will find it elsewhere. The initial point of Android was optionality for the user. That is what we are keeping alive. On-device action AI. Any screen. Any app. Any OS? More soon.

805

Div Garg

Div Garg

@divgarg

May 7

We started running monthly research sessions at the @agi_inc office. First one was last Friday on PNM compilation. If you do on-device research and want to come present, apply here: form.typeform.com/to/AZDHOWg… or rsvp to attend the next one on May 15: luma.com/ny79q2ow

Edge Cases by AGI, Inc. Application Form

Turn data collection into an experience with Typeform. Create beautiful online forms, surveys, quizzes, and so much more. Try it for FREE.

form.typeform.com

AGI, Inc.

@agi_inc

May 7

Last Friday we had 12 researchers at the office for the first of our monthly on-device research sessions. Topic: Processing Near Memory. One of our researchers presented PhD thesis work on automated compilation for in-memory processing units. PNM hardware has been moving fast. UPMEM is shipping, Samsung HBM-PIM is in production, but every team building on top still has to do partitioning and data movement by hand. That's the part keeping PNM from going mainstream. We keep the group small and bring in researchers actively working in the area. Topic changes month to month. Next session signup and a separate application to present your own work below.

770

Div Garg

Div Garg

@divgarg

Apr 26

If you want to build proactive agents on real hardware, come build with us today. We’re co-hosting the Proactive Agent Build Day at @agihouse_org Ambient agents. persistent memory. accessibility. @EvenRealities G2 smart glasses. i’ll be speaking alongside @richminer (Android co-founder), Will Wang (@EvenRealities CEO), @TobinSouth (@AnthropicAI /@Stanford ), @melissapan (@Berkeley SKY, ex-Google), @hamudinaanaa (@PortalDotAI) $1K G2 glasses per track winner (whole team)

7,947

Div Garg

Div Garg

@divgarg

Apr 26

Join here: app.agihouse.org/events/proa…

Proactive Agent Build Day

The next frontier in AI isn't a better model. It's an agent that's already with you — one that notices, remembers, and acts before you think to ask. On April 26, AGI House and Even Realities are brin

app.agihouse.org

568

Div Garg

Div Garg

@divgarg

Apr 22

"The phone's real interface layer is AI, not apps" - this is the thesis behind everything we're building. @tomsguide got the early look

AGI, Inc.

@agi_inc

Apr 21

@tomsguide just called our Android app "the future of Siri in iOS 27" Their managing editor watched our AI look at the screen, understand the interface, and book a ride in 15 seconds. No APIs. No cloud. The model sees what you see and taps what you'd tap. This is what on-device agents actually look like when they work.

1,107

Div Garg

Div Garg

@divgarg

Apr 15

We believe the future of intelligence is decentralized. Tonight we're opening the new @agi_inc office in SF with a dinner, gathering chipmakers, OEM partners, and investors to discuss what happens when inference moves on-device. Computer-use is already here. Mobile and hardware are next. This one's full. More coming. Founders, investors, platform execs building at the edge - DM me

10,345

Div Garg

Div Garg

@divgarg

Apr 15

Rsvp to be invited to our future events: luma.com/9zbd9pqq

Edge AI Dinner by AGI, Inc. · Luma

Computer use hit desktop. Mobile and hardware are next. We're inviting the people building and betting on the device layer to think about what this means for…

luma.com

802

Div Garg

Div Garg

@divgarg

Mar 14

Fun @agi_inc x @Qualcomm collab 🔥

Qualcomm

@Qualcomm

Mar 13

Straight from the floor at #MWC2026, join Qualcomm's Charlotte Mallo to see how agentic AI powered by @Snapdragon autonomously completes tasks for you, right on your personal device.

0:44

1,533

AGI, Inc.

Div Garg retweeted

AGI, Inc.

@agi_inc

Mar 11

At #MWC26, we showed what it looks like when a phone does what you ask with real apps. Built with @Lenovo as a proof of concept inside Lenovo Qira.

1,445

Div Garg

Div Garg

@divgarg

Mar 3

Excited to work together with @Qualcomm and push the boundaries of what's possible with on-device AI at @AGI_Inc 🚀 🚀

Qualcomm

@Qualcomm

Mar 3

Replying to @agi_inc @divgarg

This is the shift to on‑device agentic AI—efficient, secure, and scalable. 🤝

2,194

Qualcomm

Div Garg retweeted

Qualcomm

@Qualcomm

Mar 3

Replying to @agi_inc @divgarg

This is the shift to on‑device agentic AI—efficient, secure, and scalable. 🤝

3,101

AGI, Inc.

Div Garg retweeted

AGI, Inc.

@agi_inc

Mar 3

Most AI companies rent intelligence from the cloud. We're building it into the device. @agi_inc is collaborating with @Qualcomm to bring our agent stack to Snapdragon®-powered devices, an agent that sees your screen, understands context, and acts across any app. On-device. No cloud. No APIs. See it live at #MWC26 in Barcelona, Qualcomm booth, Hall 3 - 3E10

10,641

Marc Andreessen 🇺🇸

Div Garg retweeted

Marc Andreessen 🇺🇸

@pmarca

Feb 24

369

3,027

139,329

AGI, Inc.

Div Garg retweeted

AGI, Inc.

@agi_inc

Feb 24

We'll be at @MWCHub Barcelona next week. Booth 7G21. Live demos of our Android agent. You talk. It operates your phone. Any app. Fully autonomous. Come see it in person, it’s different when you watch it happen on a real phone in your hand.

1,380

AGI, Inc.

Div Garg retweeted

AGI, Inc.

@agi_inc

Feb 18

If you’re building at the frontier of on-device AI, you should be part of this discussion. Tomorrow, a handpicked group of researchers, founding engineers, and technical leaders pushing edge compute forward will gather for a private dinner in SF. A few seats just opened up. RSVP in comments.

5,905

Chrys Bader

Div Garg retweeted

Chrys Bader

@chrysb

Feb 15

welp, it was fun while it lasted folks

182

1,045

19,787

824,908

AGI, Inc.

Div Garg retweeted

AGI, Inc.

@agi_inc

Feb 13

We built the world’s first AI that can actually use your phone. Think Siri that actually works, fully on-device (booking flights, managing your calendar, sending messages, all done without you lifting a finger). We’re showing it off tonight at @southpkcommons Demo Night. @divgarg (founder & CEO, @agi_inc) will be there with live demos. If you’re building in AI or want to see what’s next, this is the conversation you need to be part of. Get in early: agi.app RSVP in comments.

8,545

Jason Warner

Div Garg retweeted

Jason Warner

@jasoncwarner

Feb 11

:taps the sign: x.com/jasoncwarner/status/17… This isn't about Cursor, so forget the name used. This is about what is happening in the world. Cursor, as I understand it, is finetuning chinese models so at least they realize what I'm about to say. Let's walk through this so we fully understand it. In the '90s a bunch of tech companies built out the internet. Those tech companies became critical infrastructure and were massively rewarded for it and became new tech giants. That infrastructure allowed a whole new type of company to exist, companies like Amazon and Google and eventually Facebook. And for companies like Microsoft to make a transition if they could see the future (folks like IBM and Oracle didn't see the future). In the '00s, those same companies built out new critical infrastructure called hyperscaler clouds which enabled a whole new generation of company to exist. Those hyperscalers became the most valuable companies in the world bc they controlled the most valuable commoditized asset on the planet at the time. And the new companies they enabled, the likes of AirBnB, Uber, GitHub, Shopify etc, became great companies in their own right...but nothing like the scale of AWS, Azure, GCP etc. Now it's happening again. Intelligence is becoming the new critical infrastructure upon which every company on the planet will build. It is enabling new types of companies to exist that couldn't before. And like the previous transitions, companies that see this transition can create and capture value in bold new ways. Right now it's down to a handful of companies. Google is the only full stack player: They have dirt, datacenters campuses, TPUs, GCP, and Gemini. By default, they lap the Amazon and Microsoft (don't get me started on Microsoft's continued fumbles here). They are 100% fully vertically integrated. When they got good at building Gemini, it all fell together for them for the next two decades. OpenAI, Anthropic (best independent lab on the planet and only investable frontier lab IMO), and perhaps xAI round out the frontier capable players. Poolside, Chinese labs (with Poolside, the most efficient labs in the world...see notes at bottom), and (perhaps) SSI will eventually become frontier capable. Those round out the list. Three things matter for the next decade: 1. Energy Infrastructure 2. Compute Infrastructure 3. Intelligence Infrastructure Most things after those are rounding errors. 1 2 = powered datacenter campuses 3 = frontier model providers or domain specific model providers 1 2 3 = full vertical integration At Poolside we have a saying which is "Everything collapses into the model" which means that eventually all things we think of as valuable become part of the models (and agents*) that get produced. *Note: first party agents from model providers will *always* be better than third party agents that come from non-model providers. This is simply a reality that modern agents and models are trained together. This really really means a large company producing an agent without backing of it's own model is especially vulnerable. Another way to think of it is that the surface area available to applications is exactly equal to the current capability gaps of this generation models. Each time models become more capable, they eat up more surface area available to third party applications *that are simply arbitraging the current model generation capability gap*. This doesn't mean third party apps aren't valuable, but it does mean if the only value third party apps have is the current capability gaps of models, those third party apps have diminishing value. This is not debatable in 2026. It arguably wasn't debatable when I wrote the post a full two years ago either, except people are really bad at living in an exponential. You know who isn't bad at living in an exponential? Jensen. Nvidia gets exactly what is happening while wall street futzes around with "capex buildout costs", Jensen knows exactly where every gigawatt in the world is going because he knows the future. Do you think you are smarter than Jensen? I know I'm not. How should non-model companies (and companies who are incapable of building models) behave in this moment? It's simple but not easy: Get good at building smaller but still capable models that push *your value prop*. Do not cede your ground to third party model providers hoping you can hold on and survive, you won't be able to with the fullness of time. The tech is just too powerful. If you are a company with a mid double digit billions market cap and above and you are not making models, you are default long-tail dying right now. It is your job to figure out how you don't die. "But Jason, what about data as a moat?" Data is valuable but if your *only* valuable asset is data, you are fighting a multi-pronged war. And if this is true it is even more paramount that you get good at training your own models. My god, this makes the whole thing more existential for you! "But Jason, it costs so much to build models! How can one compete with how much money the model providers raised?" Here's real dirty little secret of the AI industry. That's not real. epoch.ai/data-insights/opena… OpenAI 2024 compute spend above ^^. It cost them roughly $500m to build frontier model in 2024. But they spend $4.5b on RnD, which means, clusters for their researchers. The dirty little secret of the AI industry is there is no such thing as a gigawatt training cluster, there is no such thing as networking a million GPUs to train a model. The cost to train frontier AI is people with knowledge, a system capable of experimenting to find and iterate on model recipes, and $500m to train the final recipe. The first two are the hard part. The third one is just cost of doing business. And we wrote extensively about Poolside's way of building models which allows Poolside to do things that frontier labs do but at a fraction of the cost and fraction of the time. We call it the Model Factory and it's part of our secret weapons (along with our proprietary RL research): poolside.ai/blog/introducing… Every single company worth double digit billions not getting good at training their own model is the modern equivalent of saying "I can't possibly run a database as good as Oracle therefore I shouldn't try and just pay Oracle to do it" or "It costs too much for me to have engineers build and maintain our software, I'll just pay Accenture and Microsoft to do it". How many of us would build our forever homes (our companies) on two year leased back land (api call to model provider where we pass all our data to them)? I prefer to own the ground my home is built on. Get good at building your company specific models or look back in 10 years and realize you IBMed yourself.

ily⚡️

@0xIlyy

Feb 10

Remember when people said Cursor would win AI because "the model doesn’t matter, the product is where the money is"? Turns out that was completely wrong.

341

77,341