Chip Huyen

Tweets

Pinned Tweet

Chip Huyen

@chipro

7 Jan 2025

My 8000-word note on agents: huyenchip.com//2025/01/07/ag…

Covering:
1. An overview of agents
2. How the capability of an AI-powered agent is determined by the set of tools it has access to and its capability for planning
3. How to select the best set of tools for your agent
4. Whether LLMs can plan and how to augment a model’s capability for planning
5. Agent’s failure modes

AI-powered agents are an emerging field with no established theoretical frameworks for defining, developing, and evaluating them. This post is a best-effort attempt to build a framework from the existing literature, but it will evolve as the field does. As always, feedback is much appreciated!

Agents

Foundation models enable many new application interfaces, but one that has especially grown in popularity is the conversational interface, such as with chatbots and assistants. The conversational...

huyenchip.com

451

2,810

376,991

Chip Huyen

@chipro

Jun 5

bored at the airport so i made this killedbygpt.com

18,496

Chip Huyen

@chipro

Apr 23

Congrats to @AlecRad @Luke_Metz and @soumithchintala! It's really cool to see work produced by a group of 20-somethings without a single PhD between them winning this award. I hope to see these three collaborate again one day :)

ICLR @iclr_conf

Apr 23

We are honored to announce the Test of Time awards for #ICLR2026 🏆 This award recognizes papers published 10 years ago at ICLR 2016 that have had a lasting impact on the field: blog.iclr.cc/2026/04/22/anno…

107

16,385

Chip Huyen retweeted

NeetCode

@neetcode1

Mar 23

Got to meet the wonderful Chip Huyen @chipro She’s so nice and smart!!

849

40,959

Chip Huyen retweeted

Pamela Fox @pamelafox

Mar 19

We have some sweeet speakers at #MSBuild in SF this year, June 2-3:
- @chipro: author of my fav AI eng book
- @simonw: author of my fav blog
- @swyx: creator of AI engineer confs
- @steipete: creator of OpenClaw
- michael chiang: co-founder at @ollama
- even more

aka.ms/build26

Microsoft Build

Go deep on real code and real systems with the teams building and scaling AI at Microsoft Build, June 2–3, 2026, in San Francisco and online.

build.microsoft.com

7,839

Chip Huyen retweeted

Karina

@karinanguyen

Mar 11

Excited to release PostTrainBench v1.0! This benchmark evaluates the ability of frontier AI agents to post-train language models in a simplified setting. We believe this is a first step toward tracking progress in recursive self-improvement 🧵:

[Video, 0:44]

724

174,594

Chip Huyen

@chipro

Feb 11

How long do you think it will be before AI can fully automate your job?

11% Already automated

41% <= 3 years

29% > 3 years

19% Never

2,306 votes • Final results

25,584

Chip Huyen

@chipro

Feb 11

For those that chose Never, what do you do?

23,462

Chip Huyen

@chipro

Jan 29

I built GoodAIList.com to help me stay up to date with new AI stuff. It's tracking 14K open source repos so far, with contributions from over 145K developers.

Every day, it:
- searches for new AI repos (based on 123 keywords and topics)
- surfaces repos that are gaining traction, and
- categorizes each repo

The annotations are done by AI so they are not super accurate, but they've helped me find some useful stuff. It also lets me see where the contributors are, so when I travel, I can find folks doing cool stuff in a new city or country.

228

15,611

Chip Huyen

@chipro

Jan 29

Comparing the AI open source ecosystem in the US, China, Europe, and the rest of Asia over time

[Video, 0:46]

9,536

Chip Huyen

@chipro

Jan 14

Super impressed by the projects at the Agentic Hackathon last weekend! Many teams worked on really hard/important problems:
* Long-running tasks: memory management, recovering from mid-task failures, and maintaining consistency across steps and sub-agents
* Adaptive retrieval from multiple sources: databases, search indices, and websites
* Agents that work with voice, video, and even 3D environments

If you are in SF, come check out the finalist demos tomorrow! luma.com/6bd4bt9j

There will be talks by Douglas Eck, who is doing amazing work with Veo and Imagen, and by many other awesome folks.

Thanks @MongoDB and @cerebral_valley for hosting and for letting me serve as a judge for these fantastic projects.

10,515

Chip Huyen

@chipro

4 Dec 2025

OMG there's a bookstore dedicated to technical books in Taipei!

2,074

240,644

Chip Huyen

@chipro

14 Nov 2025

Sam!!!

Sam Altman

@sama

14 Nov 2025

Small-but-happy win: If you tell ChatGPT not to use em-dashes in your custom instructions, it finally does what it's supposed to do!

190

192

10,379

1,472,119

Chip Huyen

@chipro

30 Oct 2025

After years of following @lennysan's wonderful takes on product, I finally had the opportunity to chat with him about AI products! youtube.com/watch?v=qbvY0dQg…

1. Many AI product problems aren’t because of AI. They’re usually because of user experience, data quality, or organizational structure. One chatbot failed to get traction because its target users simply couldn’t type (their hands were usually busy -- taking care of kids or driving), so showing pre-populated questions and adding a voice option significantly improved traction. Another team told me their lead scoring model was broken. It turned out that the marketing team wasn’t asking the right questions to get data. The biggest product improvements still come from understanding your users, preparing your data, and investing in your team!

2. Senior engineers see the most productivity improvement with AI coding because they have more experience with writing design docs and API specs, which helps them write better instructions. However, they’re also more resistant to using AI for coding. Senior folks are often more opinionated and get frustrated easily when AI doesn’t do what they want.

3. Many teams spend a lot of time debating which tool to use, which can be counter-productive. When teams ask me which of 2 tools to use, I usually ask 2 questions:
“How much performance improvement will the optimal tool give over the less optimal one?” --> If the improvement is small, then spend less time debating.
“How hard is it to change from one tool to another once you’ve adopted it?” --> If the tool is new and not yet battle tested, I’d think twice about adopting something that I can’t get out of later.

4. Many people know that the most effective way to learn AI is to build with AI. Yet, people keep asking me: “But what should I build?” We seem to be having an “idea crisis”. We have all these wonderful tools to help us build things, and no idea what to build. An exercise I often recommend is to spend a week noticing what frustrates you in your daily work, then build small tools to solve those specific pain points.

AI Engineering 101 with Chip Huyen (Nvidia, Stanford, Netflix)

Chip Huyen is a core developer on Nvidia’s Nemo platform, a former ...

youtube.com

221

40,753

Chip Huyen

@chipro

22 Oct 2025

A hiring manager just told me that it's a red flag 🚩 if a software engineering candidate hasn't experimented with vibe coding. Thoughts?

42% yes, candidate is 🚩🚩🚩

32% no, interviewer is 🚩🚩🚩

26% hmmm

814 votes • Final results

17,455

Chip Huyen

@chipro

3 Aug 2025

$100 for anyone who can show me how to get ChatGPT to stop using emdashes. it's driving me insane

1,313

220

8,840

3,843,506

Chip Huyen

@chipro

4 Aug 2025

this seems to have reached a corner of the internet that my innocent soul wasn't ready for.

"why do you hate em dashes so much?"
this isn't about punctuation. this is about getting AI to follow simple instructions

"what model was it?"
4o

"why not use a thinking model? it works fine on o3"
i don't want to have to use an expensive, slow model just to fix some typos. there's also a limit for o3 usage.

"just add the instruction to exclude em dash to every message"
yes, i can, but we shouldn't have to

"it's not that hard to remove the em dashes yourself"
not the point

"write your own words lol"
also not the point

165

46,707

Chip Huyen

@chipro

30 Jul 2025

13 years. 6 books. All metrics are flawed, but it still makes me happy to see this. I love writing, and I hope to improve at it over time.

467

25,196

Chip Huyen

@chipro

24 Jul 2025

Very useful tips on tool use and memory from Manus's context engineering blog post. Key takeaways:

1. Reversible compact summary
Most models allow 128K context, which can easily fill up after a few turns when working with data like PDFs or web pages. When the context gets full, they have to compact it. It’s important to compact the context in a way that’s reversible. E.g., removing the content of a file or web page is reversible as long as the path/URL is kept.
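A minimal sketch of the reversible compaction idea described above, not Manus's actual implementation. The `ContextItem` type, the `compact` and `render` helpers, the oldest-first eviction order, and the character-based budget (standing in for a token budget) are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ContextItem:
    # Hypothetical record for one piece of context (a file or a web page).
    source: str             # path or URL; always kept so the content can be re-fetched
    content: Optional[str]  # None once the item has been compacted


def context_size(items: List[ContextItem]) -> int:
    """Total characters of content currently held in the context."""
    return sum(len(i.content) for i in items if i.content is not None)


def compact(items: List[ContextItem], budget_chars: int) -> List[ContextItem]:
    """Drop content from the earliest items until the context fits the budget.

    Only the content is dropped; the source path/URL is kept, so the step
    can be undone by reading the source again.
    """
    total = context_size(items)
    result = []
    for item in items:  # assumes items are ordered oldest first
        if total > budget_chars and item.content is not None:
            total -= len(item.content)
            result.append(ContextItem(item.source, None))
        else:
            result.append(item)
    return result


def render(item: ContextItem) -> str:
    """Text shown to the model: full content, or a pointer back to the source."""
    if item.content is None:
        return f"[content omitted; re-read from {item.source}]"
    return item.content
```

Because every compacted item still carries its `source`, an agent with a file-read or fetch tool could restore the content on demand, which is what makes this kind of compaction reversible.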

697

82,841

Chip Huyen

@chipro

24 Jul 2025

3. Dynamic few-shot prompting
They cautioned against using the traditional few-shot prompting for agents. Seeing the same few examples again and again will cause the agent to overfit to these examples. E.g., if you ask the agent to process a batch of 20 resumes, and one example in the prompt visits the job description, the agent might visit the same job description 20 times for these 20 resumes. Their solution is to introduce small structured variations each time an example is used: different phrasing, minor noise in formatting, etc.
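A rough sketch of what "small structured variations" could look like in code, assuming examples are simple (input, output) pairs. The `TEMPLATES` list and the `build_prompt` helper are hypothetical names, and varying which examples are picked and in what order is an extra assumption on top of the phrasing and formatting changes the thread mentions.

```python
import random
from typing import List, Tuple

# Hypothetical formatting variants for the same example content.
TEMPLATES = [
    "Input: {inp}\nOutput: {out}",
    "Q: {inp}\nA: {out}",
    "### Example\n{inp}\n=> {out}",
]


def build_prompt(
    task: str,
    examples: List[Tuple[str, str]],
    k: int,
    rng: random.Random,
) -> str:
    """Build a few-shot prompt whose examples vary from call to call."""
    # Pick k examples in a random order, so the selection and order change.
    chosen = rng.sample(examples, k)
    # Render each example with a randomly chosen template (formatting noise).
    blocks = [rng.choice(TEMPLATES).format(inp=inp, out=out) for inp, out in chosen]
    return "\n\n".join(blocks) + "\n\nTask: " + task
```

Calling `build_prompt` once per resume with a shared `random.Random` instance would give each of the 20 resumes a slightly different prompt, so the agent sees less of a fixed pattern to copy.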

14,588

Chip Huyen

@chipro

24 Jul 2025

Manus AI chose to focus on context engineering rather than developing models. If you were to start an agentic company today, which would you invest in?

16% Post-training (eg RL)

48% Context engineering

31% Both

5% Other

357 votes • Final results

12,019