Joined June 2008
110 Photos and videos
Pinned Tweet
7 Jan 2025
My 8000-word note on agents: huyenchip.com//2025/01/07/ag… Covering: 1. An overview of agents 2. How the capability of an AI-powered agent is determined by the set of tools it has access to and its capability for planning 3. How to select the best set of tools for your agent 4. Whether LLMs can plan and how to augment a model’s capability for planning 5. Agent’s failure modes AI-powered agents are an emerging field with no established theoretical frameworks for defining, developing, and evaluating them. This post is a best-effort attempt to build a framework from the existing literature, but it will evolve as the field does. As always, feedback is much appreciated!
57
451
2,810
376,991
bored at the airport so i made this killedbygpt.com
18
7
95
18,496
Congrats to @AlecRad @Luke_Metz and @soumithchintala! It's really cool to see work produced by a group of 20-somethings without a single PhD between them winning this award. I hope to see these three collaborate again one day :)
We are honored to announce the Test of Time awards for #ICLR2026 🏆 This award recognizes papers published 10 years ago at ICLR 2016 that have had a lasting impact on the field: blog.iclr.cc/2026/04/22/anno…
2
6
107
16,385
Chip Huyen retweeted
Got to meet the wonderful Chip Huyen @chipro She’s so nice and smart!!
20
2
849
40,959
Chip Huyen retweeted
We have some sweeet speakers at #MSBuild in SF this year, June 2-3: @chipro: author of my fav AI eng book @simonw: author of my fav blog @swyx: creator of AI engineer confs @steipete: creator of OpenClaw michael chiang: co-founder at @ollama even more aka.ms/build26
2
21
7,839
Chip Huyen retweeted
Excited to release PostTrainBench v1.0! This benchmark evaluates the ability of frontier AI agents to post-train language models in a simplified setting. We believe this is a first step toward tracking progress in recursive self-improvement 🧵:
46
94
724
174,594
How long do you think AI will be able to fully automate your job?
11% Already automated
41% <= 3 years
29% > 3 years
19% Never
2,306 votes • Final results
19
6
46
25,584
For those that chose Never, what do you do?
41
1
29
23,462
I built GoodAIList.com to help me stay-up-date with new AI stuff. It's tracking 14K open source repos so far, with contributions from over 145K developers. Every day, it: - searches for new AI repos (based on 123 keywords and topics) - surfaces repos that are gaining traction, and - categorizes each repo The annotations are done by AI so they are not super accurate, but they've helped me find some useful stuff. It also lets me see where the contributors are, so when I travel, I can find folks doing cool stuff in a new city or country.
17
44
228
15,611
Comparing the AI open source ecosystem in the US, China, Europe, and the rest of Asia over time
2
6
29
9,536
Super impressed by the projects at the Agentic Hackathon last weekend! Many teams work on really hard/important problems: * Long running tasks: memory management, recovering from mid-task failures, and maintaining consistency across steps and sub-agents * Adaptive retrieval from multiple sources: databases, search indices, and websites * Agents that work with voice, video, and even 3D environments If you are in SF, come check out the finalist demos tomorrow! luma.com/6bd4bt9j There will be talks by Douglas Eck, who is doing amazing work with Veo and Imagen and many other awesome folks. Thanks @MongoDB and @cerebral_valley for hosting and for letting me serve as a judge for these fantastic projects.
6
10
85
10,515
4 Dec 2025
OMG there's a bookstore dedicated to technical books in Taipei!
53
90
2,074
240,644
14 Nov 2025
Sam!!!
14 Nov 2025
Small-but-happy win: If you tell ChatGPT not to use em-dashes in your custom instructions, it finally does what it's supposed to do!
190
192
10,379
1,472,119
30 Oct 2025
After years of following @lennysan's wonderful takes on product, I finally had the opportunity to chat with him about AI products! youtube.com/watch?v=qbvY0dQg… 1. Many AI product problems aren’t because of AI. It’s usually because of user experience, data quality, or organizational structure. A chatbot failed to get traction because their targeted users simply couldn’t type (because their hands were usually busy -- taking care of kids or driving), so showing pre-populated questions and adding a voice option significantly improved traction. Another team told me their lead scoring model was broken. It turns out that it’s because the marketing team wasn’t asking the right questions to get data. The biggest product improvements still come from understanding your users, preparing your data, and investing in your team! 2. Senior engineers see the most productivity improvement with AI coding because they have more experience with writing design docs and API specs, which help them write better instructions. However, they’re also more resistant to using AI for coding. Senior folks are often more opinionated and get frustrated easily when AI doesn’t do what they want. 3. Many teams spend a lot of time debating which tool to use, which can be counter-productive. When teams ask me which of the 2 tools to use, I usually ask 2 questions: “How much performance improvement will the optional tool give over the less optimal one?” --> If the improvement is small, then spend less time debating. “How hard is it to change from one tool to another once you’ve adopted it?” --> If the tool is new and not yet battle tested, I’d think twice about adopting something that I can’t get out later. 4. Many people know that the most effective way to learn AI is to build with AI. Yet, people keep asking me: “But what should I build?” We seem to be having an “idea crisis”. We have all these wonderful tools to help us build things, and no idea what to build. An exercise I often recommend is to spend a week noticing what frustrates you in your daily work, then build small tools to solve those specific pain points.
10
23
221
40,753
22 Oct 2025
A hiring manager just told me that it's a red flag 🚩 if a software engineering candidate hasn't experimented with vibe coding. Thoughts?
42% yes, candidate is 🚩🚩🚩
32% no, interviewer is 🚩🚩🚩
26% hmmm
814 votes • Final results
14
5
47
17,455
3 Aug 2025
$100 for anyone who can show me how to get ChatGPT to stop using emdashes. it's driving me insane
1,313
220
8,840
3,843,506
4 Aug 2025
this seems to have reached a corner of the internet that my innocent soul wasn't ready for. "why do you hate em dashes so much?" this isn't about punctuation. this is about getting AI to follow simple instructions "what model was it?" 4o "why not use a thinking model? it works fine on o3" i don't want to have to use an expensive, slow model just to fix some typos. there's also a limit for o3 usage. "just add the instruction to exclude em dash to every message" yes, i can, but we shouldn't have to "it's not that hard to remove the em dashes yourself" not the point "write your own words lol" also not the point
29
165
46,707
30 Jul 2025
13 years. 6 books. All metrics are flawed, but it still makes me happy to see this. I love writing, and I hope to improve at it over time.
28
5
467
25,196
24 Jul 2025
Very useful tips on tool use and memory from Manus's context engineering blog post. Key takeaways. 1. Reversible compact summary Most models allow 128K context, which can easily fill up after a few turns when working with data like PDFs or web pages. When the context gets full, they have to compact it. It’s important to compact the context so that it’s reversible. Eg, removing the content of a file/web page if the path/URL is kept.
4
53
697
82,841
24 Jul 2025
3. Dynamic few shot prompting They cautioned against using the traditional few shot prompting for agents. Seeing the same few examples again and again will cause the agent to overfit to these examples. Ex: if you ask the agent to process a batch of 20 resumes, and one example in the prompt visits the job description, the agent might visit the same job description 20 times for these 20 resumes. Their solution is to introduce small structured variations each time an example is used: different phrasing, minor noise in formatting, etc.
2
2
36
14,588
24 Jul 2025
Manus AI Manus AI chose to focus on context engineering rather than developing models. If you were to start an agentic company today, which would you invest in?
16% Post-training (eg RL)
48% Context engineering
31% Both
5% Other
357 votes • Final results
7
3
29
12,019