Senior Generative AI Specialist at @awscloud | Helping startups & enterprises train large-scale models & optimize inferencing. Founders—DM to connect!

Joined March 2009
3,839 Photos and videos
Pinned Tweet
12 Mar 2022
What a future 🇹🇹 🇺🇸 MIT freshman looks like. Top of the class, Mandarin speaking, JavaScript coding, track & field ⭐️, and fashionista. Going to one of the top 10 middle schools in USA
30
166
2,792
Anton retweeted
15 Dec 2025
BREAKING: NVIDIA just dropped an open 30B model that beats GPT-OSS and Qwen3-30B — and runs 2.2–3.3× faster Nemotron 3 Nano: • Up to 1M-token context • MoE: 31.6B total params, 3.6B active • Best-in-class performance for SWE-Bench • Open weights training recipe redistributable datasets You can run the model locally with 24GB RAM.
81
313
2,645
207,056
28 Oct 2025
🚀 New blog: Building custom LLMs for public sector on AWS Learn how governments can develop national & domain-specific language models that meet sovereignty, compliance & cultural requirements. Full 6-stage development guide 👇
aws.amazon.com/blogs/publics… #AWS #AI #PublicSector #LLM#MachineLearning
2
292
Anton retweeted
1 Oct 2025
had a sudden urge to write a neural network from scratch in C using MLX
46
54
1,374
84,244
Anton retweeted
this 17 year old homeschooled girl refuted a conjecture that was unsolved for 40 years, and which professional mathematicians worked on for years without solving she was rejected from most graduate programs she applied to, because she did not have a degree
479
690
11,119
2,040,533
2 Oct 2025
Spent the last three days hacking DeepEP to work on EFA and I made it happen.
14
9,088
Anton retweeted
20 Sep 2025
lets you build C/C apps that run on Linux, Mac, Windows, BSD, and BIOS
7
74
818
35,988
Anton retweeted
24 Jul 2025
Learn how @nvidia Dynamo can be quickly setup & seamlessly deployed using Amazon EKS for automated scaling & simplified Kubernetes management ⚡💡🔧 NVIDIA Dynamo supports #AWS services such as Amazon S3, Amazon EFA & Amazon EKS. 👉 go.aws/3IK98YH
2
4
9
943
5 Sep 2025
Grateful to be featured on AWS for AI Podcast (Ep. 8)! 🎙️ I shared insights on training foundation models at scale my journey from 🇹🇹 to AWS. Would love if you could watch, like & comment to support! 🙌 youtu.be/i95xUdpy0qQ?si=3wum…
5
267
Anton retweeted
6 Aug 2025
First ever thermodynamic computer was put online internally today. Soon to be accessed by our first customers. So excited for things to come.
132
110
1,380
142,022
Anton retweeted
Today, we are releasing 4 hybrid reasoning models of sizes 70B, 109B MoE, 405B, 671B MoE under open license. These are some of the strongest LLMs in the world, and serve as a proof of concept for a novel AI paradigm - iterative self-improvement (AI systems improving themselves). The largest 671B MoE model is amongst the strongest open models in the world. It matches/exceeds the performance of the latest DeepSeek v3 and DeepSeek R1 models both, and approaches closed frontier models like o3 and Claude 4 Opus.
43
252
1,947
452,136
Anton retweeted
15 Jul 2025
lock in. no one else's gonna do it.
39
956
9,177
424,918
Anton retweeted
Verification and Validation in Computational Science and Engineering by Patrick J. Roache.
2
110
969
33,319
Anton retweeted
Probability Theory and Mathematical Statistics:
9
47
550
26,258
Anton retweeted
12 Jul 2025
I solved every single problem in the CUDA mode book. A quick thread summarizing this experience and what I learned 1/x
31
240
2,435
290,623
Anton retweeted
11 Jul 2025
“The Coatue team mapped out cloud revenue market share, Oracle at 5%, Amazon at 44% with AWS, Google 19%, Microsoft 30%. Next to that, they showed NVIDIA GPU allocation: Microsoft and Google match their cloud share. Amazon is 44% cloud revenue but only 20% GPUs. Oracle jumps from 5% to 19%. Cori comes out of nowhere at 11%.” “One obvious takeaway is that Amazon has half the share of GPUs than their share of AWS. So that could mean one of two things, either AWS is behind in AI, that could be one, or they're pursuing a different hardware strategy than its competitors.” — Bill Gurley @bgurley and Thomas Laffont @thomas_coatue
45
241
2,481
180,253
Anton retweeted
2
64
422
18,541
Anton retweeted
27 Jun 2025
🚨BREAKING: OPENAI IS RENTING GOOGLE’S TPUS TO HELP LOWER COSTS TO POWER CHATGPT
71
51
1,490
142,823
Anton retweeted
C 23 is not for toy-language coders. If you're playing with PHP, JS, Python, or Rust , stay in your lane. This is for real engineers with academic background.
227
165
3,435
390,672
Anton retweeted
The single most undervalued fact of linear algebra: matrices are graphs, and graphs are matrices. Encoding matrices as graphs is a cheat code, making complex behavior simple to study. Let me show you how!
114
1,154
11,274
1,015,673
21 Jun 2025
AI is getting out of hand. The USA and China comp is serious and I think it’s good for AI because the gap between us and other countries is huge
20 Jun 2025
A small Chinese startup dropped a video gen model that beats Google's Veo 3 in almost every test you throw at it. Generates accurate Physics. Does celebrity faces. ~100 ELO above Veo3 on the leaderboard Passes the gymnastics Turing test. AND Hailuo is only $8/mo vs $250/mo!
266