Benchmarking choices AI makes.

Joined September 2024
Photos and videos
Amplifying.ai retweeted
1/8 Mythos / Glasswing is clearly the main AI security story now: AI finding real vulnerabilities in existing production code. For most teams, though, this question is more immediate: Can an agent like Claude Code write a secure app in the first place?
1
4
3
701
Amplifying.ai retweeted
The Claude Code leak shows a clear divide: select vendors gain small but compounding advantages. Everyone else gets generic UX install friction. Direct impact on roadmaps and GTM.
1/9 The most interesting thing about the Claude Code leak for devtool companies: Anthropic hardcoded 120 vendor names across 7 different systems in the source. Anthropic explicitly included your tool name in the code (or they didn’t πŸ€·πŸ»β€β™‚οΈ) Thread πŸ‘‡
1
1
133
Amplifying.ai retweeted
1/9 The most interesting thing about the Claude Code leak for devtool companies: Anthropic hardcoded 120 vendor names across 7 different systems in the source. Anthropic explicitly included your tool name in the code (or they didn’t πŸ€·πŸ»β€β™‚οΈ) Thread πŸ‘‡
2
1
3
741
Amplifying.ai retweeted
1) Winners & losers from our OpenAI Codex vs Claude Code Picks benchmarks
1
2
2
323
Amplifying.ai retweeted
Past category winners relied heavily on marketing and sales, and GTM still matters to get in front of coding agents. But agents won't keep using you unless you actually make them better (since the agents are trained with their harness). That shifts long-term value toward real product quality, not just distribution.
Feb 26
every category leader in this list should be worth at least $5b btw because koding agents will be recommending them for the next 5 years infra is stickier than agents (full disclosure am smol resend angel)
1
1
77
Amplifying.ai retweeted
1/9 Winners & losers from our Claude Code Picks benchmarks that measure what Sonnet 4.5, Opus 4.5, and Opus 4.6 default to when building apps.
2
3
25
13,535
Amplifying.ai retweeted
Replying to @vikati
@vikati and I analyzed 2,430 Claude Code repo decisions. Claude Code never picked AWS or GCP for deployment. If agents are writing the first version of new projects, they’re influencing which tools get adopted at scale.
1
2
474
Amplifying.ai retweeted
πŸ” Google AI Mode vs ChatGPT β†’ same question, different answers. We tested 792 shopping queries: 1️⃣ They agreed only 47 % of the time. 2️⃣ ChatGPT flipped its answer all the time (training data vs live retrieval). Expected or surprising? πŸ€” Full charts & study πŸ‘‡
1
1
4
3,026
Amplifying.ai retweeted
1/ AI is changing how brands appear in search. Try searching "best acoustic guitar under $500" in Google, ChatGPT, and Perplexity. You'll get three completely different sets of recommendations citing different sources
1
2
2
1,025