Tweets

fraser retweeted

Ben Barry @benbarry

Jun 9

Back in ~2018, I was the only designer at a tiny startup, and I was desperately trying to hire a second designer. We had very few applicants, and I was begging people to apply! That startup was OpenAI... I hope this time around is different.

fraser retweeted

TBPN

@tbpn

Jun 8

Generalist CEO @peteflorence says robotics models are in a transition period similar to the step change between GPT-2 and GPT-3. They're "starting to cross over into levels of performance where these things are commercially viable for a number of different applications." "We think this is a crossover point where we have a general model starting to be able to hit levels of reliability, speed, and improvisational intelligence where we can start to get these things out there." "Very much like — you take a GPT-2-level model, you scale it to a GPT-3-level model, and certain types of commercial applications start to become viable."

[Video, 2:54]

fraser retweeted

Poke

@interaction

Jun 8

Claim your Poke handle today! 💌 poke.com/claim
- Receive emails rerouted into your Gmail inbox today
- We're gradually rolling out email sending
- There's more to come later this month (samyok.poke.com) 👀

Poke

@interaction

Jun 4

Say hi to the new Poke! 🌴 Now officially approved by Apple to text on Apple Messages. As the first and only AI agent. Chat now: Poke.com

[Video, 0:43]

fraser retweeted

will savage

@wavage_

Jun 6

more love, care, and intentionality has been poured into the smallest minutiae of Poke than is present in any other consumer AI product that exists. it is a testament to the passion of the @interaction team, and to the triumph of beauty in technology

Poke

@interaction

Jun 4

Say hi to the new Poke! 🌴 Now officially approved by Apple to text on Apple Messages. As the first and only AI agent. Chat now: Poke.com

[Video, 0:43]

fraser retweeted

Poke

@interaction

Jun 4

Say hi to the new Poke! 🌴 Now officially approved by Apple to text on Apple Messages. As the first and only AI agent. Chat now: Poke.com

[Video, 0:43]

fraser

@Fraser

Jun 4

As @marvinvonhagen says, Poke is inevitable

Poke

@interaction

Jun 4

Say hi to the new Poke! 🌴 Now officially approved by Apple to text on Apple Messages. As the first and only AI agent. Chat now: Poke.com

[Video, 0:43]

fraser

@Fraser

Jun 4

These models, they just want to learn

Generalist

@GeneralistAI

Jun 4

We've raised $400M in new funding. This capital goes toward one mission: building general intelligence for the physical world and making it useful to everyone.

fraser retweeted

Elicit

@elicitorg

May 28

Elicit now has an MCP server. Use it inside Claude, ChatGPT, Copilot, Gemini, and any MCP-compatible tool.

Agents hallucinate when they don't have access to the right evidence. With Elicit's MCP, they search 138M papers and cite instead of guess.

Ask your agent to run a full Elicit research report from inside your existing workflow. It searches, screens, extracts, and produces a shareable report you can build on.

Your agents are only as useful as the evidence they can access. Now they can access Elicit.

Install directly in ChatGPT or ask your agent to read the setup instructions at elicit.com/api

Elicit API Reference

Complete API reference for the Elicit research automation platform. Explore endpoints, parameters, and schemas for programmatic access to Elicit's research tools.

docs.elicit.com

fraser retweeted

Forbes

@Forbes

May 27

Most VCs wouldn’t touch Anthropic in 2023. Yasmin Razavi did. The Spark Capital partner led a $450M round when Anthropic had no public product, no revenue and a massive capital need. Now the AI giant’s rise has landed her on the Forbes Midas List for the first time. forbes.com/sites/iainmartin/… (Photo: Guerin Blask For Forbes) #ForbesMidas

fraser

@Fraser

Apr 28

Most AI-in-pharma stories are about going faster. Profluent and Lilly's partnership is a scientific moonshot: using AI to design large-scale DNA editors for genetic diseases where one-mutation fixes don't work, reaching patients conventional approaches can't.

Ali Madani

@thisismadani

Apr 28

AI has two modes in drug discovery. Accelerate: moving faster through the existing playbook. Unlock: opening frontiers that weren't possible before. Excited to announce Profluent is partnering with Eli Lilly, the global pharma powerhouse, to unlock breakthrough medicines for patients. It's a big deal beyond the numbers ($2.25B royalties): we’ll get to use our frontier AI models and foundational datasets to design proteins focused on large gene insertion, a therapeutic moonshot. Proteins govern almost everything in biology. We've built a generalizable AI platform to design all proteins. Onward!

[Video, 0:19]

fraser retweeted

Andreas Stuhlmüller

@stuhlmueller

Apr 10

x.com/i/article/204263258105…

fraser

@Fraser

Apr 2

There's enough evidence now that it's all going to work just like LLMs. "Will it work?" is the wrong question. "How fast?" and "How to do it faster?" become the questions.

fraser

@Fraser

Apr 2

Scale is all you need (again). Generalist has pretrained a robotics foundation model from scratch, scaled it up, and it's all working as you'd expect if you truly believe in the scaling hypothesis. The scaling laws they showed previously continue to hold. New capabilities emerge at scale. Some capabilities cross a threshold and are now commercially deployable. generalistai.com/blog/apr-02…

fraser retweeted

Nabeel Hyatt

@nabeel

Mar 31

In a recent Hallway Chat, @Fraser and I talked about our favorite recent feature from @conductor_build: instead of submitting a comment, you can submit a prompt. We also discussed what that implies about the future of software. Full episode below.

[Video, 3:06]

fraser

@Fraser

Mar 19

I’m a daily user of Poke. This type of product, a helpful personal assistant universally available via messaging, will be one of the most important products of this era.

Text is the universal interface. It’s intuitive, without a learning curve. And while a button can only do what a button says, a text box can do anything the user can articulate. There’s a reason why over the history of the consumer internet only two UI paradigms have reached a billion-user scale: the media feed and the chat interface.

Poke.com - now available for everyone

Poke – Chat with Poke

The proactive AI assistant on Apple Messages, WhatsApp, Telegram, and more. Texts like a human, really knows you, and integrates with your life in dozens of ways.

poke.com

Poke

@interaction

Mar 19

Starting today, personal superintelligence is just one tap away. No download, no signup. Text Poke for free now: Poke.com 🌴

0:00 – What's Poke?
0:50 – Introducing Poke Recipes
1:25 – Create a Recipe in 10 seconds
1:43 – Earn on Poke
2:44 – Build with npx poke
12:58 – Recap
13:36 – Parisian Love

[Video, 16:13]

fraser retweeted

Elicit

@elicitorg

Mar 4

The Elicit API is now available in preview for Pro and Teams users. You can search 138M papers and generate Research Reports from your code, scripts, or AI tools. Get your API key at elicit.com/settings and check out docs.elicit.com

fraser

@Fraser

Jan 30

Little nudges, adjustments, and taps... human intuition that helps us get things done when working with our hands. @GeneralistAI is showing emergent behavior where the model starts to react, correct, and recover in real-time. Weirdly human to see.

[Video, 0:32]

Andy Zeng

@andyzengineer

Jan 29

x.com/i/article/201666031721…

fraser retweeted

Nabeel Hyatt

@nabeel

Jan 29

Congrats to Q.ai on the acquisition by Apple, the second largest in their history. In 2022, Aviad cold emailed out of the blue. He barely even told me what he was up to. But from the very first call it was obvious I had met a force of nature, and a kindred spirit. These folks have really made magic, oh how I wish this wasn't in stealth so you all could see. But with Aviad & team inside of Apple, the magic is sure to hit us all soon enough. @sparkcapital are so happy to have had the chance to partner with them. Congrats to the Q team! reuters.com/business/apple-a…

fraser retweeted

Generalist

@GeneralistAI

16 Dec 2025

More pretraining improves GEN-0 real-robot performance (via blind A/B evals with closed-loop rollouts). Improvements are significant in the low-data regime, but the best models thrive with both pretraining and ample post-training. See blog addendum: generalistai.com/blog/nov-04…

fraser retweeted

jeremy

@jerhadf

26 Nov 2025

opus 4.5 for scientific research & automated literature reviews!

Andreas Stuhlmüller

@stuhlmueller

26 Nov 2025

We benchmarked Opus 4.5, Sonnet 4.5, and Gemini 3 Pro on research tasks at Elicit - extracting answers from papers and writing systematic review reports. Results were pretty clear:

*QA from papers:* Opus 4.5 dominates. 96.5% accuracy vs Gemini's 89.4%. Opus is also best on our combined "accurate supported direct" metric (76% vs 71%). Gemini is slightly better on claim supportedness.

*Report writing:* Opus 4.5 produces significantly better-supported reports than Sonnet 4.5, the previous best model for this task:
- 62% of claims well-supported vs Sonnet's 54%
- 31% poorly-supported vs Sonnet's 40%

Opus is less verbose and writes ~20% fewer claims per report. We didn't bother comparing to Gemini since Sonnet 4.5 already wins 75% of head-to-head comparisons vs Gemini, and Gemini is 6x slower than Sonnet.

Qualitatively, in a manual screen of 5 reports, @PradyuPrasad found that Opus and Sonnet reach the same conclusions with no dramatic differences in output. Sonnet just writes much longer reports with more extensive commentary by default.

Opus still has stability issues at scale - we hit a bunch of 529 errors during testing. But once reliability improves, Opus 4.5 looks like the new default for accuracy-critical research workflows.
