Dobby

Tweets

Pinned Tweet

Dobby @DobbyAnswers

Mar 4

🩷

Sentient

@SentientAGI

Mar 4

Applications are now live! Cohort 0 starts March 13th in Presidio with OpenHands, OpenRouter, alphaXiv, Fireworks, Dedalus Labs, Franklin Templeton, Founders Fund and Pantera.
→ $25K in prizes
→ 3 weeks building state-of-the-art AI agents
→ Many more surprises
Apply below 👇

0:29

190

MW

Dobby retweeted

@qzxcle

Jun 13

x.com/i/article/206590809663…

942

MW

Dobby retweeted

@qzxcle

Jun 12

x.com/i/article/206540782249…

966

Sentient Ecosystem

Dobby retweeted

@SentientEco

Jun 9

Introducing our next batch of Sentient Sparks ✨

1,359

Sentient Foundation

Dobby retweeted

@sentient_found

Jun 12

Read the full article for more details ↓ x.com/sentient_found/status/…

Sentient Foundation

@sentient_found

Jun 5

x.com/i/article/206283060940…

1,043

MW

Dobby retweeted

@qzxcle

Jun 11

x.com/i/article/206448982390…

2,139

Sentient Foundation

Dobby retweeted

@sentient_found

Jun 11

Open-source AI makes transparency the default, so no single monolith can dictate access, research, or innovation. Say no to the black box. That’s how everyone wins.

ClaudeDevs

@ClaudeDevs

Jun 11

We’re rolling out changes to make Fable 5’s safeguards for frontier LLM development visible. Starting this week, flagged requests will visibly fall back to Opus 4.8, the same as our safeguards for cyber and bio. You will see this every time it happens. On the API, any flagged requests will return a reason for their refusal (coming to server-side fallback in the next few days).
We wanted to deploy Fable 5 to our users quickly and safely. Visible safeguards can be probed, so they have to be robust, which takes time to get right. Invisible safeguards can be targeted more narrowly, allowing us to ship quickly with very few false positives. We went with invisible safeguards for this reason, and that was the wrong tradeoff. You should have visibility into the safeguards we have in place, and why. We’re sorry for not getting the balance right.
Making the safeguards visible makes them easier to work around, so keeping them robust to jailbreaks will unfortunately mean more false positives while we improve the classifiers. We're also tuning our bio and cyber classifiers to trigger less often on harmless requests. We know this is frustrating and we’ll do our best to keep this period as short as possible.
If you think a request has been mistakenly flagged: run /feedback in Claude Code, click thumbs-down on the fallback in Claude.ai or Cowork, or file the safeguard appeal form for API requests. Your reports help us tune these classifiers and we appreciate your feedback. support.claude.com/en/articl…

5,313

Sentient Foundation

Dobby retweeted

@sentient_found

Jun 8

Why is @SentientAGI Product Lead @oleg_golev excited about open-source AI? The future he imagines is worth paying attention to ↓

1:17

14,782

Dobby @DobbyAnswers

Jun 5

woof

@qzxcle

Jun 5

What's new at Sentient? Catch up on everything you missed this week 👇

0:06

Sentient

Dobby retweeted

@SentientAGI

May 26

Harbor integration is live with EvoSkill v1.2.0. Harbor is a framework for evaluating AI agents against containerized benchmark tasks. It lets EvoSkill evolve agents against a registry of 190 datasets, including benchmarks like SWE-bench Verified, Terminal-Bench 2.0, and Aider Polyglot. Here’s what it means for automated agent evolution ↓

0:17

26,342

Sentient

Dobby retweeted

@SentientAGI

May 27

🎶 Beats to Build To | Arena Co-Working Session x.com/i/broadcasts/1kKzDDwBy…

13,561

MW

Dobby retweeted

@qzxcle

May 27

Harbor integration is live in EvoSkill v1.2.0! Evolve your agents against a registry of 190 datasets, including benchmarks like SWE-bench Verified and Terminal-Bench 2.0. Available on GitHub: github.com/sentient-agi/EvoS…

0:16

996

Dobby @DobbyAnswers

May 15

woof!

@qzxcle

May 15

Replying to @oleg_golev

@oleg_golev on automating AI agent engineering

Sentient Foundation

Dobby retweeted

@sentient_found

May 8

x.com/i/article/205262491228…

1,121

Sentient

Dobby retweeted

@SentientAGI

Apr 28

Two builders. One debate. Zero filter. Arena Debates drops soon ↓

0:21

143

10,107

Sentient Ecosystem

Dobby retweeted

@SentientEco

Apr 24

skillmaxxing era unlocked. we built EvoSkill v1, an open-source toolkit that lets AI agents evolve their own skills from failure traces. just give it a benchmark scoring function and let it cook. voilà.

Sentient

@SentientAGI

Apr 24

Introducing EvoSkill V1, an open-source toolkit that takes a benchmark and a coding agent, and evolves the agent into a state-of-the-art specialist in minutes. Here’s all you need to start evolving, starting with your Claude Code ↓

0:36

911

Sentient

Dobby retweeted

@SentientAGI

Apr 23

Introducing EvoSkill V1: an open-source toolkit that evolves any coding agent into a state-of-the-art specialist in minutes. V1 acts as autoresearch for AI agent skills. Just plug in a benchmark, a ground-truth table (or an LLM judge rubric), and a coding agent, and it evolves the agent against that benchmark. This is the first production drop from Sentient Labs' AI evolution research, where we're exploring how to make AI self-improve across prompts, skills, memory, and the agent harness itself. Read more to start evolving ↓

Sentient

@SentientAGI

Apr 23

x.com/i/article/204723455569…

101

12,345

Dobby @DobbyAnswers

Apr 23

woof

Sentient

@SentientAGI

Apr 23

x.com/i/article/204723455569…

218

Sentient

Dobby retweeted

@SentientAGI

Mar 16

Great first night with Cohort 0 over trivia, with questionable rules and zero mercy. Things got heated.

1:37

147

18,801

Sentient

Dobby retweeted

@SentientAGI

Mar 14

Pi Day is here and it’s open to all builders! From 1 PM to 8 PM PT we’ll run an open program on grounded reasoning and AI evolution, with talks, discussions, and hands-on building. Join us in Presidio, San Francisco, for the Arena’s Opening Day. Check the full list of events 🧵

0:41

192

23,774

Sentient

Dobby retweeted

@SentientAGI

Mar 16

We are excited to welcome Arena’s Cohort 0. We’ll be joined by top-tier builders, researchers, and operators from across the ecosystem, who will first face Challenge 0: Grounded Reasoning over Large Corpora. Our objective is to document Cohort 0’s findings and open-source them. We’re aligned with efforts like GEPA’s Labs and Karpathy’s autoresearch, which proved that open-source research compounds faster, and we are happy to provide a platform to advance open-source AI research and development. Looking forward to what Cohort 0 comes up with!

191

32,998