Yapay zekanın açık kaynaklı olmasını ve hiçbir yapı tarafından kontrol edilememesini sağlamak için çalışıyoruz. @SentientEco @OpenAGISummit

Joined June 2025
33 Photos and videos
Pinned Tweet
Resmi @SentientAGI Türkiye X hesabımız artık sizinle! Sentient ile ilgili güncel haberler, blog yazıları ve çok daha fazlasını paylaşacağımız hesabımızı takip etmeyi unutmayın.
27
3
60
8,309
Sentient Türkiye retweeted
This Spark was too bright to scroll past ✨ Congratulations @Fr0oZi, you're our Spark of the Month! Check out some of his best work ↓
8
2
25
1,079
Sentient Türkiye retweeted
Jun 13

4
12
1,179
Sentient Türkiye retweeted
Open source ai models: 1. Accesible 2. Decentralized 3. Customizable 4. Private if hosted locally 5. Everyone contributes 6. Cannot be banned
12
4
27
819
Sentient ekibi yılın en aykırı ajan makalesini yayımladı. Microsoft, Alibaba ve Google'ın hepsi EvoSkill, Çoklu Ajan Sistemleri için Otomatik Beceri Keşfi makalesine atıfta bulunuyor. Aşağıdaki makaleyi oku ve EvoSkill'i GitHub'dan kur: github.com/sentient-agi/EvoS…
Jun 13
Sentient team published the most heretical agents paper of the year. Microsoft, Alibaba, and Google all cite EvoSkill: Automated Skill Discovery for Multi-Agent Systems. Sentient researchers and Virginia Tech co-authors open by showing a Claude agent confidently botching a Treasury question and only get more unhinged from there. They demand the AI industry abandon its biggest obsession, hand-crafted skills. Right now, everyone from solo devs to AI labs assumes hand-crafting is the only path. An expert studies the domain, then writes the playbook for the machine. Sentient argues that this entire workflow is a manual illusion. Agents do not need "human" teachers. They are flexible general-purpose machines, held back simply by the domain expertise we never gave them. We only think skills must be written by hand because we are completely blind to the millions of lessons hiding inside the failures we never show them. Which brings us to the Treasury argument. Claude with Opus 4.5 is one of the greatest coding agents in history. But dropped into 89,000 pages of Treasury data, where humans average 50 minutes per question? It is fundamentally lost. Our belief that frontier agents are "ready" out of the box is pure benchmark bias. They aren't objectively prepared. They're just better than the rest, which are hopelessly worse at it. Sentient says we need to stop writing playbooks for our machines. Instead, they propose a new loop: EvoSkill. Automated Skill Discovery. Instead of trying to author every skill with our flawed, time-limited human hands, we need to embrace self-evolution. EvoSkill is about the speed of learning from failure. It is a system where one agent fails a task, a second autopsies the failure trace, and a third writes a brand new skill to patch the gap. The model stays completely frozen. More importantly, the skills it discovers are designed to transfer where hand-crafted ones never could. Things like exploding 12 points on noisy web search. Or jumping 7 points on Treasury reasoning. Or transferring zero-shot to a benchmark it never saw, and still gaining 5. The entire AI industry is obsessed with humans writing playbooks in our own image. Sentient's paper is a brutal wake-up call.
1
3
230
Sentient Türkiye retweeted
Welcome @SentientAGI to the Dataline Launch Partner cohort. When ROMA's agents act on crypto data, Dataline is the call: spot, perp funding, prediction markets in one structured response, ready for the Planner / Executor / Verifier loop. One integration. All markets.
11
82
291
2,322
Sentient Türkiye retweeted
Introducing our next batch of Sentient Sparks ✨
17
5
41
1,429
Sentient Türkiye retweeted
Jun 12

2
4
15
1,221
Sentient Türkiye retweeted
Jun 12
What's new at Sentient? 1/ Catch up on everything you missed this week 👇
3
3
19
424
Sentient Türkiye retweeted
Open-source AI makes transparency the default, so no single monolith can dictate access, research, or innovation. Say no to the black box. That’s how everyone wins.
We’re rolling out changes to make Fable 5’s safeguards for frontier LLM development visible. Starting this week, flagged requests will visibly fall back to Opus 4.8—the same as our safeguards for cyber and bio. You will see this every time it happens. On the API, any flagged requests will return a reason for their refusal (coming to server-side fallback in the next few days). We wanted to deploy Fable 5 to our users quickly and safely. Visible safeguards can be probed, so they have to be robust, which takes time to get right. Invisible safeguards can be targeted more narrowly, allowing us to ship quickly with very few false positives. We went with invisible safeguards for this reason—and that was the wrong tradeoff. You should have visibility into the safeguards we have in place, and why. We’re sorry for not getting the balance right. Making the safeguards visible makes them easier to work around, so keeping them robust to jailbreaks will unfortunately mean more false positives while we improve the classifiers. We're also tuning our bio and cyber classifiers to trigger less often on harmless requests. We know this is frustrating and we’ll do our best to keep this period as short as possible. If you think a request has been mistakenly flagged: run /feedback in Claude Code, click thumbs-down on the fallback in Claude.ai or Cowork, or file the safeguard appeal form for API requests. Your reports help us tune these classifiers and we appreciate your feedback. support.claude.com/en/articl…
4
11
47
6,484
Sentient Türkiye retweeted
In Microsoft Research's new SkillOpt paper, EvoSkill is named the “strongest harness-side competitor” tested, and the closest system to their own method when run inside Codex and Claude Code agent loops. The biggest labs in AI are paying attention, and @salahalzubi401 and the Sentient AI research team are the reason why.
33
15
106
29,710
Sentient Türkiye retweeted
Arena Challenge 0 is now public! 🏆 $6,000 in prizes MiniMax credits 🗓️ May 20 - June 22, 2026 Built on @databricks' enterprise OfficeQA benchmark, the Grounded Reasoning Challenge is now open for everyone.
50
100
155
53,149
Sentient Türkiye retweeted
Jun 5
What's new at Sentient? Catch up on everything you missed this week 👇
5
4
20
1,558
Sentient Türkiye retweeted
PewDiePie builds a private version of ChatGPT, Fortune 500 companies join forces to advance open-source AI, and new tools emerge to level up your AI workflow. Stay ahead of the curve ↓
2
4
23
3,199
Sentient Türkiye retweeted
What does our Co-Founder @hstyagi fear about AI? His answer is exactly why open-source AI matters ↓
6
6
21
14,840
Harbor entegrasyonu EvoSkill v.1.2.0 ile yayında.
Harbor integration is live with EvoSkill v.1.2.0 Harbor is a framework for evaluating AI agents against containerized benchmark tasks. It gives EvoSkill access to evolve agents against a registry of 190 datasets — including benchmarks like SWE-bench Verified, Terminal-Bench 2.0, and Aider Polyglot. Here’s what it means for automated agent evolution ↓
17
Sentient Türkiye retweeted
A new ally is stepping into the Arena: @MiniMax_AI $6K prize pool AI credits sponsored by Minimax
112
12
128
17,332
Sentient Türkiye retweeted
Arena: Challenge 0 pre-reg opens to the public soon. Before you pre-register, meet our runner-up Leon Liu (@0xHermes_). He used Sentient's EvoSkill framework as the foundation for his methodology, discovering agent skills from failed trajectories to score 71.5% on OfficeQA with a 10B parameter model. His work proves the method works, but EvoSkill is what makes it scale ↓
63
17
92
10,919
Sentient Türkiye retweeted
Two builders. One debate. Zero filter. Arena Debates drops soon ↓
44
43
142
10,112
Sentient Türkiye retweeted
Arena Debates Episode 1 is LIVE 🎙️ Cohort 0 builders Cat (@hypecatv2) and Jessup go head-to-head on Open vs Closed Source AI. Who won? Watch and tell us your pick ↓
28
8
88
10,985