Joined May 2012
528 Photos and videos
3 weeks ago we open-sourced HALO this led to talking with dozens of teams running agents at scale we realized the current agent monitoring tools aren't built for the future that we so clearly see ahead of us today weโ€™re releasing native OpenTelemetry-compatible agent tracing on @inference_net, powered by the same open-source core behind HALO
Weโ€™re introducing HALO ๐Ÿ˜‡ Hierarchal Agent Loop Optimizer HALO is an RLM-based agent optimization technique capable of recursively self-improving agents by analyzing their execution traces and suggesting changes. This work is inspired by the Mismanaged Genius Hypothesis proposed by @a1zhang and @lateinteraction earlier this month. tldr; we improved performance on AppWorld (Sonnet 4.6) from 73.7 --> 89.5 ( 15.8) by giving HALO-RLM access to harness trace data and asking it to identify issues. The feedback from HALO surfaced failures in the harness such as hallucinated tool calls, redundant arguments in tools, refusal loops, and semantic correctness issues. Each issue mapped cleanly to a direct prompt update. We then fed these finding into Cursor (Opus 4.6), and asked the coding agent to update the underlying harness. We repeated this trace -> HALO-RLM analysis -> code update loop until the score plateaued. Today weโ€™re open-sourcing the core HALO-RLM framework, evals, and data for further review.
11
21
109
21,343
Kind of interesting to see FF trying to kingmake given the Girardian / Thielian literature on kings and scapegoats. a dangerous game to be playing right now with broader tech sentiment being how it is
210
Sometimes my non-technical friends forget they're non-technical and start talking to me like this
9
418
Sam Hogan ๐Ÿ‡บ๐Ÿ‡ธ retweeted
If youโ€™re a software engineer worried about AI eating your job, become the person who can deploy, customize, evaluate, and operate *****open-source models***** inside companies. Organizations are finally optimizing for AI cost, privacy, and control and many will want this capability in-house.
31
48
580
31,834
To redeem: Sign up -> Team settings -> Billing -> Wallet Enter these values and click "Add funds"
1
2
552
call Schematron like any other OpenAI endpoint Pass in your json schema raw html Get back structured JSON Full guide: docs.inference.net/workhorseโ€ฆ
1
4
511
Schematron volume is now 4M requests per day, 10s of billions of tokens. Growing 10% WoW it's the best HTML-to-JSON model for apps/agents sonnet 4.5 quality at 9b model prices. crazy fast Get $50 in free Schematron credit with code EXTRACT on @inference_net. only 100 available
5
3
24
2,653
How does the US Government even notify a company of something like that? what if they call up Dario and heโ€™s busy playing video games with his sister
1
8
696
bunx @inference/sdk instrument
This is what I wanted agent observability to feel like. Traces that end in a code change.
7
862
Sam Hogan ๐Ÿ‡บ๐Ÿ‡ธ retweeted
19
4
569
67,082
FAFO
Get paid to wait The Claude Code spinner might be the most watched line on Earth. So I turned it into an ad marketplace. Advertisers bid on it. You keep 50% of the money. Install the extension โ†’ get cash from ads. Introducing Kickbacks
2
8
2,128
Type of guy who knows how to use a loop
1
303
Anthropic is going to make a trillion dollars per minute with this thing
2
10
1,160
Sam Hogan ๐Ÿ‡บ๐Ÿ‡ธ retweeted
Specialized models are becoming a practical path to better AI UX. Olive moved from a frontier model to a custom model trained with Inference Catalyst for their food verdict workflow. After a user scans a product, the model now delivers near-instant verdicts on what to watch out for, making the in-store experience faster and more seamless while cutting inference cost significantly. Results: - p50 latency: 2,721ms โ†’ 591ms - p99 latency: 6,414ms โ†’ 998ms - time to first word: ~0.25s - inference cost: ~70% lower Great working with @oliveholistic on this! Full case study here: inference.net/case-study/oliโ€ฆ
3
5
9
1,178
Had a great time sitting down with @compliantvc to have a very serious conversation about @inference_net, startup culture, and all things compliance
I sat down with @samhogan He raised $11M to build private AI infrastructure. Then he asked for another $11M. He says companies should stop sending data to OpenAI. Naturally, I asked about SOC 2. He had one. This was upsetting. I prefer when founders are easier to prosecute.
1
3
15
1,779
Startup idea: Retatrutide blow darts. We have to start forcibly lowering the obesity rate.
15
2,351
Sam Hogan ๐Ÿ‡บ๐Ÿ‡ธ retweeted
It is so energizing to be online on Sunday afternoon, and Slack is just humming with the whole team online. Not because they have to be, but because they just wanna ship!
5
2
47
10,624
If youโ€™re building an agent and want a self-improvement loop to prompt your coding agent that โ€œjust worksโ€ out of the box do yourself a favor and read the HALO GitHub works with every agent framework and coding agent. 100% free and open source github.com/context-labs/halo
Hereโ€™s your monthly reminder that you shouldnโ€™t be prompting coding agents anymore. You should be designing loops that prompt your agents.
1
10
1,826