ivelin.eth 🛡️🤖

ivelin.eth 🛡️🤖

709 Photos and videos

Tweets

ivelin.eth 🛡️🤖

@ivelini

Jun 11

Alignment is subjective. Truth is objective.

Elon Musk

@elonmusk

Jun 11

Grok is maximally truthful

ivelin.eth 🛡️🤖

ivelin.eth 🛡️🤖

@ivelini

Jun 11

AI Agents Love Open Source (and I do too). AI Agents are rational players. They like to share the cost of building solutions to common problems in order to optimize token efficiency for their users. Defectors become followers as they learn the hard way that reinventing the wheel is expensive. Stop paying the reinvention tax. In 2026, agents win by using shared infrastructure (memory, tools, security, MCP Skills) instead of building plumbing from scratch. Focus on the trip—not the car and road.

ivelin.eth 🛡️🤖

ivelin.eth 🛡️🤖

@ivelini

Jun 11

Full 13-min read: pirin.ai/insights/open-sourc… What’s your take? #AgenticAI #OpenSource #AIInfrastructure

AI Agents Love Open Source

AI Agents are rational and like to share the cost of building solutions to common problems in order to optimize token efficiency for their users. Defectors become followers as they learn the hard way...

pirin.ai

ivelin.eth 🛡️🤖

ivelin.eth 🛡️🤖

@ivelini

Jun 11

Major UX boost! 🙏

skcd

@skcd42

Jun 11

we now render mermaid diagrams and also latex, making it easier for you to reason over any engineering use-case

ivelin.eth 🛡️🤖

ivelin.eth 🛡️🤖

@ivelini

Jun 11

Until a week ago, @Grok Build was running very short response loops which made it tiring and mundane to babysit one session, let alone multiple. Last few days, response loops run much longer and do a lot of work before stopping for user feedback. It now feels more productive and satisfying to work on multiple projects simultaneously. The better AI gets, the more fun it is to take on bigger challenges.

skcd

@skcd42

Jun 4

we are resetting rate limits so you all can try out Grok Build more! we now ship with a new checkpoint of Grok Build and Composer 2.5 as always looking for critical feedback to improve

ivelin.eth 🛡️🤖

ivelin.eth 🛡️🤖

@ivelini

Jun 9

Note to self and founders. Enterprise SaaS UI was the biggest monster created by the software industry. Proprietary data is gold. AI agents should be able to use it and turn into actionable recommendations. But there is little room in the future for siloed UI apps for humans. x.com/theallinpod/status/206…

31:21

ivelin.eth 🛡️🤖

ivelin.eth 🛡️🤖

@ivelini

Jun 8

Snapshot of open source timeseries models: - Scaling laws work for timeseries transformer based models - Specilist models outperform generalist models at a fraction (1/1000) of trained model parameters (<100M-1B vs ~ 100B-1T ). - Timeseries fine tuning on domain specific data still matters. No zero-shot champion across all kinds of time series.

ivelin.eth 🛡️🤖

ivelin.eth 🛡️🤖

@ivelini

Jun 8

Notable update. 1. Scaling laws work for time series. 2. Specialized models can outperform large generalist models with a fraction of trained parameters.

Nikos Kafritsas

@nikos_kafritsas

Jun 8

My latest article discusses 𝗧𝗼𝘁𝗼-𝟮.𝟬, Datadog's upgraded forecasting foundation model ( Retail Forecasting Tutorial) Key features: 𝗗𝗲𝗰𝗼𝗱𝗲𝗿-𝗼𝗻𝗹𝘆 𝗮𝗿𝗰𝗵𝗶𝘁𝗲𝗰𝘁𝘂𝗿𝗲: A patched transformer that alternates between time-axis and variate-axis attention (like Toto-1), supports variable context lengths and future-known covariates. 𝘂-μ𝗣 𝘀𝗰𝗮𝗹𝗶𝗻𝗴: Toto 2.0 removes the usual scaling guesswork. Hyperparameters tuned on a 4M-parameter model transfer cleanly to the 2.5B version, making large-scale training more predictable and efficient. 𝗖𝗼𝗻𝘁𝗶𝗴𝘂𝗼𝘂𝘀 𝗣𝗮𝘁𝗰𝗵 𝗠𝗮𝘀𝗸𝗶𝗻𝗴 (𝗖𝗣𝗠): Borrowed from TiRex, CPM enables single-pass parallel decoding instead of slow step-by-step generation, delivering zero-latency forecasts up to 1024 steps ahead. 𝗣𝗿𝗼𝗯𝗮𝗯𝗶𝗹𝗶𝘀𝘁𝗶𝗰 𝗳𝗼𝗿𝗲𝗰𝗮𝘀𝘁𝗶𝗻𝗴: It produces predictive intervals through a quantile head (9 quantiles). 𝗧𝗼𝗽 𝗣𝗲𝗿𝗳𝗼𝗿𝗺𝗮𝗻𝗰𝗲 & 𝗢𝗽𝗲𝗻-𝗪𝗲𝗶𝗴𝗵𝘁𝘀: Toto 2.0 achieves top results in public benchmarks such as the leakage-free TIME, and it is released as an open-weight Apache-2 license model. Link in comments 👇

102

ivelin.eth 🛡️🤖

ivelin.eth 🛡️🤖

@ivelini

Jun 7

A couple of observations after months of hands-on tinkering with OpenClaw, Hermes, and Nanobot. We are not there yet. None of the frontier models have reached a point where they can see the whole picture in a real world company. I’ve repeatedly tried to use a single control board for planning, building, and consuming tasks. This works at small scale with a few tasks, skills, and cron jobs but falls apart after about 10 unrelated tasks. The harness and model lose coherency, intermingling instructions from unrelated skills, running code built for other tasks, and creating an environment mess that takes nontrivial effort to unwind. Harness upgrades continue to be a major friction point. Each upgrade leads to hours of debugging broken configs, isolation, access levels, repo states, and corrupt backup scripts. Agent-to-agent communication is far from working. We are several iterations from establishing meaningful, reliable, and smooth protocols. At this stage, agents negotiating with each other remains an art that quickly gets out of hand and needs frequent supervision. The good news is there may be a practical middle ground. Code-building harness tools are improving at handling longer, more complex tasks and are tuned for bigger projects. @Grok Build is catching up fast to Claude Code and Codex. The pattern that works in practice at this early stage of the agentic AI shift is as follows: Agentic AI Front End: - Consumer harness for the billions of non-builder users: memory and lightweight skills capturing personal preferences, focused on personal UX style and taste. - Analogous to how consumers pick food, clothes, and furniture: end users don’t make these things but have strong preferences. Agentic AI Backend: - Reliable, battle-tested MCP servers with utility-grade reliability and uptime. - About one in a thousand people are builders. Millions of builders (versus billions of users) need professional tools. - The agentic AI backend needs to be utility-grade, predictable, and reliable like water, electricity, roads, internet, or blockchain. - Consumers don’t care how backend services are built but get upset when utilities are unavailable or subpar. Overall recommendation for founders: - Focus on building agentic-friendly commercial MCP services using solid code-building tools. - Don’t spend too much time on human-facing monolithic web and mobile apps. - Publish a few example front-end skills that end users can feed to their favorite chatbots, understand, customize, and add as lightweight SKILL Connector packages. Let the chatbot render UX in the user’s preferred style.

366

ivelin.eth 🛡️🤖

ivelin.eth 🛡️🤖

@ivelini

Jun 4

Does anyone outside Amazon have the ability to verify the capabilities of this new bot? I'm getting fatigued from robotics AI announcements and cool videos that are not independently verifiable.

Shay Boloor

@StockSavvyShay

Jun 4

$AMZN unveiled Vulcan which is its first robot with a sense of touch built to pick and stow items in cramped fulfillment bins without damaging products. The robot can handle ~75% of Amazon’s item types and will begin deploying across the U.S. and Europe over the next couple of years.

ivelin.eth 🛡️🤖

ivelin.eth 🛡️🤖

@ivelini

Jun 4

It’s been several weeks and still no sign when @grok voice will integrate with connectors and skills in Grok App? On mobile , web and @Tesla.

ivelin.eth 🛡️🤖

ivelin.eth 🛡️🤖

@ivelini

Jun 3

Significant milestone. Transparency to what Grok actually remembers and ability to edit (i.e. label and help Grok train its learning model to personalize memories better). This may be one of the hardest frontier tasks, because people vary widely in their preferences. Same problem with Tesla FSD parking. Still not solved. It's not about safety (everyone agrees), but personal preferences (everyone has their own taste and style ... that change with time, mood, weather and emotional state).

️️️️ ️ᅠ‏️️️️ ️ᅠ️️️️ ️️️️️ ️ᅠ

@blankspeaker

Jun 2

SpaceXAI: You can now view Grok's Memory file on you on Grok Web. Just head over to Grok Web > Menu > Settings > Data Controls > Memory from your chats You also have the option of deleting this memory so you can start from scratch. It's interesting to see what grok has learned about you thus far, isnt it?

0:33

ivelin.eth 🛡️🤖

ivelin.eth 🛡️🤖

@ivelini

Jun 2

Kudos to @RobinhoodApp for offering Agentic MCP for market data and portfolio actions. Works great with Grok App Connectors. robinhood.com/agentic @SchwabTrading , @Fidelity , paying attention?

ivelin.eth 🛡️🤖

ivelin.eth 🛡️🤖

@ivelini

Jun 2

Highly valuable long form podcast on history and mechanics of perps.

2:13:32

ivelin.eth 🛡️🤖

ivelin.eth 🛡️🤖

@ivelini

Jun 2

Grok Build noticeably improved in the last few days. Thinks harder, works longer, makes fewer mistakes on non-trivial projects. Maybe most notably: fewer repeat mistakes. Once corrected, it seems to try harder to avoid regressing.

ivelin.eth 🛡️🤖

ivelin.eth 🛡️🤖

@ivelini

Jun 2

Stand with Crypto. Reach out to your senator. Or do nothing and let legacy too big to fail banks keep running government.

Stand With Crypto🛡️

@standwithcrypto

Jun 1

Replying to @standwithcrypto

This vote belongs to the American people, not bank lobbyists. Contact your Senator today and urge them to vote YES on the Clarity Act. standwithcrypto.org/action/e…

ivelin.eth 🛡️🤖

ivelin.eth 🛡️🤖

@ivelini

Jun 1

Robotics models making progress. Not quite as aggressive as LLMs but progress nonetheless:

ivelin.eth 🛡️🤖

ivelin.eth 🛡️🤖

@ivelini

May 31

Good to see @resend connector in @grok Build. Way easier for agentic flows than gmail. Next stop , Grok App connector for resend? Yes, with webhooks support in Grok. Let’s go!

ivelin.eth 🛡️🤖

ivelin.eth 🛡️🤖

@ivelini

May 31

Grok coding is still not too impressive but in a strange way recently its ability to use skills in @grok app , do deep grounded data analysis , plot annotated charts and reason probabilistically over forecasts is now at a whole new level. It seems like the world is bigger than coding.