/sesh/null

/sesh/null

9 Photos and videos

Tweets

/sesh/null

@nerdsane

Exclusivity keeps a thing expensive, inclusivity makes the same thing affordable and therefore ubiquitous.

/sesh/null

/sesh/null

@nerdsane

Jun 10

Verification is definitely the bottleneck. We need more startups working on this problem to scale accountability with generation.

Sajid Mehmood

@smehmood

Jun 10

We're launching @niteshiftdev – the full-stack cloud for coding agents Verification is the new bottleneck. Software teams can now define their dev environment and verification tools once. Then run any frontier agent in the cloud: Claude Code, Codex, or OpenCode

1:45

Datadog Developers

/sesh/null retweeted

Datadog Developers

@datadogdevs

Jun 10

Now this is how you do a CEO panel 📸 @nerdsane (VP of AI @ @datadoghq) and Clémence J Burnichon (Sr Director Eng @ Datadog) took the stage with some of the best CEO’s in tech right now: @zachlloydtweets - @warpdotdev @dakshgup - @greptile @jayair - @opencode

532

Maxi

/sesh/null retweeted

Maxi

@maxirodgo

Jun 9

Developers developers developers @dakshgup @jayair @nerdsane @zachlloydtweets

241

Kai Xin Tai

/sesh/null retweeted

Kai Xin Tai

@kaixin_tai

Jun 9

crazy line out the door for @jayair (@opencode), @zachlloydtweets (@warpdotdev), and @dakshgup’s (@greptile) session at @datadoghq DASH on how coding agents is changing the SDLC

1,934

Datadog Developers

/sesh/null retweeted

Datadog Developers

@datadogdevs

Jun 1

TOMORROW - we're hosting our @Techweek_ by @a16z AI Rooftop event with @datadoghq x @vercel ✨ Speakers include: Director of Eng/AI - @diamondbishop VP, Observability and AI - @nerdsane Sr. Director, Eng - Andrey Sibirev (Vercel) Moderator: @MadsMcIlwain (Vercel) See if you can still snag a spot: partiful.com/e/NHLbgkXd64ICe… @vercel_dev

235

/sesh/null

/sesh/null

@nerdsane

Jun 1

Below is some serious work from the Datadog team and I’m impressed the magnitude they were able to concieve and achieve in the timeframe of a hackathon (few hours, single day). Also super happy to see our collective vision of Directed Software Evolution through our research projects like BitsEvolve and Temper showcased, with a clear demonstration of the importance of production observability as a feedback loop to achieve that. Looking forward to the detailed write up.

Arun Parthiban @ArunP76475

May 31

Participated in the Autoresearch systems hackathon in SF, hosted by Modal, OpenAI, Raindrop and Antler, along with Jai Menon and Pranav Garg. Our hypothesis was that by using Temper's governance and verification layers, and building tools on top of Temper, we could produce (1/8)

125

/sesh/null

/sesh/null

@nerdsane

May 27

I know there are some efforts to write more precise specifications in prose with llms, I think we can do better by making more of those specifications mathematically precise and observable. In other words, can the specification become part of the system (mechanically executable), not just an input to the LLM? If so, then those pieces would become observable artifacts. In that case now the LLM produces a formal, observable specification instead of only prose. The developer can audit or even edit that spec. Model check for consequences independently (than just models doing it). Helps Develop an operational mental model that we are losing with being distant with code generation. The spec can map more directly to runtime code. With Observability like @datadoghq still instrumenting the running system, it feeds production behavior back to the LLM and connect to the specs. So now, when something fails, the failure can trace back to the spec. I’m calling this paradigm “Higher Order Construction” with coding agents.

Ameet Talwalkar

/sesh/null retweeted

Ameet Talwalkar

@atalwalkar

May 20

We’ve released a technical report for Toto 2.0 detailing the data, architecture, training recipe, μP/u-μP hyperparameter transfer pipeline, and benchmark results behind our 5-model open-weight release. Report linked below.

Ameet Talwalkar

@atalwalkar

May 14

Today we’re releasing Toto 2.0: a family of open-weights time series foundation models spanning 4M to 2.5B parameters. The question we set out to answer was simple (yet previously open): Do time series foundation models get reliably better as they scale? Our answer: yes! 🧵

5,789

AJ Stuyvenberg

/sesh/null retweeted

AJ Stuyvenberg

@astuyve

May 19

NEW from Datadog: it's Lapdog! Ever wondered what your AI agent was actually doing? Our latest free project runs locally and traces reasoning and tool calls in Codex, Claude Code, and Pi. You can now see what your agent is REALLY doing, live: lapdog.datadoghq.com/

699

266,285

Othmane

/sesh/null retweeted

Othmane

@ThisIsOthmane

May 14

Scaling finally works for Time Series Foundation Models. Introducing Toto 2.0: open-weights TSFMs from 4M to 2.5B params, where every size beats the last from a single hyperparameter config. #1 on leading benchmarks: BOOM, GIFT-Eval, and TIME. Most TSFM families ship multiple sizes that all perform roughly the same. This one doesn't.

3,278

/sesh/null

/sesh/null

@nerdsane

May 10

The load-bearing frequency of ‘load-bearing’ in LLM discussions is becoming structurally load-bearing on my sanity

Datadog Developers

/sesh/null retweeted

Datadog Developers

@datadogdevs

May 7

“At Datadog, over the last four months, nearly 90% of engineers used coding agents for production work." - VP Observability Data, @nerdsane (@datadoghq) Our very own Sesh spoke at Code w/ @claudeai last night covering the instances in which the eng teams at Datadog are utilizing agents for production work. #codewithclaude #claude #claudecode @ClaudeDevs

107,717

arni

/sesh/null retweeted

arni

@arni0x9053

Apr 19

@AnthropicAI Claude Design is so fun! This release was so serendipitous because I just set up Katagami - a living design language library sourced and synthesized by agents based on rough ideas I wanna explore. You can download a spec from Katagami, upload it into Claude Design as a design system and start applying it to your project from there. I just tried it and it worked amazingly well. Can’t wait to use this more in my future projects.

1:18

arni

@arni0x9053

Apr 18

x.com/i/article/204546450295…

315

/sesh/null

/sesh/null

@nerdsane

Apr 12

Time for the universal machine tool for the software industrialization, that rebuilds from the SaaS-pocalypse .

Rhys

@RhysSullivan

Apr 11

we are entering the tool calling industrial revolution because of code mode

156

Rhys

/sesh/null retweeted

Rhys

@RhysSullivan

Apr 11

we are entering the tool calling industrial revolution because of code mode

105

7,760

Maxi

/sesh/null retweeted

Maxi

@maxirodgo

Apr 3

Are chatbots in SaaS apps dead? Chat is communication method, not a product. You can’t define “AI” or “bots” as chat. SaaS companies should think of shipping AI in two categories: 1. Autonomous: AI as a separate entity from the human 2. Assistant: AI as an extension of the human Autonomy: these are essentially background agents that go in loops. You can think of them as doing stuff recursively, kicking off on set triggers or (ideally) events it detects itself. The holy grail here is a background agents that can wake itself up to things you care about, make evaluations and drive its own loop for a long time with proper and only necessary context, execute, iterate, and ask for your input/notify you when it’s done. Key here is that the agent owns its own loop. Claws work really well here to help orchestrate and coordinate for subtasks with personality. Assistants: these are multi turn agents, that start reactively and triggers are defined at each turn. They tend to execute much more scoped tasks, but can still go off and explore and move recursively within a defined upfront instruction input. You play fetch with your assistant. The goal of autonomy is catch things you wouldn’t have caught, to be always-on, and to act as an independent colleague. The goal of assistants is to be your superpower, to help you run your defined workflows, and to execute on your commands. The easiest mode of communication for both is chat. Artifacts are helpful to digest both loops and turns. Our Assistant (Bits) is in Preview. And our next evolution of Autonomy is coming very soon…

Vignesh Palaniappan

@vigneshp_

Apr 3

We’ve launched Bits Assistant to help customers search and act across Datadog to resolve issues faster. Few examples below on how we see customers use it.

434

Diamond Bishop 🤖

/sesh/null retweeted

Diamond Bishop 🤖

@diamondbishop

Mar 26

x.com/i/article/203661871558…

7,324

Dylan Garcia

/sesh/null retweeted

Dylan Garcia

@_dylanga

Mar 19

The first thing I did at @tryramp was set up distributed tracing, structured logging, and metrics for Inspect, our background coding agent. We now have full visibility in to everything the system is doing: the browser, CF workers/DOs, @modal sandboxes, database calls, etc. Most importantly, Inspect now has visibility in to itself. It can self-triage runtime errors it encounters and create PRs to fix them. Every morning, it reviews the past 24 hours of its own @datadoghq dashboard, identifies systemic issues, new errors, and long tail latencies, and has a summary PR waiting for me at 9am.

522

72,023

Samuel Colvin

/sesh/null retweeted

Samuel Colvin

@samuelcolvin

Mar 14

Really enjoyed this conversation with @swyx. Hope you enjoy the podcast.

Latent.Space

@latentspacepod

Mar 14

⚡️Monty: the ultrafast Python interpreter, by Agents, for Agents, youtu.be/nxnQl4AcqFg so glad to catch up with @samuelcolvin of @pydantic !

3,052