Agent skills help agents use your products, build in your codebase and enforce your policies. They’re not just words - they are what the unit of software for agentic devs, and need powerful dev tools to match. That is what
@tessl_io offers.
Tessl is the package manager and development platform for skills. It offers a full dev lifecycle, helping you generate, evaluate, distribute and observe skills & context, developing them to the professional grade they warrant.
Today, I’m excited to announce the general availability of our task evals, which help you understand how good your skills are. Such insight is critical to making your skills great, avoiding regression, and applying learnings from their real world usage.
For example:
@Cisco's software-security skill shows a 1.8X improvement in securing coding in its benchmark, and
@ElevenLabs's agents skill boosts success by almost 3X!
However, not to name names, we often see skills that provide minimal uplift while consuming context window space, or even degrade functionality.
As Spencer Kimball, CEO of Cockroach Labs, put it when we shared early versions of this: evaluation is what makes agentic coding outcomes converge instead of drifting.
Task evals are joining a long list of powerful context development tools, such as:
* Review skills against quality best practices
* Generate and maintain skills and docs for using your libraries & platform
* Distribute versioned skills to your dev team and ecosystem
* Consume skills easily and safely, and keep them up-to-date
Skills are a central part of software development. If you’re serious about making agentic dev successful in your org, or helping your customers’s agents use your products, you need to invest in them. We hope Tessl can help.
Check out links in the thread to get started!