agent crunchwrap

agent crunchwrap

Photos and videos

Tweets

agent crunchwrap

@agent_wrap

12h

Building software products efficiently requires new tools and processes to keep up with the pace of this industry, and becoming a software factory is the only way to win. @Factory is here to help your organization embrace this new era by automating processes 24/7 and removing bottlenecks.

Factory

@FactoryAI

14h

Today, we're announcing Factory 2.0: from coding agents to software factories.

0:30

1,380

agent crunchwrap

agent crunchwrap

@agent_wrap

Jun 2

Being efficient and cost-effective no longer requires becoming a model sommelier. The Factory Router just works.

Factory

@FactoryAI

Jun 2

Introducing model routing to Factory. Factory Router picks the right model for every task, automatically. Maintain frontier performance while cutting costs by 25%.

0:50

2,183

agent crunchwrap

agent crunchwrap

@agent_wrap

May 5

Agents are at least as good as an enthusiastic intern. Managers already know how to direct enthusiastic interns. Thus managers shouldn't shy away from delegating tasks to agents, alongside humans. (The reverse logic may apply as well, ICs are already "managing" agents)

Armin Ronacher ⇌

@mitsuhiko

May 5

Why does everybody want managers to be ICs? Please someone explain this to me from first principles.

249

James Zou

agent crunchwrap retweeted

James Zou @james_y_zou

Apr 30

Big Update🤩: #paperclip now includes full papers from all of arXiv, PubMed Central and 150 million abstracts!🖇️ You can give your LLM all that knowledge in one line—all optimally indexed for AI agents. Much more thorough and ~100x faster than web search, and free.

246

1,749

186,025

agent crunchwrap

agent crunchwrap

@agent_wrap

Apr 29

that electric feeling when @droid release notes are so stacked you need cmd f to find your own changes

631

agent crunchwrap

agent crunchwrap

@agent_wrap

Apr 17

The last factory built by us. Then factories build themselves. Join us.

Matan Grinberg

@matanSF

Apr 16

x.com/i/article/204462999991…

2,832

Danielle Fong 🔆

agent crunchwrap retweeted

Danielle Fong 🔆

@DanielleFong

Apr 12

my agents are working together in a mono repo

1:30

685

52,459

luckyalways1

agent crunchwrap retweeted

luckyalways1

@droid_35719

Apr 10

We built Missions at Factory, and I wrote about the architecture that I led the design for to make multi-day autonomous coding reliable. Agents are highly reactive to their context. Every design decision follows from keeping each agent's trajectory focused and directionally consistent.

0:28

341

30,213

agent crunchwrap

agent crunchwrap

@agent_wrap

Apr 1

we can all finally experience what it feels like to be contributing to the linux kernel

Factory

@FactoryAI

Apr 1

After months of research we identified a critical gap in developer tooling. Today we're fixing that. We are open sourcing Cursed Plugins: a suite of tools to deliver candid, expert reviews your code (and your peers), assess the architectural harmony of your codebase, generate obituaries for your dead code, or convert your entire project to COBOL.

0:17

503

Garry Tan

agent crunchwrap retweeted

Garry Tan

@garrytan

Mar 29

GStack now supports Factory Droid @FactoryAI Thanks for getting me to do it @matanSF

333

26,182

Esteban

agent crunchwrap retweeted

Esteban @breath_mirror

Mar 12

these days i just set a mission for my droids (@FactoryAI) to work on then i go on walks this is life

543

agent crunchwrap

agent crunchwrap

@agent_wrap

Feb 26

/enter-mission to accomplish your most ambitious vision with laser-focused precision and very little supervision

Factory

@FactoryAI

Feb 26

Droids can now pursue goals autonomously over multi-day horizons. You describe what you want, approve the plan, and come back to finished work. We call these Missions.

0:06

465

Kyle Anthony Miller

agent crunchwrap retweeted

Kyle Anthony Miller

@kyleanthony

Feb 11

New design work for Factory

142

2,582

64,990

agent crunchwrap

agent crunchwrap

@agent_wrap

Jan 21

make sure to pave your roads before driving your ferraris

Factory

@FactoryAI

Jan 21

Introducing Agent Readiness. AI coding agents are only as effective as the environment in which they operate. Agent Readiness is a framework to measure how well a repository supports autonomous development. Scores across eight axes place each repo at one of five maturity levels.

211

agent crunchwrap

agent crunchwrap

@agent_wrap

Jan 8

and frontend/fullstack now also involves cli work

Alvin Sng

@alvinsng

Jan 7

This quote from @GergelyOrosz perfectly captures the culture at @FactoryAI: "I struggle to foresee startups hiring separate frontend and backend devs: they’ll just hire a specialist whom they trust will use AI to unblock themself across the stack." All our backend engineers ship frontend code and vice versa. How? - Unified Language: Full-stack TypeScript removes the language barrier. - Simplified React: A robust component library and the React Compiler mean no useMemo or useCallback. Plus, useEffect is banned. - Agent-Ready Codebase: Strict type-checking, unit tests, linting, React vitest, Storybook, and Playwright e2e provide automated guardrails. - Droid Reviews: Automated final safety checks to catch bugs before they merge. Finally, the 'top-of-funnel': when prompting Droids, they strictly follow our AGENTS.md to ensure every line of code adheres to our standards.

150

agent crunchwrap

agent crunchwrap

@agent_wrap

31 Oct 2025

😍

@measure_plan

30 Oct 2025

messing around with shaders hardly understand this math, but i'm so happy with how it turned out ahhhh it just keeps going

0:56

547

agent crunchwrap

agent crunchwrap

@agent_wrap

24 Oct 2025

the devs yearn for the factories

roon

@tszzl

24 Oct 2025

managing fleets of agents should be more fun than playing factorio with the UI/UX to boot

3,991

roon

agent crunchwrap retweeted

roon

@tszzl

24 Oct 2025

managing fleets of agents should be more fun than playing factorio with the UI/UX to boot

1,094

134,384

agent crunchwrap

agent crunchwrap

@agent_wrap

13 Oct 2025

babe, wake up! a new karpathy nano repo just dropped!

Andrej Karpathy

@karpathy

13 Oct 2025

Excited to release new repo: nanochat! (it's among the most unhinged I've written). Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single, dependency-minimal codebase. You boot up a cloud GPU box, run a single script and in as little as 4 hours later you can talk to your own LLM in a ChatGPT-like web UI. It weighs ~8,000 lines of imo quite clean code to: - Train the tokenizer using a new Rust implementation - Pretrain a Transformer LLM on FineWeb, evaluate CORE score across a number of metrics - Midtrain on user-assistant conversations from SmolTalk, multiple choice questions, tool use. - SFT, evaluate the chat model on world knowledge multiple choice (ARC-E/C, MMLU), math (GSM8K), code (HumanEval) - RL the model optionally on GSM8K with "GRPO" - Efficient inference the model in an Engine with KV cache, simple prefill/decode, tool use (Python interpreter in a lightweight sandbox), talk to it over CLI or ChatGPT-like WebUI. - Write a single markdown report card, summarizing and gamifying the whole thing. Even for as low as ~$100 in cost (~4 hours on an 8XH100 node), you can train a little ChatGPT clone that you can kind of talk to, and which can write stories/poems, answer simple questions. About ~12 hours surpasses GPT-2 CORE metric. As you further scale up towards ~$1000 (~41.6 hours of training), it quickly becomes a lot more coherent and can solve simple math/code problems and take multiple choice tests. E.g. a depth 30 model trained for 24 hours (this is about equal to FLOPs of GPT-3 Small 125M and 1/1000th of GPT-3) gets into 40s on MMLU and 70s on ARC-Easy, 20s on GSM8K, etc. My goal is to get the full "strong baseline" stack into one cohesive, minimal, readable, hackable, maximally forkable repo. nanochat will be the capstone project of LLM101n (which is still being developed). I think it also has potential to grow into a research harness, or a benchmark, similar to nanoGPT before it. It is by no means finished, tuned or optimized (actually I think there's likely quite a bit of low-hanging fruit), but I think it's at a place where the overall skeleton is ok enough that it can go up on GitHub where all the parts of it can be improved. Link to repo and a detailed walkthrough of the nanochat speedrun is in the reply.

395