My startup, @positron_ai, just raised a $51.6M Series A to rebuild the infrastructure powering AI inference and bring Superintelligence to everyone.
Led by Valor Equity Partners (@valorep), Atreides Management (@Atreidesmgmt), and DFJ Growth (@dfjgrowth), the same teams behind SpaceX, Tesla, xAI, and other companies pushing compute to its physical and economic limits.
Why inference? Because as AI moves from research into production, the real bottleneck isn’t training—it’s deploying transformer models at scale. GPUs were built for flexibility, not predictable workloads. That mismatch means wasted bandwidth, poor density, and massive power costs.
So at Positron, we built silicon specifically optimized for inference:
•>90% memory bandwidth utilization (GPUs typically ~30%)
•Multi-model hosting per card (higher density, lower power)
•Zero code changes: drop-in compatibility with the HuggingFace Transformers ecosystem
•U.S.-made silicon, stable supply chain, geopolitically resilient
We’re already shipping and running in production environments today.
Here’s what we’ve built, with links to our WSJ coverage and press release in the follow-up reply ↓