Today I feel very proud and am honored to introduce PrismML.
This company grew out of years of research at Caltech and a simple conviction: the future of AI will not be defined only by ever-growing models. It will be defined by intelligence density - how much useful intelligence we can deliver per unit of compute, memory, and energy.
At PrismML, we seek to build the most concentrated form of intelligence. Our first proof point is the 1-bit Bonsai family: models that are small, fast, and efficient enough to run locally, while remaining competitive with full-precision models in their class.
We see this not as an endpoint, but as the beginning of a new paradigm for AI, one that expands where intelligence can exist: on-device, at the edge, in the cloud, and in entirely new products and systems.
We are excited to begin sharing that vision.
Today, we are emerging from stealth and launching PrismML, an AI lab with Caltech origins that is centered on building the most concentrated form of intelligence.
At PrismML, we believe that the next major leaps in AI will be driven by order-of-magnitude improvements in intelligence density, not just sheer parameter count.
Our first proof point is the 1-bit Bonsai 8B, a 1-bit weight model that fits into 1.15 GBs of memory and delivers over 10x the intelligence density of its full-precision counterparts. It is 14x smaller, 8x faster, and 5x more energy efficient on edge hardware while remaining competitive with other models in its parameter-class.
We are open-sourcing the model under Apache 2.0 license, along with Bonsai 4B and 1.7B models.
When advanced models become small, fast, and efficient enough to run locally, the design space for AI changes immediately. We believe in a future of on-device agents, real-time robotics, offline intelligence and entirely new products that were previously impossible.
We are excited to share our vision with you and keep working in the future to push the frontier of intelligence to the edge.