Bytez

Bytez

16 Photos and videos

Tweets

Pinned Tweet

Bytez @Bytez

9 Jun 2025

Run 100,000 AI Models for Free Build AI faster with the largest inference API on the internet. Instantly demo and deploy thousands of models through one unified API. Serverless inference. No infra or DevOps required. Free for developers. Try now👇

AI Model Hub

bytez.com

579

804,944

Bytez

Bytez @Bytez

Apr 6

How Bytez 0.1 works: Think of an exam. 1,000 students in the room — each one the top mind in their field. Lawyers, doctors, engineers, artists. Teacher asks a question. Everyone writes their answer. Our model gets to cheat. It sees all 1,000 answers. It figures out what kind of question was asked. Legal question? It pulls the top 3 legal minds' answers. Medical? Top 3 medical minds. Then it either combines their answers or picks the best one. Every other model gets 1-shot at the answer. Ours gets N-shots, from N experts. This is what we call a Web-Scale MoE. Each "expert" isn't a subnetwork inside a single model — it's an entirely separate model. As more experts show up on the web, our model gets smarter without retraining. The upside: it scores higher than Opus 4.6, Gemini 3.1 Pro, and GPT-5.4 across benchmarks. The downside: it behaves like a bigger model and costs more to run. We think the tradeoff is worth it. 1,000 minds wired together are smarter than any single mind. Is the path to AGI training one massive mind — or wiring together every mind that comes into existence?

147

Bytez

Bytez @Bytez

Apr 6

Gemma 4 Models Now Available On Bytez Hey all, as promised, we keep up with the latest and greatest in open source and closed source machine learning. The following models are now available: google/gemma-4-E2B-it google/gemma-4-E4B-it google/gemma-4-26B-A4B-it google/gemma-4-31B-it Models support the ability to understand text, image, audio, and video as context. Hit them either via the Bytez.js client, or via our chat/completions endpoint! May thy vibe harvest be fruitful, and may thy cup overfloweth with success! bytez.com

139

NeurIPS Conference

Bytez retweeted

NeurIPS Conference

@NeurIPSConf

Mar 30

The Position Paper Track is back at NeurIPS 2026 for the second year, with an expanded scope, and better alignment with the main and Evaluation and Dataset tracks! Head to the Call for Paper at neurips.cc/Conferences/2026/… for all the important dates and information and read our accompanying blog post at blog.neurips.cc/2026/03/30/w… to learn more about the changes we are making this year and how we adapted the process based on the feedback we got from the community! The submission deadline is the same as for the main and ED track: May 6, 2026 AoE. We are looking forward to read your papers and any feedback you may have!

111

26,348

Bytez

Bytez @Bytez

Apr 1

Bytez 0.1 update: more evals ran, matching or beating Opus 4.6, Gemini 3.1 Pro, and GPT-5.4 across benchmarks PS: the 0.1 model also does pro-level 3D generation What happens after models learn to generate 3D worlds?

Bytez

Bytez @Bytez

Apr 1

Early access coming soon Benchmarks: x.com/Bytez/status/203691465…

Bytez @Bytez

Mar 25

Bytez 0.1 beats Opus, Gemini Pro, and GPT-5.4 across benchmarks. Achieved without spending millions on GPUs. Karpathy recently said LLM ensembles are "under-explored." He vibe-coded a weekend prototype to test the idea. We've been building the production version for 2 years. Bytez 0.1 is a Web-Scale MoE — instead of training one massive model, we fuse thousands of models into a single intelligence. Instead of training intelligence, we absorb it. More benchmarks dropping soon. What's a faster path to a v1 of AGI? A) One massive model that tries to be an expert on everything B) Thousands of experts fused into one

Bytez

Bytez @Bytez

Mar 25

352

Bytez

Bytez @Bytez

Mar 25

→ Methodology: these are preliminary results. Running benchmarks repeatedly costs many thousands of dollars, so we sampled. We randomly shuffle each dataset and evaluate the first 100 questions. We then ask all 4 models the same questions and record their performance. To make the evaluation repeatable and replicable, we used a fixed random seed in shuffling and for each model. Our goal is anyone should be able to replicate these results → We're looking for anyone who wants to fund, build, or break this. Comment or DM me

Julien Chaumond

Bytez retweeted

Julien Chaumond

@julien_c

Mar 13

Dataset Editing has landed for Parquet Datasets on the HF Hub ✍️

0:23

14,317

a16z

Bytez retweeted

a16z

@a16z

Mar 13

.@illscience says the future of AI isn’t one model to rule them all—and explains why platforms that integrate multiple models will benefit the most: "I think we're going to need and rely on all of the models." "It's sort of like if you have a team of people... if you have five people, they could all do a basic set of things pretty capably." "But then they all have their specializations. Maybe one of them is really good at closing a customer who doesn't want to sign the deal, and one of them is really good at culture and getting the best out of the team." "There are some areas in which they are going to build apps, and that will be a threat to app companies. But there are many areas in which app companies are advantaged. Cursor and Krea are great examples of this—products where you benefit from being multi-model." "When you actually use a creative tool, you don't want to just use Nano Banana, you want to have access to OpenAI, Nano Banana, Kling—all of them—Qwen, you name it. So using a single interface to access all the models is powerful." Anish Acharya on BILLIONS with @GuillaumeMbh

2:14

162

33,041

Palatial

Bytez retweeted

Palatial

@PalatialSim

Mar 2

Last week, we launched Palatial PhysReady and the response blew us away. Over 100 companies signed up for our waitlist and the team had a blast watching everyone tagging @PalatialSim with their creative prompts. We generated over 100 assets in 1 day and below are a few of the highlights. We're looking forward to giving everyone access to the platform and API, wave 2 goes live on Wednesday! Sign up at palatial.cloud/join

0:44

1,695

Julien Chaumond

Bytez retweeted

Julien Chaumond

@julien_c

Feb 28

We don’t want to have to choose between 2 model providers We want to choose between 1,000s of model providers

258

28,477

Palatial

Bytez retweeted

Palatial

@PalatialSim

Feb 24

A child consumes more data in 1 month than any LLM has ever seen. Embodied agents learn by doing, but the data that teaches them is tactile, sensorial and causal. Such data does not exist. To make physical AGI possible, we need to generate this new data at an industrial scale. Enter Palatial: automated infrastructure that converts raw data into sensory rich playgrounds for robots to learn in. Today, we’re unveiling Palatial PhysReady, the first automated sim asset generator (try it ⬇️) [1/5]

2:15

276

58,077

Director Michael Kratsios

Bytez retweeted

Director Michael Kratsios

@mkratsios47

Feb 18

The future of AI is agentic, and America is leading the way to make it secure and interoperable. A new AI Agent Standards Initiative is launching this week @NIST to drive industry-led standards and open protocols that build trust and advance innovation. nist.gov/news-events/news/20…

Announcing the "AI Agent Standards Initiative" for Interoperable and Secure Innovation

The Initiative will ensure that the next generation of AI is widely adopted with confidence, can function securely on behalf of its users, and can interoperate smoothly across the digital ecosystem.

nist.gov

140

329

1,588

152,837

OpenRouter

Bytez retweeted

OpenRouter

@OpenRouter

Feb 18

Benchmarks are now available on OpenRouter! See how models perform on industry standard tests, including programming, math, science, long context reasoning, and more to come.

737

97,079

Artificial Analysis

Bytez retweeted

Artificial Analysis

@ArtificialAnlys

15 Dec 2025

NVIDIA has just released Nemotron 3 Nano, a ~30B MoE model that scores 52 on the Artificial Analysis Intelligence Index with just ~3B active parameters Hybrid Mamba-Transformer architecture: Nemotron 3 Nano combines the hybrid Mamba-Transformer approach @NVIDIAAI has used on previous Nemotron models with a moderate-sparsity MoE architecture, enabling highly efficient inference, particularly at longer sequence lengths Small-model improvements: with 31.6B total and 3.6B active parameters, Nemotron 3 Nano scores 52 on our Intelligence Index, in line with OpenAI’s gpt-oss-20b (high). This represents a 6 point lead on the similarly-sized Qwen3 30B A3B 2507 and 15 improvement on NVIDIA’s previous Nemotron Nano 9B V2 (a dense model) High openness: Nemotron 3 Nano follows other recent NVIDIA models in open licensing and releases of data and methodology for the community to use and replicate - it scores an 67 on the Artificial Analysis Openness Index, in line with previous Nemotron Nano models Key model details: ➤ 1 million token context window, with text only support ➤ Supports reasoning and non-reasoning modes ➤ Released under the NVIDIA Open Model License; the model is freely available for commercial use or training of derivative models ➤ On launch, the model is being made available with a range of serverless inference providers including @baseten, @DeepInfra, @FireworksAI_HQ, @togethercompute and @friendliai, and it is available now on Hugging Face for local inference or self-deployment See below for our full analysis and key announcement links from NVIDIA 👇

285

110,533

OpenRouter

Bytez retweeted

OpenRouter

@OpenRouter

16 Dec 2025

You can now see the most popular large-context models on the OpenRouter Rankings 👇

Yam Peleg

@Yampeleg

14 Dec 2025

how?

6,872

Michael Bronstein

Bytez retweeted

Michael Bronstein @mmbronstein

8 Dec 2025

NeurIPS 2025 papers per 1 Million People 1. Singapore – 64.51 2. Switzerland – 22.13 3. Israel – 11.17 4. UAE – 9.47 5. UK – 7.50 6. US – 7.44 7. Denmark – 7.37 8. Australia – 7.31 9. Canada – 6.93 10. South Korea – 5.78

110

1,168

144,787

alphaXiv

Bytez retweeted

alphaXiv

@askalphaxiv

4 Dec 2025

New paper from Qwen team! They showed that because token-level updates are just fragile approximations of sequence rewards, you must use Routing Replay and Clipping to minimize the gap between training & inference for stable RL training in LLMs now trending on AlphaXiv 📈

423

51,542

Chanwoo Park

Bytez retweeted

Chanwoo Park

@chanwoopark20

6 Dec 2025

One of my favorite moments from Yejin Choi’s NeurIPS keynote was her point as follows: "it looks like a minor detail, but one thing I learned since joining and spending time at NVIDIA is that all these, like, minor details, implementation details matter a lot" -- I think this is exactly the point that theory people often undervalue when it comes to empirical work.

1,090

125,711

OpenRouter

Bytez retweeted

OpenRouter

@OpenRouter

4 Dec 2025

We collaborated with @a16z to publish the **State of AI** - an empirical report on how LLMs have been used on OpenRouter. After analyzing more than 100 trillion tokens across hundreds of models and 3 million users (excluding 3rd party) from the last year, we have a lot of insights to share.

146

795

241,175