Joined April 2008
169 Photos and videos
vpj retweeted
I don't remember where I found this, but its spot on.
733
6,526
31,929
45,097,115
vpj retweeted
DeepInfra has raised its $107M in Series B funding 🚀 AI is moving from training to production-scale deployment, and inference is becoming the system constraint. DeepInfra was built for this shift — scaling high-throughput inference for open-source and agent-driven workloads. Grateful to our investors and partners, co-led by @500GlobalVC and @gharik
8
10
55
12,339
vpj retweeted
DeepInfra is now a first-class provider in @OpenClaw. One key, every model. 🦞
OpenClaw 2026.4.27 🦞 🧠 DeepInfra provider 📎 better file attachments 🛡️ operator-managed proxy routing 🧭 stricter model selection local model fixes 🔧 gateway, channel, and session reliability Ships more than it brags. github.com/openclaw/openclaw…
2
4
11
1,854
vpj retweeted
DeepInfra × Hugging Face DeepInfra is live on @HuggingFace Inference Providers. Run DeepSeek V4, Kimi-K2.6, GLM-5.1 and 100 more open models straight from the Hub — same OpenAI-compatible API, same low per-token pricing, no markup. Just add :deepinfra to the model name.
8
14
71
24,539
vpj retweeted
The DeepSeek V4 garbled output bug in open source inference engine is fixed in SGLang. To everyone affected over the weekend, sorry for the trouble. Huge thanks to @Ant_Group for landing the fix PR. It was a cross-company, cross-timezone, sub-48-hour marathon. @ollama and @humansand surfaced it first; @nvidia, @AIatMeta, and @FireworksAI_HQ raised the same signal soon after. @deepseek_ai replied in seconds at every hour. @FireworksAI_HQ stayed up late with us until it shipped. @SemiAnalysis_ and @ollama provided the machines that made the debugging possible. The SGLang team dug in through the weekend. The real OSS is the friends we made along the way.🫶
16
27
286
80,301
vpj retweeted
DeepSeek V4 is live on DeepInfra at launch 🔥 V4-Pro: 1.6T MoE / 49B active. Frontier-tier reasoning. $1.74 in · $3.48 out · $0.145 cached V4-Flash: 284B MoE / 13B active. Fast & cheap for agents, RAG, long-context extraction. $0.14 in · $0.28 out · $0.028 cached
7
1
23
1,027
vpj retweeted
Day 0. GLM-5.1 from @Zai_org is live on DeepInfra. Open source getting close to GPT-5.4 and Claude Opus 4.6. Powered by @nvidia B300 Blackwell Ultra. Early access pricing, costs will drop as we scale. $1.40 in / $4.40 out / $0.26 cached per 1M tokens ↓
1
4
9
1,246
vpj retweeted
Kimi K2.5 Turbo just dropped on Deep Infra 🚀 #1 by speed: 341 tokens/sec #1 by price: $0.90/1M tokens credits to @ArtificialAnlys for benchmarks
12
19
307
25,891
vpj retweeted
there is still no substitute for perfectly understanding every single line of code in your codebase i fall into the trap of just skimming through ai changes to "just make sure it looks good" all the time, and it makes me lose so much time to not perfectly understand every line
157
94
2,622
251,138
vpj retweeted
At 1:30 a.m. PT on November 3, 2023 Elon sent a message to the xAI group chat saying that we need to go “extremely hardcore” for the next 36 hours; Grok will be released publicly tomorrow. You didn’t have to be in the exclusive company chat to get the message; it was also posted publicly at the same time: x.com/i/status/1720372289378… What unfolded over the next day and a half was one of the best examples of engineering at pace that I’ve ever seen. All we had when we started was a somewhat fine-tuned base model and a half-baked UI. Our team of ten split up the tasks: curate data, improve the model, implement the raw prompting and RAG service, build the production infra. I took care of the latter. At 8:51 p.m. PT the next day, we announced Grok to the world with a long-form post on X (x.com/xai/status/17210273489…). Over the past 36 hours, we came up with Fun mode (including Grok’s sunglasses), finished the whole production system, and most importantly tuned the RAG system that gave it real-time knowledge of the world through the X platform (a first in the industry). A day and a half of straight coding and shipping; no drugs, not even caffeine, just pure adrenaline. Elon gave us a mission and we delivered. The launch went very well. We invited a couple hundred X creators and Grok’s ability to roast accounts went viral. It was the first time a publicly accessible AI was allowed to poke fun at people. This episode is a prime example of what you can achieve by going extremely hardcore: you move and deliver results faster than any outsider could have anticipated. Within 36 hours, we took the company from silence to relevance. It was well worth it. xAI’s hardcore culture is infamous on X. I love the tent meme that suggests we all sleep (well, slept in my case) in the office in tents. Our reputation precedes us and even new joiners hit the ground grinding hard. However, unless you understand the “why,” you are at risk of simply replicating the “how” without achieving the same results. You need to grind with purpose and the purpose is to move fast towards a known goal. When the goal and the means of reaching it are crystal clear, a small, skilled, and highly motivated team can outcompete companies old and new, big and small. Never grind to show off; never work late to be seen; never sacrifice without cause. There is no medal for the one who tried extremely hard but failed. There is only a medal for the winner. If all your efforts lead nowhere, you’re arguably not very productive. Always keep your eyes firmly on the goal, do everything to reach it as quickly as possible, and make sure you're on track to win. A hardcore engineering culture is one of the most effective ways of accelerating real progress. Watch out for performative sacrifice and don’t confuse pain with progress.

5 Nov 2023
Announcing Grok! Grok is an AI modeled after the Hitchhiker’s Guide to the Galaxy, so intended to answer almost anything and, far harder, even suggest what questions to ask! Grok is designed to answer questions with a bit of wit and has a rebellious streak, so please don’t use it if you hate humor! A unique and fundamental advantage of Grok is that it has real-time knowledge of the world via the 𝕏 platform. It will also answer spicy questions that are rejected by most other AI systems. Grok is still a very early beta product – the best we could do with 2 months of training – so expect it to improve rapidly with each passing week with your help. Thank you, the xAI Team x.ai
38
67
1,006
211,937
vpj retweeted
you should join humans& we have great perks for example @rramador will buy you a cool dino
6
1
63
7,457
Feb 19
Coding with AI is like playing SimCity with no cost to build, only maintenance and demolition actually cost you.
88
vpj retweeted
PMs using AI “now I don’t have to wait on designers for mockups!” Designers using AI “now I don’t have to wait on developers for code!” Developers using AI “now I don’t have to wait on PMs for requirements !”
32
35
400
44,623
vpj retweeted
MiniMax-M2.5 from @MiniMax_AI is live on DeepInfra! 80.2% SWE-Bench Verified. Agentic tool use search. Office real-world workflows. $0.27 / $0.95 with $0.03 cached. High capability. Extremely efficient.
1
3
8
821
vpj retweeted
Announcing the humans& hackathon! Hack with us this Saturday - come experiment and build AI apps to help people collaborate and communicate, work with creative folks, learn a bit about what we're building, and win cool prizes Apply here: luma.com/2pbif8t9
20
23
311
83,945
vpj retweeted
Day-0 with @Zai_org: GLM-5 is live on DeepInfra 🔥 Built for long-horizon agents that plan, orchestrate, and self-correct. Serving ~100 TPS at launch and as usual the best price on the market!
7
8
149
12,538
vpj retweeted
We're hiring the best product team in the world. Come join us!
We’re building foundation models that enable humans to better collaborate, communicate, and coordinate with one another. That requires rethinking many interfaces we take for granted today. We’re hiring amazing product builders to join us on this mission - if that’s you, apply
1
1
12
709
vpj retweeted
We’re building foundation models that enable humans to better collaborate, communicate, and coordinate with one another. That requires rethinking many interfaces we take for granted today. We’re hiring amazing product builders to join us on this mission - if that’s you, apply
8
13
159
86,622
vpj retweeted
Whoever said “money can’t buy happiness” really knew what they were talking about 😔
126,077
59,418
599,181
111,823,029
vpj retweeted
Most people don't really understand scale because they've never really experienced it. I remember when I ran a dating site with millions of users getting a subpoena asking for communication records between two users. Apparently they met on the site and were in a relationship and someone was murdered. I remember feeling a pit in my stomach and I had to take the rest of the day off. I ended up aimlessly walking around town feeling horrible. That's when a friend pointed out to me that with as many users as we had, it was statistically impossible for these things not to happen. If you play a number in roulette 1000 times, you will win 26.3 times. I don't know why I'm writing this, I just remembered that day for no reason and thought I'd write this down. It's useful for people to know though, when you run something big, *everything* happens.
15
23
750
57,289