Brian Halligan

Brian Halligan

169 Photos and videos

Tweets

vpj retweeted

Brian Halligan

@bhalligan

May 6

I don't remember where I found this, but its spot on.

733

6,526

31,929

45,097,115

DeepInfra

vpj retweeted

DeepInfra

@DeepInfra

May 4

DeepInfra has raised its $107M in Series B funding 🚀 AI is moving from training to production-scale deployment, and inference is becoming the system constraint. DeepInfra was built for this shift — scaling high-throughput inference for open-source and agent-driven workloads. Grateful to our investors and partners, co-led by @500GlobalVC and @gharik

12,339

DeepInfra

vpj retweeted

DeepInfra

@DeepInfra

May 1

DeepInfra is now a first-class provider in @OpenClaw. One key, every model. 🦞

OpenClaw🦞

@openclaw

Apr 29

OpenClaw 2026.4.27 🦞 🧠 DeepInfra provider 📎 better file attachments 🛡️ operator-managed proxy routing 🧭 stricter model selection local model fixes 🔧 gateway, channel, and session reliability Ships more than it brags. github.com/openclaw/openclaw…

1,854

DeepInfra

vpj retweeted

DeepInfra

@DeepInfra

Apr 29

DeepInfra × Hugging Face DeepInfra is live on @HuggingFace Inference Providers. Run DeepSeek V4, Kimi-K2.6, GLM-5.1 and 100 more open models straight from the Hub — same OpenAI-compatible API, same low per-token pricing, no markup. Just add :deepinfra to the model name.

24,539

LMSYS Org

vpj retweeted

LMSYS Org

@lmsysorg

Apr 27

The DeepSeek V4 garbled output bug in open source inference engine is fixed in SGLang. To everyone affected over the weekend, sorry for the trouble. Huge thanks to @Ant_Group for landing the fix PR. It was a cross-company, cross-timezone, sub-48-hour marathon. @ollama and @humansand surfaced it first; @nvidia, @AIatMeta, and @FireworksAI_HQ raised the same signal soon after. @deepseek_ai replied in seconds at every hour. @FireworksAI_HQ stayed up late with us until it shipped. @SemiAnalysis_ and @ollama provided the machines that made the debugging possible. The SGLang team dug in through the weekend. The real OSS is the friends we made along the way.🫶

286

80,301

DeepInfra

vpj retweeted

DeepInfra

@DeepInfra

Apr 24

DeepSeek V4 is live on DeepInfra at launch 🔥 V4-Pro: 1.6T MoE / 49B active. Frontier-tier reasoning. $1.74 in · $3.48 out · $0.145 cached V4-Flash: 284B MoE / 13B active. Fast & cheap for agents, RAG, long-context extraction. $0.14 in · $0.28 out · $0.028 cached

1,027

DeepInfra

vpj retweeted

DeepInfra

@DeepInfra

Apr 7

Day 0. GLM-5.1 from @Zai_org is live on DeepInfra. Open source getting close to GPT-5.4 and Claude Opus 4.6. Powered by @nvidia B300 Blackwell Ultra. Early access pricing, costs will drop as we scale. $1.40 in / $4.40 out / $0.26 cached per 1M tokens ↓

1,246

DeepInfra

vpj retweeted

DeepInfra

@DeepInfra

Mar 13

Kimi K2.5 Turbo just dropped on Deep Infra 🚀 #1 by speed: 341 tokens/sec #1 by price: $0.90/1M tokens credits to @ArtificialAnlys for benchmarks

307

25,891

gabriel

vpj retweeted

gabriel

@gabriel1

Mar 6

there is still no substitute for perfectly understanding every single line of code in your codebase i fall into the trap of just skimming through ai changes to "just make sure it looks good" all the time, and it makes me lose so much time to not perfectly understand every line

157

2,622

251,138

Toby Pohlen

vpj retweeted

Toby Pohlen

@TobyPhln

Mar 4

At 1:30 a.m. PT on November 3, 2023 Elon sent a message to the xAI group chat saying that we need to go “extremely hardcore” for the next 36 hours; Grok will be released publicly tomorrow. You didn’t have to be in the exclusive company chat to get the message; it was also posted publicly at the same time: x.com/i/status/1720372289378… What unfolded over the next day and a half was one of the best examples of engineering at pace that I’ve ever seen. All we had when we started was a somewhat fine-tuned base model and a half-baked UI. Our team of ten split up the tasks: curate data, improve the model, implement the raw prompting and RAG service, build the production infra. I took care of the latter. At 8:51 p.m. PT the next day, we announced Grok to the world with a long-form post on X (x.com/xai/status/17210273489…). Over the past 36 hours, we came up with Fun mode (including Grok’s sunglasses), finished the whole production system, and most importantly tuned the RAG system that gave it real-time knowledge of the world through the X platform (a first in the industry). A day and a half of straight coding and shipping; no drugs, not even caffeine, just pure adrenaline. Elon gave us a mission and we delivered. The launch went very well. We invited a couple hundred X creators and Grok’s ability to roast accounts went viral. It was the first time a publicly accessible AI was allowed to poke fun at people. This episode is a prime example of what you can achieve by going extremely hardcore: you move and deliver results faster than any outsider could have anticipated. Within 36 hours, we took the company from silence to relevance. It was well worth it. xAI’s hardcore culture is infamous on X. I love the tent meme that suggests we all sleep (well, slept in my case) in the office in tents. Our reputation precedes us and even new joiners hit the ground grinding hard. However, unless you understand the “why,” you are at risk of simply replicating the “how” without achieving the same results. You need to grind with purpose and the purpose is to move fast towards a known goal. When the goal and the means of reaching it are crystal clear, a small, skilled, and highly motivated team can outcompete companies old and new, big and small. Never grind to show off; never work late to be seen; never sacrifice without cause. There is no medal for the one who tried extremely hard but failed. There is only a medal for the winner. If all your efforts lead nowhere, you’re arguably not very productive. Always keep your eyes firmly on the goal, do everything to reach it as quickly as possible, and make sure you're on track to win. A hardcore engineering culture is one of the most effective ways of accelerating real progress. Watch out for performative sacrifice and don’t confuse pain with progress.

xAI

@xai

5 Nov 2023

Announcing Grok! Grok is an AI modeled after the Hitchhiker’s Guide to the Galaxy, so intended to answer almost anything and, far harder, even suggest what questions to ask! Grok is designed to answer questions with a bit of wit and has a rebellious streak, so please don’t use it if you hate humor! A unique and fundamental advantage of Grok is that it has real-time knowledge of the world via the 𝕏 platform. It will also answer spicy questions that are rejected by most other AI systems. Grok is still a very early beta product – the best we could do with 2 months of training – so expect it to improve rapidly with each passing week with your help. Thank you, the xAI Team x.ai

1,006

211,937

Saurabh Shah

vpj retweeted

Saurabh Shah

@saurabh_shah2

Feb 24

you should join humans& we have great perks for example @rramador will buy you a cool dino

7,457

vpj

vpj

@vpj

Feb 19

Coding with AI is like playing SimCity with no cost to build, only maintenance and demolition actually cost you.

Luke Wroblewski

vpj retweeted

Luke Wroblewski

@LukeW

Feb 19

PMs using AI “now I don’t have to wait on designers for mockups!” Designers using AI “now I don’t have to wait on developers for code!” Developers using AI “now I don’t have to wait on PMs for requirements !”

400

44,623

DeepInfra

vpj retweeted

DeepInfra

@DeepInfra

Feb 18

MiniMax-M2.5 from @MiniMax_AI is live on DeepInfra! 80.2% SWE-Bench Verified. Agentic tool use search. Office real-world workflows. $0.27 / $0.95 with $0.03 cached. High capability. Extremely efficient.

821

humans&

vpj retweeted

humans&

@humansand

Feb 17

Announcing the humans& hackathon! Hack with us this Saturday - come experiment and build AI apps to help people collaborate and communicate, work with creative folks, learn a bit about what we're building, and win cool prizes Apply here: luma.com/2pbif8t9

ALT watercolor of terminal with sf visible

311

83,945

DeepInfra

vpj retweeted

DeepInfra

@DeepInfra

Feb 11

Day-0 with @Zai_org: GLM-5 is live on DeepInfra 🔥 Built for long-horizon agents that plan, orchestrate, and self-correct. Serving ~100 TPS at launch and as usual the best price on the market!

149

12,538

Charlie George

vpj retweeted

Charlie George

@__Charlie_G

Feb 11

We're hiring the best product team in the world. Come join us!

humans&

@humansand

Feb 11

We’re building foundation models that enable humans to better collaborate, communicate, and coordinate with one another. That requires rethinking many interfaces we take for granted today. We’re hiring amazing product builders to join us on this mission - if that’s you, apply

ALT offsite photo converted into painting with large brush strokes

709

humans&

vpj retweeted

humans&

@humansand

Feb 11

ALT offsite photo converted into painting with large brush strokes

159

86,622

Elon Musk

vpj retweeted

Elon Musk

@elonmusk

Feb 5

Whoever said “money can’t buy happiness” really knew what they were talking about 😔

126,077

59,418

599,181

111,823,029

james hong

vpj retweeted

james hong

@jhong

Feb 2

Most people don't really understand scale because they've never really experienced it. I remember when I ran a dating site with millions of users getting a subpoena asking for communication records between two users. Apparently they met on the site and were in a relationship and someone was murdered. I remember feeling a pit in my stomach and I had to take the rest of the day off. I ended up aimlessly walking around town feeling horrible. That's when a friend pointed out to me that with as many users as we had, it was statistically impossible for these things not to happen. If you play a number in roulette 1000 times, you will win 26.3 times. I don't know why I'm writing this, I just remembered that day for no reason and thought I'd write this down. It's useful for people to know though, when you run something big, *everything* happens.

750

57,289