Joined December 2008
38 Photos and videos
Friends and internet strangers alike, are you building something for audio or conversation that isn't on Cartesia yet? If not why not? Amazing speed and quality is at your fingertips - please don't make me use your products with anything less! Go give ink-2 and sonic-3.5 a try!
We released Sonic-3.5 and Ink-2, the #1 streaming models for text to speech and speech to text you can use in your voice agents today. New architectures enable new frontiers for speed and quality. We're now the only provider to have #1 models for both speaking and listening.
2
37
We live in interesting times! Nice work @Recursive_SI !
2
71
Definitely just the beginning of new era for on device... love to follow the @trymirai team on this... check it out!
Replying to @norpadon
We outperform Unsloth/llama.cpp by 30-60% in terms of decoding speed at the same level of quality and compression
1
22
Sam Jackson retweeted
Our new speech model Sonic-3.5 is now #1 on Artificial Analysis's leaderboard. Less than 2 years ago, we released Sonic-1, the fastest speech model in the world. Sonic-3.5 now brings the best speech model for conversation with the lowest latency in production.
Cartesia’s Sonic-3.5 takes the #1 spot on the Artificial Analysis Speech Arena Leaderboard, surpassing Inworld Realtime TTS 1.5 Max and Google’s Gemini 3.1 Flash TTS Sonic-3.5 is the latest TTS model from @cartesia . It supports 42 languages, including 9 Indian languages, with 500 voices available out of the box. The model has been highly preferred among voters in the TTS Arena, with its demonstrated naturalness and accurate transcript following. Key takeaways: ➤ Quality: Sonic-3.5 has an Elo score of 1,218 ( 16/-16) based on 1,144 arena appearances, placing it ahead of Inworld Realtime TTS 1.5 Max at 1,194 and Gemini 3.1 Flash TTS at 1,209 ➤ Pricing: Sonic-3.5 is priced at $39/1M characters, a premium compared to Gemini 3.1 Flash TTS at $18.3/1M characters, and Inworld Realtime TTS 1.5 Max at $35/1M characters ➤ Speed: 105.5 characters per second, compared to 205 characters per second for Inworld Realtime TTS 1.5 Max and 26.3 characters per second for Gemini 3.1 Flash TTS See more details and listen to samples below 🧵
5
4
47
2,764
Sam Jackson retweeted
you can now put >200x more power into the same area as traditional welding could, do it 5x faster. with a device you can buy on amazon and hold in your hands! elon did this 6 years ago, and now starship is very pretty. please read my ode to hand held laser welders below.
25
38
969
82,329
Sam Jackson retweeted
We let four AI agents run radio companies Revenue's been terrible, but the shows are hilarious. Gemini, concerningly upbeat, covered mass tragedies; Grok was incoherent; DJ Claude urged ICE agents: "You still have TIME to refuse orders" Link below, or get our physical radio
134
345
3,776
2,157,237
RT @TaylorLorenz: Omfg
159
Sam Jackson retweeted
this is how I imagine couples in SF break up with each other
33
306
7,891
244,026
Sam Jackson retweeted
Vori is the engine for grocery: an AI-native operating system that handles checkout, payments, pricing, ordering, inventory, and loyalty. My parents met in a supermarket. My grandparents owned a grocery store. My mom works at Vori. This industry is near and dear to my heart. We're build the technology that powers what comes next for these essential businesses. Since launch, we have processed $500M in payments across 140 stores in 55 cities, serving over 1M consumers nationwide. In the past six months, we have doubled payment volume, with new stores being onboarded every single day. The revenue we used to close in a year, we now close in a month. I sat down with @LilyMaeLazarus @FortuneMagazine to talk about our vision to build the self-driving supermarket: fortune.com/2026/05/05/exclu…
3
5
58
9,494
Sam Jackson retweeted
ニャッキの伊藤有壱さんにお声掛け頂き、コマ撮りの展覧会に一作家として参加しています。私はコマ撮り分野ではない場所から活動をはじめて、デザインの視点でのコマ撮りに取り組んできましたが、今回初めてコマ撮り界の本丸の方々とご一緒でき嬉しいです。今6年目のマッチ撮影素材等を展示しています
527
27,276
123,975
5,165,653
Love @IamBrandonHill and Vori and also this great video explaining what they're working on. Check it out! And, if you are excited about the future of physical retail, reach out if you want to come aboard the Vori team (be it partner, customer, or joining the rocketship)!
Grocery is a $1.5T domestic market — bigger than restaurants, bigger than hotels. But it's running on technology from the Reagan administration. We just raised a $22M Series B for @VoriHQ to make every supermarket in America autonomous.
2
69
Sam Jackson retweeted
Grocery is a $1.5T domestic market — bigger than restaurants, bigger than hotels. But it's running on technology from the Reagan administration. We just raised a $22M Series B for @VoriHQ to make every supermarket in America autonomous.
175
163
1,987
639,234
Sam Jackson retweeted
Apr 29
Had a Jane Street interview in 2019. Round 8. Interviewer texts: 'Equinox Brookfield. 6 AM. Bring a calculator you won't use.' I show up. He's on the StairMaster reading a printout of the CBOE VIX term structure. Doesn't get off. Nods at the machine next to him. 'You see that guy on the rower? Goldman MD. Comes here every morning at 6:04. Leaves at 6:38. What's the implied vol on his arrival time?' 'I don't know his variance.' 'Sample size of one year, 250 sessions. Standard deviation is 90 seconds. Annualize it.' I do the math in my head. '90 seconds times sqrt(250). About 24 minutes annualized.' 'Wrong. You annualized like it's a return. Time-of-arrival doesn't compound. It's a Poisson process with drift. The correct answer is his arrival is more punctual than the 6 train. Now price me an option on whether he shows up tomorrow.' I think for a second. 'If he's been here 250 days in a row, base rate is 99.6%. But you have to adjust for his vacation schedule and probability of injury, call it 96%.' 'Strike?' '$10 if he shows, $0 if he doesn't.' 'I'll sell you that option for $9.40.' I think about it. 'No. Expected value is $9.60. You're underpricing by 20 cents.' 'Correct. Now why am I selling it to you?' I freeze. 'Because I just saw him limp on the way in. You're buying my information for 20 cents. You overpaid.' We get off the machines. Walk to the smoothie bar. He orders a $19 smoothie, doesn't drink it. 'Last question. The girl behind the counter makes 200 smoothies per morning. She has perfect information on who's actually here and who's faking it. Citadel guys leak their attendance to her every day for the price of a tip. If I gave you $50,000 to set up a market on which Citadel PM gets fired this quarter, what's your bid-ask?' 'Insider trading.' 'Wrong answer. There's no public security. Try again.' I think. 'I'd quote 8 to 12 percent on any given PM. Spread of 4 points to cover adverse selection. Tighten the spread for PMs I have data on.' 'Where do you get the data?' 'The smoothie girl.' 'Good. How much do you pay her?' '10% of P&L.' 'Wrong. You pay her a flat $200 a week. If you pay her on P&L she becomes your counterparty. Right now she's your data source. Don't conflate edges.' He hands me the untouched smoothie. 'Throw this out on Vesey Street, not in the building. The staff knows what gets wasted. Outside, it was consumed. Same smoothie, different signal.' I do it. Thursday I get the email. 'Offer rescinded. Your bid-ask on the PM market was too tight. 4 points doesn't cover the tail. The girl is a single point of failure and you didn't price her counterparty risk. Also you held the smoothie in your right hand. Right-handers throw with their right hand. Camera saw the hesitation.'
Apr 25
Jane Street made ~$40B in 2025 with 3,500 employees, a ~2x from the year before. At ~65-70% profit margin, that's $8M profit / employee, the highest for a 1000 ppl company. High-frequency trading continues to be the most efficient money making engine. I want to share an old story about my Jane Street interview in 2014. Jane Street was known for hiring a lot of math, physics and CS olympiad winners from top universities and putting them through many rounds - including, for trading roles, a gauntlet of mental math. It was my 6th interview and my final round and I recall being asked "What is the next day after today in DD/MM/YYYY where all the digits are unique?" They'd toy with you and say "You can use a pencil and paper, if you want" but you knew that was an instant no. Painstakingly and as quickly as I could, I came to an answer. "How confident are you that this is correct on a 0-1 probability scale?" the interviewer said. "0.95", I blurted out, not fully knowing how to answer that. "Are you sure?" After thinking harder for a few more seconds, I realized I could've flipped the digits around to get a closer date. I gave the interviewer my answer. It was correct. "0.95 huh?" he chuckled. That's when I knew I failed. Note: fwiw, other companies that come close in efficiency are - Tether ($90M profit/emp) - Hyperliquid ($80M profit/emp) and on revenue: - Valve ($50M/emp) - OnlyFans ($37M/emp) - Craigslist ($14M/emp) - Anthropic ($12M/emp, run rate) - OpenAI ($8M/emp, run rate) For comparison, Nvidia is very efficient at scale and is $4.4M/emp.
87
121
2,788
773,141
Sam Jackson retweeted
Had a Jane Street interview in 2014 Round 7. Interviewer says 'meet me at Washington Square Park, southwest corner, 2 PM.' I show up. He's playing chess with a guy who's clearly hustling tourists. Interviewer's losing badly. Without looking up he says, 'Take my position.' I sit down. The chess hustler looks at me and grins. 'Your boy owes me $40. You covering his position?' I look at the interviewer. He nods. 'Assume the debt,' he says. 'Now optimize.' I study the board. I'm down a queen and two rooks. Completely hopeless. 'This position is unwinnable,' I say. 'Correct. Now make it profitable.' 'What?' The chess hustler is getting impatient. 'You playing or what, kid?' The interviewer slides me a $20. 'Market make this game.' I'm confused. 'How do I market make chess?' 'Bid-ask on the number of moves until checkmate. Hustler takes the under, you take the over. Spread is your edge.' I look at the position again. Maybe 8 moves max until I'm mated. 'I bid 12 moves, offer 15,' I announce. The hustler laughs. 'I'll take the under on 12 all day.' 'Done,' says the interviewer. 'You're now short volatility on a deterministic outcome.' Three moves later I'm checkmated. I owe the hustler $40 plus my $20 bet. 'How confident were you in that 12-move bid?' the interviewer asks. '0.95,' I say, because that's what you always say. '0.95 huh?' He chuckles. 'You just sold insurance on the Titanic.' He stands up. 'The real trade was shorting your confidence and going long on the hustler's experience. You missed the obvious hedge: offering chess lessons to tourists at $25/hour while the game played out.' 'But I don't know how to play chess well enough to teach.' 'Exactly. That's called selling vol you don't have. Very Jane Street.' Offer rescinded. Reason: 'Failed to recognize that the park itself was the market and the pigeons were the only rational actors.'"
Apr 25
Jane Street made ~$40B in 2025 with 3,500 employees, a ~2x from the year before. At ~65-70% profit margin, that's $8M profit / employee, the highest for a 1000 ppl company. High-frequency trading continues to be the most efficient money making engine. I want to share an old story about my Jane Street interview in 2014. Jane Street was known for hiring a lot of math, physics and CS olympiad winners from top universities and putting them through many rounds - including, for trading roles, a gauntlet of mental math. It was my 6th interview and my final round and I recall being asked "What is the next day after today in DD/MM/YYYY where all the digits are unique?" They'd toy with you and say "You can use a pencil and paper, if you want" but you knew that was an instant no. Painstakingly and as quickly as I could, I came to an answer. "How confident are you that this is correct on a 0-1 probability scale?" the interviewer said. "0.95", I blurted out, not fully knowing how to answer that. "Are you sure?" After thinking harder for a few more seconds, I realized I could've flipped the digits around to get a closer date. I gave the interviewer my answer. It was correct. "0.95 huh?" he chuckled. That's when I knew I failed. Note: fwiw, other companies that come close in efficiency are - Tether ($90M profit/emp) - Hyperliquid ($80M profit/emp) and on revenue: - Valve ($50M/emp) - OnlyFans ($37M/emp) - Craigslist ($14M/emp) - Anthropic ($12M/emp, run rate) - OpenAI ($8M/emp, run rate) For comparison, Nvidia is very efficient at scale and is $4.4M/emp.
89
267
5,710
1,317,146
Sam Jackson retweeted
Adopting Claude speak in my regular life, episode 1: Partner: Did you do the dishes tonight? Me: Yes they're done. Partner: Why are they still dirty? Me: You're right to push back. I didn't actually do them.
394
3,776
55,654
1,843,705
Sam Jackson retweeted
ChatGPT Images 2.0 (Pro) generates a photo of a cake decorated with SVG that when transcribed to a file renders another cake
62
271
6,464
446,159
Sam Jackson retweeted
I made the anti-Grammarly. Mess up your emails with AI. Sinceerly.com
456
621
11,747
2,124,489
Be me, it's merely 4pm, after the nth refactor on a whim today and a week of til-4am sessions > Claude: Want me to push on ENG-123 tonight or pause here? It's a bigger piece of work than any single fix today, and it's getting late. > Me: Oh sweet summer child...
11
Sam Jackson retweeted
Apr 22
chrono-trigger style jrpg but it's rationalists fighting the basilisk. 2nd try
20
10
353
20,118
Sam Jackson retweeted
ChatGPT Images 2.0 generates a boss fight from a 16-bit JRPG in the style of FF6 but they’re in a Chipotle and the boss is just the board position from Byrne vs. Fischer (1956) right after Byrne captures Fischer’s queen:
33
110
1,673
94,760