Attorney, Entrepreneur, Father and Husband.

Joined July 2008
26 Photos and videos
Caleb Baskin retweeted
The streets will never forget
1,130
8,778
93,888
2,859,306
Not sure how this squares with @DavidSacks position that they've been scaremongering and this risk is overstated. Which is it?
Jun 13
As I said, they’ve been asking to be regulated and have now succeeded!
9
Hey @claudeai and @bcherny, for the knowledge workers, it would be amazing if the plugins in office (Word, Outlook, etc.), could see/access/work in existing chat, CoWork and Claude Code chats, threads and projects. That interconnectivity would solve so many issues.
2
69
Hey @Superhuman, my Humanizer plugin has stopped reading text abruptly. Grammarly still works in the same plugin. How to fix?
1
1
100
Caleb Baskin retweeted
Steve Kerr reveals he spent an entire season secretly slipping Taylor Swift’s ‘All Too Well’ lyrics into his press conferences without anyone noticing (Via @espn, h/t @TheNBABase)
152
182
7,144
927,625
Hey @steipete, is there a way you could add Send To functionality to the Codex Mac app? Right-click to send to a new chat in a project, existing thread/chat in same, would be great for knowledge folks. Same functionality would be great in Outlook, i.e., send this email/thread (with attachments) to a chat/project. Haven't been able to figure this out solo; thought you might have an idea.
1
50
Hey @ClaudeDevs is there a way you could add Send To functionality to the Mac app? Right-click to send to a new chat in a CoWork project, existing thread/chat in same, would be great for knowledge folks. Same functionality would be great in Outlook, i.e., send this email/thread (with attachments) to a chat/project/CoWork.
34
I would love Model Council in the Perplexity desktop app @AravSrinivas if you get a chance.
1
60
Hey @meetgranola how come now M365 Connector?
1
39
Is there anything to be done about the constant workspace out of space errors in Cowork @bcherny?
45
Caleb Baskin retweeted
“When Steph and Klay were on fire during those playoff runs, there was really nothing like it. It was so electric. It's so much bigger than just a game. When a team gets on a run like that, you can really feel the energy change in the whole city. It lifts everybody up. 𝐈 𝐰𝐚𝐧𝐭 𝐭𝐡𝐚𝐭 𝐬𝐨 𝐛𝐚𝐝 𝐡𝐞𝐫𝐞 𝐢𝐧 𝐒𝐚𝐧 𝐉𝐨𝐬𝐞.” —Macklin Celebrini
1
42
545
12,283
Hey @bcherny we are constantly hitting sandbox limits with CoWork. Could this get fixed/addressed? Maybe give us a setting to expand the sandbox in the short term and some skills to manage storage sometime thereafter?
24
Caleb Baskin retweeted
Too soon?
18
28
305
22,559
Caleb Baskin retweeted
From one Sharks record holder to the next. 🤝
104
951
12,008
608,849
So Codex seems to just hang when clicking on a plugin to install. How to get around this? @openai any thoughts?
38
Caleb Baskin retweeted
A lot to like from Luca Cagnoni last night!! Here are some of little things that impressed me in his game last night versus Nashville. I know he played 6 NHL games already so relax #thebite
19
75
886
47,067
Hi @bcherny, question about @Claude: What model/effort is used with Opus 4.6 extended thinking in CoWork desktop vs. Opus 4.6 1M Max in Code?
277
Very astute take.
Judging by my tl there is a growing gap in understanding of AI capability. The first issue I think is around recency and tier of use. I think a lot of people tried the free tier of ChatGPT somewhere last year and allowed it to inform their views on AI a little too much. This is a group of reactions laughing at various quirks of the models, hallucinations, etc. Yes I also saw the viral videos of OpenAI's Advanced Voice mode fumbling simple queries like "should I drive or walk to the carwash". The thing is that these free and old/deprecated models don't reflect the capability in the latest round of state of the art agentic models of this year, especially OpenAI Codex and Claude Code. But that brings me to the second issue. Even if people paid $200/month to use the state of the art models, a lot of the capabilities are relatively "peaky" in highly technical areas. Typical queries around search, writing, advice, etc. are *not* the domain that has made the most noticeable and dramatic strides in capability. Partly, this is due to the technical details of reinforcement learning and its use of verifiable rewards. But partly, it's also because these use cases are not sufficiently prioritized by the companies in their hillclimbing because they don't lead to as much $$$ value. The goldmines are elsewhere, and the focus comes along. So that brings me to the second group of people, who *both* 1) pay for and use the state of the art frontier agentic models (OpenAI Codex / Claude Code) and 2) do so professionally in technical domains like programming, math and research. This group of people is subject to the highest amount of "AI Psychosis" because the recent improvements in these domains as of this year have been nothing short of staggering. When you hand a computer terminal to one of these models, you can now watch them melt programming problems that you'd normally expect to take days/weeks of work. It's this second group of people that assigns a much greater gravity to the capabilities, their slope, and various cyber-related repercussions. TLDR the people in these two groups are speaking past each other. It really is simultaneously the case that OpenAI's free and I think slightly orphaned (?) "Advanced Voice Mode" will fumble the dumbest questions in your Instagram's reels and *at the same time*, OpenAI's highest-tier and paid Codex model will go off for 1 hour to coherently restructure an entire code base, or find and exploit vulnerabilities in computer systems. This part really works and has made dramatic strides because 2 properties: 1) these domains offer explicit reward functions that are verifiable meaning they are easily amenable to reinforcement learning training (e.g. unit tests passed yes or no, in contrast to writing, which is much harder to explicitly judge), but also 2) they are a lot more valuable in b2b settings, meaning that the biggest fraction of the team is focused on improving them. So here we are.
38
Caleb Baskin retweeted
Apr 2
Another four-point night for Macklin Celebrini, giving him on the season: 40 goals 65 assists 105 points All in just 73 games. He's still just 19 years old.
181
1,581
15,397
845,472