CK

CK

119 Photos and videos

Tweets

Pinned Tweet

CK @seaspinner

12 Nov 2024

I now have an account there - CK @ bluespinner

CK @seaspinner

8 Nov 2024

I've learned there's a BlueSky account using the handle @seaspinner. That is NOT me - I have no idea who they are - and to avoid confusion I do want people here, especially my followers, to know that isn't my account. I'll let you know if/when I get an account there.

1,176

Marc Watkins

CK retweeted

Marc Watkins @Marc__Watkins

Feb 15

Replying to @simonw

I've seen it as well for writing and research-based tasks. Automating processes breaks traditional systems of meaning. We can't create relationships with work without some activity in the process. AI is completing tasks too vividly for us to think about.

Simon Willison

CK retweeted

Simon Willison

@simonw

Feb 15

Short musings on "cognitive debt" - I'm seeing this in my own work, where excessive unreviewed AI-generated code leads me to lose a firm mental model of what I've built, which then makes it harder to confidently make future decisions simonwillison.net/2026/Feb/1…

How Generative and Agentic AI Shift Concern from Technical Debt to Cognitive Debt

This piece by Margaret-Anne Storey is the best explanation of the term cognitive debt I've seen so far. Cognitive debt, a term gaining traction recently, instead communicates the notion that …

simonwillison.net

140

205

2,307

162,236

Ewan Morrison

CK retweeted

Ewan Morrison

@MrEwanMorrison

Feb 9

"LLM's can't reason." Stanford paper: "... unless we start measuring how models fail not just how often they succeed we’ll keep deploying systems that pass benchmarks, fail silently in production, and explain themselves with total confidence while doing the wrong thing."

Alex Prompter

@alex_prompter

Feb 9

🚨 Holy shit… Stanford just published the most uncomfortable paper on LLM reasoning I’ve read in a long time. This isn’t a flashy new model or a leaderboard win. It’s a systematic teardown of how and why large language models keep failing at reasoning even when benchmarks say they’re doing great. The paper does one very smart thing upfront: it introduces a clean taxonomy instead of more anecdotes. The authors split reasoning into non-embodied and embodied. Non-embodied reasoning is what most benchmarks test and it’s further divided into informal reasoning (intuition, social judgment, commonsense heuristics) and formal reasoning (logic, math, code, symbolic manipulation). Embodied reasoning is where models must reason about the physical world, space, causality, and action under real constraints. Across all three, the same failure patterns keep showing up. > First are fundamental failures baked into current architectures. Models generate answers that look coherent but collapse under light logical pressure. They shortcut, pattern-match, or hallucinate steps instead of executing a consistent reasoning process. > Second are application-specific failures. A model that looks strong on math benchmarks can quietly fall apart in scientific reasoning, planning, or multi-step decision making. Performance does not transfer nearly as well as leaderboards imply. > Third are robustness failures. Tiny changes in wording, ordering, or context can flip an answer entirely. The reasoning wasn’t stable to begin with; it just happened to work for that phrasing. One of the most disturbing findings is how often models produce unfaithful reasoning. They give the correct final answer while providing explanations that are logically wrong, incomplete, or fabricated. This is worse than being wrong, because it trains users to trust explanations that don’t correspond to the actual decision process. Embodied reasoning is where things really fall apart. LLMs systematically fail at physical commonsense, spatial reasoning, and basic physics because they have no grounded experience. Even in text-only settings, as soon as a task implicitly depends on real-world dynamics, failures become predictable and repeatable. The authors don’t just criticize. They outline mitigation paths: inference-time scaling, analogical memory, external verification, and evaluations that deliberately inject known failure cases instead of optimizing for leaderboard performance. But they’re very clear that none of these are silver bullets yet. The takeaway isn’t that LLMs can’t reason. It’s more uncomfortable than that. LLMs reason just enough to sound convincing, but not enough to be reliable. And unless we start measuring how models fail not just how often they succeed we’ll keep deploying systems that pass benchmarks, fail silently in production, and explain themselves with total confidence while doing the wrong thing. That’s the real warning shot in this paper. Paper: Large Language Model Reasoning Failures

267

2,626

203,879

neil turkewitz

CK retweeted

neil turkewitz @neilturkewitz

Jan 31

Ironic that TIME would embrace AI which denies truth, context and time itself. The selling of a lie, brought to you by the good folks at Time!

TIME

@TIME

Jan 29

America wasn’t born in a moment. Before it was a nation, it was an idea—and in 1776, nothing about it was inevitable. ‘On This Day…1776’ is a new TIME Studios series executive produced by @DarrenAronofsky that tells the story of the year that shaped a nation. Released in short weekly episodes around the dates events occurred. Premiering on TIME’s YouTube channel. Watch the series: time-magazine.visitlink.me/I…

0:44

1,456

neil turkewitz

CK retweeted

neil turkewitz @neilturkewitz

Jan 25

“technofascism tends to operate through…insidious mechanisms of control, such as data extraction, algorithmic governance, behavioral nudging & platform monopolization…creat[ing] depoliticization, atomization & polarization, all under the guise of convenience & personalization.”

Mark Coeckelbergh @MCoeckelbergh

Jan 25

New article on AI and fascism link.springer.com/article/10…

200

593

24,132

Ed Newton-Rex

CK retweeted

Ed Newton-Rex

@ednewtonrex

Jan 19

helpful tip for working out when to use ai to write for you: - if you’re writing something that matters, don’t write it with ai - if you’re writing something that doesn’t matter, still don’t write it with ai. in fact don’t write it at all

153

1,101

15,050

Ed Zitron

CK retweeted

Ed Zitron

@edzitron

Jan 19

Great profile of me in The Guardian, on the front page of the Life & Arts section in print. Also suggests I should be played by Robert Pattinson or Paul Mescal in the AI bubble movie… theguardian.com/technology/2…

625

29,483

CK

CK @seaspinner

Jan 17

RT @kortizart: I hope, when we enter the era where we hold everyone accountable for these barbaric and brutal moments, that corporations an…

Chomba Bupe

CK retweeted

Chomba Bupe @ChombaBupe

Jan 6

Artists have stopped sharing their works, writer's have stopped sharing their work, now engineering will stop sharing their work all thanks to generative artificial intelligence. Looks like a step backwards for humanity, what GenAI has done.

Julia Turc

@juliarturc

Jan 5

Replying to @juliarturc

We used to open-source libraries in order help other humans. But now I don't care if Anthropic saves 1,000 tokens thanks to my library. The only way I see this working is some sort of marketplace where developers offer AI-optimized libraries and the clankers choose to pay when they predict it'll save them tokens.

3,037

15,227

292,409

neil turkewitz

CK retweeted

neil turkewitz @neilturkewitz

Jan 5

I don’t know who needs to hear this, but if anyone on here is telling women that to avoid being virtually undressed by AI models, they need to stay off the grid, please understand that you are contributing to a problem, not solving it. Thank you for your attention to this matter.

296

Michelle Cowbourne

CK retweeted

Michelle Cowbourne

@Glastomichelle

Jan 4

Sunrise on Glastonbury Tor this morning. A beautiful start to the day and plenty of people made the climb and were rewarded with a stunning sky.

199

1,321

12,615

neil turkewitz

CK retweeted

neil turkewitz @neilturkewitz

30 Dec 2025

Over at the @washingtonpost, @geoffreyfowler writes: “I can tell you which AI tools are worth using — & which to avoid.” I can save you time—Avoid Them All. They’re the product of exploitation, diminish our agency, & enrich the worst people on the planet. wapo.st/49gIuAa

Column | ChatGPT is overrated. Here’s what to use instead.

When I want help from AI, ChatGPT is no longer my default first stop.

washingtonpost.com

394

Ed Zitron

CK retweeted

Ed Zitron

@edzitron

22 Dec 2025

Premium: How The AI Bubble bursts in 2026 - The largest funder of AI data centers is pulling out, OpenAI and Anthropic need more money than ever during a massive VC liquidity crisis - and NVIDIA's debt-powered customer base is quietly shrinking. wheresyoured.at/premium-how-…

Premium - How The AI Bubble Bursts In 2026

Hello and welcome to the final premium edition of Where's Your Ed At for the year. Since kicking off premium, we've had some incredible bangers that I recommend you revisit (or subscribe and read in...

wheresyoured.at

206

1,139

134,151

CK

CK @seaspinner

20 Dec 2025

So it's "Adjust your chatbot's addictiveness level" - and then when there's even more chatbot addiction causing more harm, OpenAI will just blame the user. As they're trying to do now, but they hope this will help them in court.

TechCrunch

@TechCrunch

20 Dec 2025

OpenAI allows users to directly adjust ChatGPT’s enthusiasm level techcrunch.com/2025/12/20/op…

More Perfect Union

CK retweeted

More Perfect Union

@MorePerfectUS

19 Dec 2025

NEW: Sam Altman wants to make a deal with us: he'll give us a utopian future, if we give him... everything. $750 billion in investment. As much electricity as the population of India. And all of our data. And if he's wrong he'll still profit off of what comes next:

14:08

168

1,631

6,493

920,226

John Harwood

CK retweeted

John Harwood

@JohnJHarwood

17 Dec 2025

the incumbent president is mentally ill

Dan Diamond

@ddiamond

17 Dec 2025

Pool report from @joeygarrison — new plaques installed outside the Oval Office that mock past Democrat presidents. The one that names Barack Hussein Obama says he presided over the "highly ineffective 'Unaffordable' Care Act," among other critiques.

1,758

2,523

14,134

485,307

Ed Newton-Rex

CK retweeted

Ed Newton-Rex

@ednewtonrex

16 Dec 2025

Brutal (but fair) Guardian headline today Only 3% of respondents to the UK government’s consultation on AI & copyright supported their unfair plan to hand AI companies people’s work for free. 95% said AI companies should license training data. How can tech minister Liz Kendall claim there is “no clear consensus”? What on earth would consensus look like if not this? Government should rule out an AI copyright exception immediately. theguardian.com/technology/2…

235

1,368

34,671

Marge Nelk

CK retweeted

Marge Nelk @NelkMarge

16 Dec 2025

Replying to @ednewtonrex

There is certainly a clear consensus that people don’t like theft and exploitation. This idea doesn’t even need lobbying, it’s elementary. It’s strange that governments aren’t protecting creative people and are instead letting U.S. tech giants take cultural heritage for free.

158

neil turkewitz

CK retweeted

neil turkewitz @neilturkewitz

15 Dec 2025

This alone should be reason for every self-respecting member of Congress to call for his removal.

Jake Tapper 🦅

@jaketapper

15 Dec 2025

The president’s TruthSocial post about the Reiners’ horrific deaths is quite a contrast from how Rob Reiner reacted to the murder of Charlie Kirk.

336

MeidasTouch

CK retweeted

MeidasTouch

@MeidasTouch

15 Dec 2025

This goes way beyond politics. This is pure evil.

Ron Filipkowski

@RonFilipkowski

15 Dec 2025

This is both disgusting and expected. The Scumbag-in-Chief

500

4,412

36,433

458,372