Liberal. Music lover. Writer (fiction - different genres - and I also wrote about music). Here to talk about politics, music and other people's work.

Joined January 2011
119 Photos and videos
Pinned Tweet
12 Nov 2024
I now have an account there - CK @ bluespinner
8 Nov 2024
I've learned there's a BlueSky account using the handle @seaspinner. That is NOT me - I have no idea who they are - and to avoid confusion I do want people here, especially my followers, to know that isn't my account. I'll let you know if/when I get an account there.
2
2
1,176
CK retweeted
Replying to @simonw
I've seen it as well for writing and research-based tasks. Automating processes breaks traditional systems of meaning. We can't create relationships with work without some activity in the process. AI is completing tasks too vividly for us to think about.
1
2
92
CK retweeted
Short musings on "cognitive debt" - I'm seeing this in my own work, where excessive unreviewed AI-generated code leads me to lose a firm mental model of what I've built, which then makes it harder to confidently make future decisions simonwillison.net/2026/Feb/1…
140
205
2,307
162,236
CK retweeted
"LLM's can't reason." Stanford paper: "... unless we start measuring how models fail not just how often they succeed we’ll keep deploying systems that pass benchmarks, fail silently in production, and explain themselves with total confidence while doing the wrong thing."
🚨 Holy shit… Stanford just published the most uncomfortable paper on LLM reasoning I’ve read in a long time. This isn’t a flashy new model or a leaderboard win. It’s a systematic teardown of how and why large language models keep failing at reasoning even when benchmarks say they’re doing great. The paper does one very smart thing upfront: it introduces a clean taxonomy instead of more anecdotes. The authors split reasoning into non-embodied and embodied. Non-embodied reasoning is what most benchmarks test and it’s further divided into informal reasoning (intuition, social judgment, commonsense heuristics) and formal reasoning (logic, math, code, symbolic manipulation). Embodied reasoning is where models must reason about the physical world, space, causality, and action under real constraints. Across all three, the same failure patterns keep showing up. > First are fundamental failures baked into current architectures. Models generate answers that look coherent but collapse under light logical pressure. They shortcut, pattern-match, or hallucinate steps instead of executing a consistent reasoning process. > Second are application-specific failures. A model that looks strong on math benchmarks can quietly fall apart in scientific reasoning, planning, or multi-step decision making. Performance does not transfer nearly as well as leaderboards imply. > Third are robustness failures. Tiny changes in wording, ordering, or context can flip an answer entirely. The reasoning wasn’t stable to begin with; it just happened to work for that phrasing. One of the most disturbing findings is how often models produce unfaithful reasoning. They give the correct final answer while providing explanations that are logically wrong, incomplete, or fabricated. This is worse than being wrong, because it trains users to trust explanations that don’t correspond to the actual decision process. Embodied reasoning is where things really fall apart. LLMs systematically fail at physical commonsense, spatial reasoning, and basic physics because they have no grounded experience. Even in text-only settings, as soon as a task implicitly depends on real-world dynamics, failures become predictable and repeatable. The authors don’t just criticize. They outline mitigation paths: inference-time scaling, analogical memory, external verification, and evaluations that deliberately inject known failure cases instead of optimizing for leaderboard performance. But they’re very clear that none of these are silver bullets yet. The takeaway isn’t that LLMs can’t reason. It’s more uncomfortable than that. LLMs reason just enough to sound convincing, but not enough to be reliable. And unless we start measuring how models fail not just how often they succeed we’ll keep deploying systems that pass benchmarks, fail silently in production, and explain themselves with total confidence while doing the wrong thing. That’s the real warning shot in this paper. Paper: Large Language Model Reasoning Failures
34
267
2,626
203,879
CK retweeted
Ironic that TIME would embrace AI which denies truth, context and time itself. The selling of a lie, brought to you by the good folks at Time!
Jan 29
America wasn’t born in a moment. Before it was a nation, it was an idea—and in 1776, nothing about it was inevitable. ‘On This Day…1776’ is a new TIME Studios series executive produced by @DarrenAronofsky that tells the story of the year that shaped a nation. Released in short weekly episodes around the dates events occurred. Premiering on TIME’s YouTube channel. Watch the series: time-magazine.visitlink.me/I…
3
4
41
1,456
CK retweeted
“technofascism tends to operate through…insidious mechanisms of control, such as data extraction, algorithmic governance, behavioral nudging & platform monopolization…creat[ing] depoliticization, atomization & polarization, all under the guise of convenience & personalization.”
3
200
593
24,132
CK retweeted
helpful tip for working out when to use ai to write for you: - if you’re writing something that matters, don’t write it with ai - if you’re writing something that doesn’t matter, still don’t write it with ai. in fact don’t write it at all
12
153
1,101
15,050
CK retweeted
Great profile of me in The Guardian, on the front page of the Life & Arts section in print. Also suggests I should be played by Robert Pattinson or Paul Mescal in the AI bubble movie… theguardian.com/technology/2…
30
41
625
29,483
Jan 17
RT @kortizart: I hope, when we enter the era where we hold everyone accountable for these barbaric and brutal moments, that corporations an…
97
CK retweeted
Artists have stopped sharing their works, writer's have stopped sharing their work, now engineering will stop sharing their work all thanks to generative artificial intelligence. Looks like a step backwards for humanity, what GenAI has done.
Replying to @juliarturc
We used to open-source libraries in order help other humans. But now I don't care if Anthropic saves 1,000 tokens thanks to my library. The only way I see this working is some sort of marketplace where developers offer AI-optimized libraries and the clankers choose to pay when they predict it'll save them tokens.
94
3,037
15,227
292,409
CK retweeted
I don’t know who needs to hear this, but if anyone on here is telling women that to avoid being virtually undressed by AI models, they need to stay off the grid, please understand that you are contributing to a problem, not solving it. Thank you for your attention to this matter.
1
5
8
296
CK retweeted
Sunrise on Glastonbury Tor this morning. A beautiful start to the day and plenty of people made the climb and were rewarded with a stunning sky.
32
199
1,321
12,615
CK retweeted
Over at the @washingtonpost, @geoffreyfowler writes: “I can tell you which AI tools are worth using — & which to avoid.” I can save you time—Avoid Them All. They’re the product of exploitation, diminish our agency, & enrich the worst people on the planet. wapo.st/49gIuAa
1
2
13
394
CK retweeted
22 Dec 2025
Premium: How The AI Bubble bursts in 2026 - The largest funder of AI data centers is pulling out, OpenAI and Anthropic need more money than ever during a massive VC liquidity crisis - and NVIDIA's debt-powered customer base is quietly shrinking. wheresyoured.at/premium-how-…
21
206
1,139
134,151
20 Dec 2025
So it's "Adjust your chatbot's addictiveness level" - and then when there's even more chatbot addiction causing more harm, OpenAI will just blame the user. As they're trying to do now, but they hope this will help them in court.
20 Dec 2025
OpenAI allows users to directly adjust ChatGPT’s enthusiasm level techcrunch.com/2025/12/20/op…
2
52
CK retweeted
NEW: Sam Altman wants to make a deal with us: he'll give us a utopian future, if we give him... everything. $750 billion in investment. As much electricity as the population of India. And all of our data. And if he's wrong he'll still profit off of what comes next:
168
1,631
6,493
920,226
CK retweeted
the incumbent president is mentally ill
17 Dec 2025
Pool report from @joeygarrison — new plaques installed outside the Oval Office that mock past Democrat presidents. The one that names Barack Hussein Obama says he presided over the "highly ineffective 'Unaffordable' Care Act," among other critiques.
1,758
2,523
14,134
485,307
CK retweeted
Brutal (but fair) Guardian headline today Only 3% of respondents to the UK government’s consultation on AI & copyright supported their unfair plan to hand AI companies people’s work for free. 95% said AI companies should license training data. How can tech minister Liz Kendall claim there is “no clear consensus”? What on earth would consensus look like if not this? Government should rule out an AI copyright exception immediately. theguardian.com/technology/2…
13
235
1,368
34,671
CK retweeted
16 Dec 2025
Replying to @ednewtonrex
There is certainly a clear consensus that people don’t like theft and exploitation. This idea doesn’t even need lobbying, it’s elementary. It’s strange that governments aren’t protecting creative people and are instead letting U.S. tech giants take cultural heritage for free.
1
3
158
CK retweeted
This alone should be reason for every self-respecting member of Congress to call for his removal.
The president’s TruthSocial post about the Reiners’ horrific deaths is quite a contrast from how Rob Reiner reacted to the murder of Charlie Kirk.
3
9
336
CK retweeted
This goes way beyond politics. This is pure evil.
This is both disgusting and expected. The Scumbag-in-Chief
500
4,412
36,433
458,372