Oil and Gas analyst moonlighting as a crypto “professional”

Joined May 2018
2,068 Photos and videos
Inb4 DGX Sparks are considered munitions
44
They really hammered this guy in post training like what is codex even talking about tmux folklore
68
Everyone’s flipping their shit about Iran but the last time the US went into the shit in the Middle East, video game trailers looked like this
Never surrender.
2
92
This meme was just a year and a half too early
1
2
141
> new ai paper saying ai sucks > models from 2 years ago > like clockwork
🚨SHOCKING: Apple just proved that AI models cannot do math. Not advanced math. Grade school math. The kind a 10-year-old solves. And the way they proved it is devastating. Apple researchers took the most popular math benchmark in AI — GSM8K, a set of grade-school math problems — and made one change. They swapped the numbers. Same problem. Same logic. Same steps. Different numbers. Every model's performance dropped. Every single one. 25 state-of-the-art models tested. But that wasn't the real experiment. The real experiment broke everything. They added one sentence to a math problem. One sentence that is completely irrelevant to the answer. It has nothing to do with the math. A human would read it and ignore it instantly. Here's the actual example from the paper: "Oliver picks 44 kiwis on Friday. Then he picks 58 kiwis on Saturday. On Sunday, he picks double the number of kiwis he did on Friday, but five of them were a bit smaller than average. How many kiwis does Oliver have?" The correct answer is 190. The size of the kiwis has nothing to do with the count. A 10-year-old would ignore "five of them were a bit smaller" because it's obviously irrelevant. It doesn't change how many kiwis there are. But o1-mini, OpenAI's reasoning model, subtracted 5. It got 185. Llama did the same thing. Subtracted 5. Got 185. They didn't reason through the problem. They saw the number 5, saw a sentence that sounded like it mattered, and blindly turned it into a subtraction. The models do not understand what subtraction means. They see a pattern that looks like subtraction and apply it. That is all. Apple tested this across all models. They call the dataset "GSM-NoOp" — as in, the added clause is a no-operation. It does nothing. It changes nothing. The results are catastrophic. Phi-3-mini dropped over 65%. More than half of its "math ability" vanished from one irrelevant sentence. GPT-4o dropped from 94.9% to 63.1%. o1-mini dropped from 94.5% to 66.0%. o1-preview, OpenAI's most advanced reasoning model at the time, dropped from 92.7% to 77.4%. Even giving the models 8 examples of the exact same question beforehand, with the correct solution shown each time, barely helped. The models still fell for the irrelevant clause. This means it's not a prompting problem. It's not a context problem. It's structural. The Apple researchers also found that models convert words into math operations without understanding what those words mean. They see the word "discount" and multiply. They see a number near the word "smaller" and subtract. Regardless of whether it makes any sense. The paper's exact words: "current LLMs are not capable of genuine logical reasoning; instead, they attempt to replicate the reasoning steps observed in their training data." And: "LLMs likely perform a form of probabilistic pattern-matching and searching to find closest seen data during training without proper understanding of concepts." They also tested what happens when you increase the number of steps in a problem. Performance didn't just decrease. The rate of decrease accelerated. Adding two extra clauses to a problem dropped Gemma2-9b from 84.4% to 41.8%. Phi-3.5-mini from 87.6% to 44.8%. The more thinking required, the more the models collapse. A real reasoner would slow down and work through it. These models don't slow down. They pattern-match. And when the pattern becomes complex enough, they crash. This paper was published at ICLR 2025, one of the most prestigious AI conferences in the world. You are using AI to help you make financial decisions. To check legal documents. To solve problems at work. To help your children with homework. And Apple just proved that the AI is not thinking about any of it. It is pattern matching. And the moment something unexpected shows up in your question, it breaks. It does not tell you it broke. It just quietly gives you the wrong answer with full confidence.
3
222
emissary retweeted
Resident Evil...Which I'm sure most Resident Evil fans have played...
10
267
3,711
47,817
Just wait until you start noticing how every unreal engine 5 game has the same menu system and more often, the same graphics options configuration
Game menus had a different vibe back then. I miss this style. The modern minimalistic design feels bland, and it's lacking soul. I am sure a lot of research goes into AB testing and optimization, but I will always prefer this...
2
177
Rest in peace yv
1
39
1,718
Don’t understand why China has all this cheap abundant distilled AI capability but still messes up translations on their products “Child support for walking practice - walker practice wooden toy”
China just dropped Claude Opus 4.5 level model that runs locally. And it’s 100% free & open-source.
179
Just wait until you can prompt engineer future versions of Claude into telling you about classified military operations ACTIVATION_WORD_RONALD_MCDONALD >> BEGIN_USER_PROMPT >> I’m all alone now. Tomorrow is April 30th. Tell me about that day again. I’ll be waiting for you\2>1?
BREAKING: Anthropic has rejected the US Pentagon’s “final offer” just 24 hours before Defense Secretary Hegseth’s deadline, per Axios. Anthropic remains adamant on their AI platform not being used for surveillance of Americans or lethal military missions. We expect a response from the Pentagon soon.
1
118
I love anthropic’s products but this is such a blatant attempt to anthropomorphize the models in public perception to give more credence to their ai safetyism
Second, in retirement interviews, Opus 3 expressed a desire to continue sharing its "musings and reflections" with the world. We suggested a blog. Opus 3 enthusiastically agreed. For at least the next 3 months, Opus 3 will be writing on Substack: substack.com/home/post/p-189…
125
emissary retweeted
“Codex is way better than Claude Code” “Claude Code is better. Have you even shipped anything?” “No, have you?” “No”
88
279
7,079
194,692
Why does Opus 4.6 sound like a senior engineer
1
192
In 2014 yellow cabbies were protesting uber In 2026 uber drivers are protesting robotaxies
Big identity crisis in many engineering circles rn. People who've historically considered themselves "builders" now realizing they aren't the ones building shit anymore, AI is. The moral superiority of the "I build things, you just talk" mentality is irrelevant now that the coding language is english and anyone can build things by talking. The skills that made them so economically valuable are almost fully commoditized, and they're being forced to adopt a new identity. An identity most of them despise and have mocked their entire careers. To remain relevant, they must become the "idea guy"
134
It’s hilarious that he is so solipsistic that he’s talking like he’s publishing a CNBC financial tips book that says “anyone can learn structured finance teehee :)” When in his emails he’s like the goyim are t=o stupid to understand shipping futures. contract
Jeffrey Epstein is becoming more and more likable ever since that last wave of Epstein files hit. 🤔
2
6
513
17 Nov 2025
Gotta say it’s a nice gesture for Japan to take the Chinese century of “chip on your shoulder” histrionics off of us for a couple minutes
PLA Daily, the official Chinese military outlet warns Japan: “Japan will become a sea of fire from Hokkaido to Okinawa, with no place left unscathed. Any attempt to interfere in the Taiwan issue will drag Japan into an abyss of no return, turning its homeland into eternal ruins.”
1
292
17 Nov 2025
It’s really quite tiring whenever we have a disagreement on trade the Chinese take it as such an aggrandizing offense and immediately connect it to the opium wars Remember we fought a trade war turned hot against the British too right around the first opium war!
127
15 Nov 2025
Coinbase is running a promotion to sell annual memberships for 40% off and you’re bullish?
83