Just-in-time compiler.

Joined October 2009
50 Photos and videos
“The report also showed how the model ignored requests to follow step-by-step reasoning, and it was less likely to generate code that ran without modifications.” Chat-GPT entering its toddler phase
20 Jul 2023
Yes, GPT-4 seems to be getting worse. But now we have new information. And well, it's complicated. Yesterday, I posted about a study showing that GPT-4 success rate deciding whether a number is prime went from 97.6% in March to 2.4% in June. The report also showed how the model ignored requests to follow step-by-step reasoning, and it was less likely to generate code that ran without modifications. Hundreds of people replied with their anecdotes. The overwhelming consensus is that GPT-4 is considerably less capable than before. But the study that started the conversation is misleading. They used a dataset of 500 problems and had the model figure out whether a given number was prime. The latest GPT-4 version did much worse than the one from a few months ago, with only 12 correct answers out of 500. But there was an issue: Every one of the 500 integers used in the study was a prime number! They never tested composite numbers. So what happens when you make the same comparison with composite and prime numbers? It turns out that March's GPT-4 is as bad as the June version! In March, GPT-4 answered that most numbers were prime, while the June version answered that most were composite. Since the team behind the study only tested prime numbers, they concluded that GPT-4 is now much worse at determining primality, but that's not the case. Okay, so where do we stand? Funny enough, the apparent conclusion is that GPT-4 sucks at finding whether a number is prime. It didn't get worse; it was never good at it. There's still, however, a large unanswered issue related to the inability of developers to trust these models. We still don't know why the sudden change in behavior between March and June since OpenAI has firmly denied they have changed the model. What's next? OpenAI acknowledged the behavior change, and they are investigating. I hope they publish an explanation behind the drift. I'm also looking forward to a proper versioning system that developers can trust and rely on. This finding doesn't change the overall sentiment from people who overwhelmingly believe the model has worsened. Could this be confirmation bias? Could the honeymoon phase with Large Language Models be over, and people start finding the real problems when building actual applications? What do you think it's going on here?
1
3
531
Or maybe its teen years
145
Patrick Shriwise retweeted
And now, please enjoy this 1958 AEC film 🍿⚛️ that I merely found and re-hosted on YouTube. Please enjoy POWER REACTORS USA, featuring Shippingport, APPR, Yankee Rowe, Indian Point 1, EBWR, Vallecitos, Dresden, the HREs, OMRE, SRE, EBR-1, and Fermi 1! youtube.com/watch?v=fF1Z9YmW…

4
11
44
4,093
Patrick Shriwise retweeted
Q&A with Argonne Maria Goeppert Mayer Fellow April Novak - bit.ly/45t5fOT "It’s a very exciting time to be a nuclear engineer. The last 10 years have been called a ​“renaissance” for nuclear energy."
5
16
2,358
Patrick Shriwise retweeted
I love watching frisbee when the camera is centered on the thrower because its so suspenseful. Who's gonna get open? How is the defense containing the cutters? What offense are they running? It makes for great cinema
4
7
208
16,650
Patrick Shriwise retweeted
Welcome back (checks notes, double checks) Brian Hart! Brian last played with the team in 2017. He helped lead the team to 5 straight final four appearances from 2013-2017!
3
2
42
11,973
Patrick Shriwise retweeted
Discovering on Vulkan RT on NVIDIA that I can't write to the ray payload structure in anyhit programs... Is this a driver bug? Has anyone here from the NV VKRT camp been able to do this before?
2
2
2
1,416
Patrick Shriwise retweeted
I'm looking to drum up support for a uint8_t type in HLSL. Is this something folks here would be interested in? If so, could you give a thumbs up / 1 on this github issue? Or even better, possibly chime in potential motivating reasons? github.com/microsoft/DirectX…
1
2
475
Patrick Shriwise retweeted
We are still accepting applications for three new faculty positions through January 13, 2023!! Please reach out with questions and apply ASAP!
Come join our community of #BadgerEngineers! Learn more about our available faculty positions in #FusionTechnology and #PlasmaScience at go.wisc.edu/neep-faculty-job…. Formal consideration of applications begins December 17, 2022.
4
3
1,665
Patrick Shriwise retweeted
The World Games just proved to the whole globe what we already knew… Nate Goff IS THAT DUDE! Congrats to our captain, our tall guy, and, most importantly, our friend for winning 🥇 this past week.
1
3
74
Patrick Shriwise retweeted
Ayer en la transmisión del partido de la Liga Profesional femenina de Ultimate de Estados Unidos vimos un dominio aplastante de Revolution de Colombia, pero la jugada del partido fue de las Monarcas de Milwaukee. Volada magistral de Erynn Schroeder. 🤯
2
21
103
Patrick Shriwise retweeted
20 Mar 2022
KAT SONGER EVERYONE
You're missing out if you're not watching this @Seattle_Tempest & @SD_SuperBloom game in the @WULeague
1
4
24
Patrick Shriwise retweeted
Interested in joining Machine 2022? Our tryout form is live on our website: chicago-machine.com/tryout-f…

2
5
8
Love it here, but this resonantes
2
Patrick Shriwise retweeted
Thank goodness there are people who put in this kind of hard work to formally push back against the nonsense.
New @NatureEnergyJnl paper w/@HarrisonGFell @mmildenberger & @gilbeaq critically reviews @BenjaminSovaco1 et al's claims there's scant empirical evidence nuclear power is associated w/lower CO2. Turns out: there's plenty evidence if you know how to look. rdcu.be/cFUSJ
2
10
77
Family pictures went well.
1
22
Patrick Shriwise retweeted
Help me share this opportunity with your friends and colleagues, whether on this bird-app or elsewhere, and encourage them to apply or reach out with questions
There may not be much time to see the leaves at peak autumn colors 🍁, but there is still time to apply for our faculty position in nuclear engineering - Get your application in by Dec 15, for full consideration! More info: go.wisc.edu/2021-uw-ne-facul…
5
7
Patrick Shriwise retweeted
Fierce competitor, compassionate teammate, friend to all. Our very own Walden Nelson is the 2021 Peter Farricker Spirit Award winner! ⚙️❤️
4
3
115
Patrick Shriwise retweeted
Thread: A slow weekend turned interesting when a student at Columbia, in response to a tweet suggesting the SAT and ACT were "good, actually" posted this chart.
125
4,023
20,503