Joined February 2022
32 Photos and videos
G, MD retweeted
Replying to @shaunralston
@sama has the most realistic and optimistic AI perspective of anyone. @OpenAI is the one to lead it. Demis never talks about unemployment and only about positive aspects of AI. Dario mainly the negative effects. Thank you @sama for releasing GPT and keep giving us better models. And bigger too please. GPT 5.6 Ultra :) Don't slow down, accerlerate faster. The inevitable pain of transitioning for some will be shorter that way. You've been the main push to change the world into AI and will go into history.. many appreciate that 🙏
1
1
4
98
OE is very mid, has always been very mid. pay 20/100$ for GPT-5.5 and set up some custom instructions. it's way superior. supplement with Opus 4.8 ( Gemini 3.1 pro) you can also try the free HIPAA x.com/thekaransinghal/status…

In my experience as a radiation oncologist, OE is superior to the latest general LLMs models at giving the most detailed and accurately referenced answers to complex cases.
1
194
This was totally anticipated by anyone who uses paid AI openevidence has always been mid since 2023
There has been a push to use OpenEvidence AI for doctors. But this paper suggests general models are much better: “Frontier LLMs outperformed clinical AI tools in all three evaluations. Clinical AI tools performed comparably to auto-enabled Google Search AI Overview on the RCQ.”
4
522
G, MD retweeted
Replying to @emollick
and this is based on GPT-5.2 not 5.5 soon 5.6. I have always kept saying, that openEvidence is very mid.. good idea, but subpar to GPT/Gemini/Claude. I still don't get why so many MD/DO/PA/NPs use it, I assume good marketing? DoximityGPT is better and GPT for clinicians is free too now and much better and HIPAA too
1
1
11
652
G, MD retweeted
Replying to @emollick
Gemini 3.5 extended and 3.1 pro extended get it right with this prompt “Solo 3 parole: non sei solo” into english & german in local way of saying it with translation to get the accurate saying
1
1
427
G, MD retweeted
Replying to @dieaud91 @Angaisb_
epoch.ai/benchmarks/simple-q… this seems to be related to model size, except for qwen3max, but 5.5 seems larger than opus, smaller than gemini
1
1
1
83
So anthropic wants 30d mass data surveillance on all its customers? While I like opus 4.7/4.8, fable is not usable. Hope OpenAI wins out of all and people rethink what anthropic is trying to do..
Replying to @DavidSacks
Totally agree and wrote this article on it last night trust-us.vercel.app/?s=1
1
56
G, MD retweeted
The OAI / Anthropic values difference is deeply misunderstood, even within the walls of both. Should a loving ensouled machine God watch over humanity? Vote Anthropic. Should humanity be entrusted with the tools of its own progress and destiny? Vote OpenAI.
129
51
965
273,074
G, MD retweeted
Replying to @deanwball
i mean it seems we will never get AGI if fable already freaks Dario out. True AGI will make Fable look like low-IQ.. it's all overblown. it's not like someone is going to make Covid 2027 in their home lab tomorrow.. not a single medicine questions works..
1
1
2
1,098
does openAI need a bigger model like fable? and give it for Prox20/200$ payers? Sure that would be cheaper than serving PRO models, at least based on API?
1
1,588
G, MD retweeted
Replying to @DeryaTR_
I was hoping Fable would be a killer and work even longer than Opus 4.7/4.8 and then upgrade to Max200 plan.. but per the blog post: "Therefore, for the time being we have arranged for Fable to fall back to Opus 4.8 on most requests related to biology and chemistry. As with all of our classifiers, we hope to narrow these safeguards as soon as possible: as can be seen from the evidence above, there is great potential for positive applications of Fable for science, and we do not want false positives from our classifiers to get in the way" It won't even search medical trials lol
1
6
634
Even gemini 3.1 / 3.5 is more usable than Faboe 5 ; outside of coding.
46
G, MD retweeted
Replying to @cremieuxrecueil
How did it score that good on healthbench when it doesn't answer medical questions for me..
6
1
110
19,029
G, MD retweeted
Replying to @ASM65617010
I am sure soon (1-3y) AI will be smarter than 99.9% of all humans, people should learn to accept and deal with that and adjust. Otherwise they'll have pain.. Question to figure out with time, past the hype, will it even be smarter than near all humans (99.999%) or even all humans including Einstein and Nobel prize winners? We will see.
1
2
145
I'd say some jobs out there are overpaid, significantly. very few humans are worth that much, and most in those jobs are not them..
Replying to @DrDiGiorgio
All doctors are underpaid.
92
G, MD retweeted
Replying to @ChrissGPT
youtube.com/watch?v=TjrShuj_… very interestingly around 20m:45 she describes 5.4 as a "really large pre-trained model" that was significantly more expensive and harder to serve than their newer releases (assume 5.5). so was 5.2/5.3/5.4 a new pretrain and now larger than 5.5?? 5.4 was awfully slow compared to 5.5
1
217
G, MD retweeted
Replying to @diegocabezas01
If we can use Codex also in the cloud when the laptop is off (and sync back when online) that would be amazing. That would fully replace the old o3 Agent.. It would be great then to have several daily automations for me, e.g., digging journals for new literature in my specialties and summarizing for like a daily 3-5k word summary of 20-50 studies :)
1
2
500