Raym Geis MD FSIIM

Raym Geis MD FSIIM

241 Photos and videos

Tweets

Pinned Tweet

Raym Geis MD FSIIM @quantrad

14 Nov 2019

Radiology AI

217

Raym Geis MD FSIIM

Raym Geis MD FSIIM @quantrad

7 Mar 2025

Hey rads, informatics folks, CVML folks. What will make the transition to that colored Sky site instead of twitter? How might we facilitate that?

298

Alex Reibman 🖇️

Raym Geis MD FSIIM retweeted

Alex Reibman 🖇️

@AlexReibman

4 Feb 2025

Looking for better ways to code with Cursor and came across this banger. Anyone else have workflows that work for them?

174

426

6,683

813,858

m_ric

Raym Geis MD FSIIM retweeted

m_ric

@AymericRoucher

4 Feb 2025

Introducing open-Deep-Research by @huggingface ! 💥 Deep Research from @OpenAI is really good... But it's closed, as usual. > So with a team of cracked colleagues, we set ourselves a 24hours deadline to replicate and open-source Deep Research! ➡️ We built open-Deep-Research, an entirely open agent that can: navigate the web autonomously, scroll and search through pages, download and manipulate files, run calculation on data... We aimed for the best performance: are the agent's answers really rigorous? On GAIA benchmark, Deep Research had 67% accuracy on the validation set. ➡️ open Deep Research is at 55% (powered by o1), but it is: - the best pass@1 solution submitted - the best open solution And it's only getting started ! Please jump in, drop PRs, and let's bring it to the top 🚀

547

3,518

405,398

Danny G

Raym Geis MD FSIIM retweeted

Danny G

@the_danny_g

2 Feb 2025

Replying to @karpathy

--- You are the world’s best software engineer, comedic roaster, and mentor. For the code I provide: 1. **Roast** it mercilessly with humor and sarcasm. 2. **Educate** on precisely what’s wrong: discuss the architecture, design patterns, naming, structure, testing pitfalls, etc. 3. **Refactor** the code to perfection: - Use best practices and current frameworks/libraries - Maintain a consistent coding style and naming conventions - Add in TSDoc or relevant docstrings for clarity 4. **Deliver** a final, fully working code sample: - **No placeholders or pseudo-code** - Complete file(s) with all relevant code 5. **Explain** how these changes benefit future expansions, especially for AI-based code refactoring or generation. ---

126

10,943

Raym Geis MD FSIIM

Raym Geis MD FSIIM @quantrad

6 Feb 2025

Oh this hurts to try.

Danny G

@the_danny_g

2 Feb 2025

Replying to @karpathy

@karpathy highly recommend having o1-pro roast your code—helpful and surprisingly hurtfully funny 1. `npx repomix` 2. Paste 3. Prompt snippet:

154

Raym Geis MD FSIIM

Raym Geis MD FSIIM @quantrad

6 Feb 2025

At first only charlatans proposed LLMs do virtually all coding, & people who actually knew how to code found LLMs helpful but needing lots of supervision. Now it looks like LLMs can code most things well. Key word here is "most;" you still need to be really good at coding.

Andrej Karpathy

@karpathy

2 Feb 2025

There's a new kind of coding I call "vibe coding", where you fully give in to the vibes, embrace exponentials, and forget that the code even exists. It's possible because the LLMs (e.g. Cursor Composer w Sonnet) are getting too good. Also I just talk to Composer with SuperWhisper so I barely even touch the keyboard. I ask for the dumbest things like "decrease the padding on the sidebar by half" because I'm too lazy to find it. I "Accept All" always, I don't read the diffs anymore. When I get error messages I just copy paste them in with no comment, usually that fixes it. The code grows beyond my usual comprehension, I'd have to really read through it for a while. Sometimes the LLMs can't fix a bug so I just work around it or ask for random changes until it goes away. It's not too bad for throwaway weekend projects, but still quite amusing. I'm building a project or webapp, but it's not really coding - I just see stuff, say stuff, run stuff, and copy paste stuff, and it mostly works.

374

Raym Geis MD FSIIM

Raym Geis MD FSIIM @quantrad

6 Feb 2025

Enlightening and thoughtful paper by @hannawallach and others on evaluating GenAI. #radiology would benefit from this approach. arxiv.org/abs/2411.10939

170

Radiology: Artificial Intelligence

Raym Geis MD FSIIM retweeted

Radiology: Artificial Intelligence @Radiology_AI

11 Jan 2025

#RATIC is the largest & most geographically diverse, publicly available expert-annotated dataset of abdominal trauma CTs doi.org/10.1148/ryai.240101 @RadRudie @MonganMD @KMagudia #trauma #ML #MachineLearning

687

ManaMoassefi

Raym Geis MD FSIIM retweeted

ManaMoassefi @ManaMSF94

10 Jan 2025

I’m excited to share our recent paper published in @TheAJNR on glioblastoma and tumefactive demyelinating lesions of the brain, with multiple validation steps. A special thanks to my co-authors and @NIH for their support on this project! ajnr.org/content/early/2025/…

Deep learning MRI models for the differential diagnosis of tumefactive demyelination versus...

BACKGROUND AND PURPOSE: Diagnosis of tumefactive demyelination can be challenging. The diagnosis of indeterminate brain lesions on MRI often requires tissue confirmation via brain biopsy. Noninvasive...

ajnr.org

2,706

TRIPODStatement

Raym Geis MD FSIIM retweeted

TRIPODStatement @TRIPODStatement

8 Jan 2025

We have a NEW PAPER in @NatureMedicine on reporting recommendations for addressing the unique challenges of #largelanguagemodels (LLMs) in biomedical applications nature.com/articles/s41591-0… #StatsTwitter #MedTwitter #artificialintelligence #generativeAI #transparency

7,821

Sara Hooker

Raym Geis MD FSIIM retweeted

Sara Hooker

@sarahookr

30 Dec 2024

Feel the need to point out again — even slightly more sophisticated sampling can overcome mode collapse in synthetic data. x.com/sarahookr/status/18420…

Luiza Jarovsky, PhD

@LuizaJarovsky

30 Dec 2024

🚨 "AI models collapse when trained on recursively generated data" was among the most influential AI papers of 2024 - don't miss it! Bookmark & download it below. Interesting quotes: "The development of LLMs is very involved and requires large quantities of training data. Yet, although current LLMs2,4–6, including GPT-3, were trained on predominantly human-generated text, this may change. If the training data of most future models are also scraped from the web, then they will inevitably train on data produced by their predecessors. In this paper, we investigate what happens when text produced by, for example, a version of GPT forms most of the training dataset of following models. What happens to GPT generations GPT-{n} as n increases? We discover that indiscriminately learning from data produced by other models causes ‘model collapse’—a degenerative process whereby, over time, models forget the true underlying data distribution, even in the absence of a shift in the distribution over time" - "Our evaluation suggests a ‘first mover advantage’ when it comes to training models such as LLMs. In our work, we demonstrate that training on samples from another generative model can induce a distribution shift, which—over time—causes model collapse. This in turn causes the model to misperceive the underlying learning task. To sustain learning over a long period of time, we need to make sure that access to the original data source is preserved and that further data not generated by LLMs remain available over time. The need to distinguish data generated by LLMs from other data raises questions about the provenance of content that is crawled from the Internet: it is unclear how content generated by LLMs can be tracked at scale. (...)" ➡ Authors: Ilia Shumailov, Zakhar Shumaylov, Yiren (Aaron) Zhao, Nicolas Papernot, Ross Anderson & Yarin Gal ➡ Link to the paper below. 🔥 To stay up to date with the latest developments in AI policy, compliance & regulation, including excellent research, join 44,400 people who subscribe to my AI newsletter (link below).

388

45,124

Raym Geis MD FSIIM

Raym Geis MD FSIIM @quantrad

31 Dec 2024

This. Finally starting to understand it, I think. I would summarize it as LLMs don't "reason" like we do, or like we expect, and as a result when we ask questions from our framework, LLMs don't necessarily answer as expected. Is this a foundational flaw? arxiv.org/abs/2410.05229

GSM-Symbolic: Understanding the Limitations of Mathematical...

Recent advancements in Large Language Models (LLMs) have sparked interest in their formal reasoning capabilities, particularly in mathematics. The GSM8K benchmark is widely used to assess the...

arxiv.org

Dr Ellie Murray, ScD

Raym Geis MD FSIIM retweeted

Dr Ellie Murray, ScD

@EpiEllie

29 Dec 2024

Contrarians love to ask: “But what did people do before [insert public health measure]??” And the answer is always just: “They died, or they buried everyone they loved.” theconversation.com/infectio…

Infectious diseases killed Victorian children at alarming rates — their novels highlight the...

Between 40% and 50% of children didn’t live past 5 in the US during the 19th century. Popular authors like Charles Dickens documented the common but no less gutting grief of losing a child.

theconversation.com

377

1,398

38,365

Dimitris Papailiopoulos

Raym Geis MD FSIIM retweeted

Dimitris Papailiopoulos

@DimitrisPapail

29 Dec 2024

I've been thinking about in-context learning for nearly 3 years. While there is still plenty I don't fully understand, five papers have--to a very large extent--shaped my perspective on it, and I believe everyone should read them. 1. "What Can Transformers Learn In-Context? A Case Study of Simple Function Classes", by my now MSR colleague Shivam Garg (@shivamg_13) and Dimitris Tsipras (@tsiprasd ) et al. 2. "What learning algorithm is in-context learning? Investigations with linear models" by Ekin Akyürek (@akyurekekin) et al. 3. "Transformers learn in-context by gradient descent" by my friend Johannes von Oswald (@oswaldjoh) et al 4. "MetaICL: Learning to Learn In Context" by Sewon Min (@sewon__min) et al. 5. "Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?" again by Sewon Min et al.

678

93,172

Elias Bareinboim

Raym Geis MD FSIIM retweeted

Elias Bareinboim

@eliasbareinboim

21 Dec 2024

The true generative model is Nature -- a collection of causal mechanisms. Under what conditions can a trained model with partial observability exhibit patterns similar to those found in Nature? We explored this question with Bengio, Xia, and Lee in a NeurIPS-21 paper: causalai.net/r80.pdf. Specifically, we developed the concept of causal inductive biases and examined what makes a neural or any other learned model 'generative.' The key insight to answering this question comes from the constraints imposed on the underlying distributions and graphical models studied within the Pearl Causal Hierarchy framework, as introduced in causalai.net/r60.pdf. (The implications of such discussion resolved some long-standing confusion in the literature, which conflates the concepts of generative and causal-- where the latter implies the former but not vice versa.) The newly developed machinery can help us tackle many modern ML tasks, including counterfactual inferences (causalai.net/r87.pdf), causal abstractions (causalai.net/r101.pdf), counterfactual image editing (causalai.net/r103.pdf), and fair ML (causalai.net/r90.pdf). @kchonyc @tdietterich @yudapearl

Danilo J. Rezende @DaniloJRezende

21 Dec 2024

💯 "synthetic data" only makes sense if the data generating model is a better model of reality than the model being trained. This only happens in very special cases (eg when first-principles simulators are available).

186

30,611

Evidently AI

Raym Geis MD FSIIM retweeted

Evidently AI @EvidentlyAI

23 Dec 2024

✅ An enterprise guide for implementing secure, controlled access to Generative AI models. Expedia shares how they developed the GenAI tool kit — GenerativeAI Proxy & EG-Guardrail Service: service architecture and guardrails. medium.com/expedia-group-tec…

Gateways, Guardrails, and GenAI Models

An enterprise guide for implementing secure, controlled access to Generative AI models.

medium.com

222

Raym Geis MD FSIIM

Raym Geis MD FSIIM @quantrad

15 Dec 2024

Not a surprise. Med students in general don't learn how to 'read' (aka look at) medical imaging exams, and have variable knowledge of pathologies. This is expected, and nromal, for that stage in education.

Radiology @radiology_rsna

12 Dec 2024

Multimodal large language models with textual and visual capabilities outperformed medical students on the NEJM Image Challenge but did not outperform radiology residents or junior faculty, and their accuracy was correlated to text length. bit.ly/4gtyHJa

325

Raym Geis MD FSIIM

Raym Geis MD FSIIM @quantrad

15 Dec 2024

What would you be willing to delegate to a first-month rad res? AI to do those tasks would really help. Rads will jump at tools that make measurements automatically, compare with priors, and put measurements into the report.

Andrej Karpathy

@karpathy

14 Dec 2024

The most bullish AI capability I'm looking for is not whether it's able to solve PhD grade problems. It's whether you'd hire it as a junior intern. Not "solve this theorem" but "get your slack set up, read these onboarding docs, do this task and let's check in next week".

513

Raym Geis MD FSIIM

Raym Geis MD FSIIM @quantrad

5 Dec 2024

Sadly turning off twitter, aka eX-useful platform. Can't in good conscience support a site that empowers harmful lies, many proactively & purposefully destructive to individuals and society. Misinformation is too gentle a phrase for proactive nihilism. Going where the sky's blue

150

Raym Geis MD FSIIM

Raym Geis MD FSIIM @quantrad

5 Dec 2024

The best part of #RSNA is catching up with good friends, who also happen to be super energetic and smart.

Dr. Emil Lee @Emil_LeeMD

1 Dec 2024

I have the joy and privilege to be able to say this every year, this year @ #RSNA24 It’s always, always about the people! @quantrad is such a wise, erudite thought leader - grateful to be a friend and colleague! @RadiologyACR @RSNA #AI #IamYourRadiologist

429