A new cover for SUPER AGERS after making the NYT bestseller list. Thanks to you for making it the #1 ranked new non-fiction book on Amazon.
amazon.com/gp/new-releases/b…
For medical information, general AI frontier models (Google, OpenAI, Anthropic) outperformed specialized @EvidenceOpen and @UpToDate, as assessed by 12 US clinicians who were randomized and blinded to which model they were rating, and by extensive testing/benchmarks. This was not anticipated. @NatureMedicine nature.com/articles/s41591-0…
Here is the performance breakdown for each model's blinded assessment for 4 major tasks: (1) clinical correctness, (2) completeness, (3) safety, and (4) clarity.
The overall ranking.
Congratulations to @ekoermann, @krithikvish, and their team @nyulangone for getting this done. We need more of these rigorous assessments.
Getting to the root of age-related diseases. By studying a rare accelerated aging genetic disorder, gain-of-function mutations of DNMT3A were found to be causal.
DNA hyper-methylation was then linked to stem cell dysfunction and multiple age-related diseases (blood, bone, metabolic). Work in mice and humans. @NatureGenet nature.com/articles/s41588-0…
Interleukin-17 (IL-17 and its receptor, IL7R) is emerging as a key mediator in many immune-related diseases. Today in @SciImmunology, a superb review
science.org/doi/10.1126/scii…
There has been a push to use OpenEvidence AI for doctors. But this paper suggests general models are much better: “Frontier LLMs outperformed clinical AI tools in all three evaluations. Clinical AI tools performed comparably to auto-enabled Google Search AI Overview on the RCQ.”
Medicine discovers the bitter lesson: frontier LLMs (here GPT 5.2, Opus 4.6, Gemini 3.1) outperform specialized "clinical AI" (e.g. OpenEvidence) in a blind test.
Even funnier that hospital IT are more likely to approve the *specialized* versions despite them being worse.
Workforce survival in healthcare
"The coming decade demands that we stop asking whether AI can replace clinicians and start asking how it can help us keep them."
thelancet.com/journals/lance…
Not every day you see an odds ratio of 50 (for interleukin-10 autoantibodies and a common HLA allele).
~80% of patients with inflammatory bowel disease (IBD) have this HLA allele
These individuals (~3.5% of IBD) may benefit from B cell depletion (such as achieved via CAR T).
nejm.org/doi/full/10.1056/NE…
The cerebellum has long been considered spared from being tied to cognitive resilience and Alzheimer's disease. That turned out to be wrong. @NatureNeuro nature.com/articles/s41593-0…