Joined February 2022
584 Photos and videos
Turing Community retweeted
The energy at ICLR in Rio was incredible! From researchers pushing the boundaries of AI to conversations about what's coming next, every interaction reminded us why this community matters. Next stop: @ICMLConf in Seoul.🇰🇷 We're excited to keep the conversation going. See you there!
6
10
35,204
Turing Community retweeted
Our CEO @jonsid recently spoke with @politico about what's actually driving progress in AI, and what most people get wrong about synthetic data. Short version: real-world deployment is where models get better. Human-AI hybrid data pipelines beat pure synthetic. And we need to rethink education before superintelligence arrives. Full interview below.
1
8
14
17,199
Turing Community retweeted
Hot Take: the best #CVPR conversations happen off the conference floor. Turing is hosting a Happy Hour in Denver for researchers and enterprise AI leaders. Drinks, hors d'oeuvres, and real talk on LLMs and the future of AI. DM us for details. Spots are limited.
3
5
12
439
Turing Community retweeted
MMLU is saturated. HLE is getting there. We built Multimodal STEM HLE : for what comes next, and the top frontier labs publishing SOTA models are already using it. 1,100 PhD-level multimodal STEM problems that break Opus 4.6. Around 20% pass@1 on SOTA. Hard enough to expose reasoning failures. Solvable enough to generate real RL signal. Every problem requires joint reasoning over images and text, has a deterministic ground-truth answer, and was authored by a PhD-level domain specialist. 50-task public sample on @HuggingFace. Full pack available now. Links below.
4
10
19
34,100
Turing Community retweeted
The models are already extraordinary. That's not the hard part anymore. The hard part is letting them touch reality. Real workflows. Real data. Real stakes. The next decade belongs to whoever solves deployment, not whoever builds the best benchmark score. I've been making that bet for seven years. I'm more convinced than ever. Link below.
4
7
22
4,890
Turing Community retweeted
Who's actually building AI? 3 months and 14 episodes into This Week in AI, @Jason has sat down with founders and operators across infra, models, dev tools, consumer, creative, robotics, healthcare, and more. INFRA & COMPUTE Chase Lochmiller (Crusoe) @ChaseLochmiller Lin Qiao (Fireworks AI) @lqiao Chris Lattner (Modular) @clattner_llvm Nick Harris (Lightmatter) @theanalognick Mitesh Agrawal (Positron AI) @mitesh711 Alex Cheema (EXO Labs) @alexocheema Philip Johnston (Starcloud) @PhilipJohnston Naveen Rao (Unconventional AI) @NaveenGRao Russ d'Sa (LiveKit) @dsa FOUNDATION MODELS & RESEARCH Kanjun Qiu (Imbue) @kanjun Carina Hong (Axiom Math) @CarinaLHong Jeremy Fraenkel (Fundamental) @fraenkelj EVALS & BENCHMARKS Anastasios Angelopoulos (Arena) @ml_angelopoulos DEV TOOLS, CODING & AUTOMATION Karri Saarinen (Linear) @karrisaarinen Matan Grinberg (Factory) @matanSF Spiros Xanthos (Resolve AI) @spirosx Wade Foster (Zapier) @wadefoster CONSUMER & SEARCH Aravind Srinivas (Perplexity) @AravSrinivas Richard Socher (youdotcom & Recursive) @RichardSocher Tanay Kothari (Wispr Flow) @tankots Steven Berlin Johnson (NotebookLM) @stevenbjohnson CREATIVE & MEDIA Demi Guo (Pika) @demi_guo_ Victor Riparbelli (Synthesia) @vriparbelli Mikey Shulman (Suno) @MikeyShulman Grant Lee (Gamma) @thisisgrantlee ROBOTICS Jake Loosararian (Gecko Robotics) @jakeloosy Boris Sofman (Bedrock Robotics) @bsofman HEALTHCARE Shiv Rao (Abridge) @ShivdevRao Trey Holterman (Tennr) @TreyHolterman ENTERPRISE, VERTICAL & DATA George Sivulka (Hebbia) @gsivulka Kashif Ali (TaxGPT) @ChKashifAli Alex Elias (Qloo) @ape TALENT & WORKFORCE Ali Ansari (micro1) @aliansarinik Jonathan Siddharth (Turing) @jonsid Thank you all for joining! Episode 14 out now: youtube.com/watch?v=szd0TYQq…
4
13
37
63,089
Turing Community retweeted
Excited to share that @Turingcom Co-founder @krishnanvijay will be joining an industry panel with: @ravisujith (GVP, @OracleAI), @Kenneth_Marino (@UUtah), and @MingHsuanYang (@ucmerced @GoogleDeepMind) at #CVPR2026 The morning will be dedicated to solving the hardest parts of Agentic Systems and bridging the gap between Computer Vision, NLP and Informational Retrieval.
#CVPR2026 is just around the corner! If you are heading to Denver, join us at GRAIL-V @CVPR on 3rd June, for a morning dedicated to solving the hard parts of Agentic Systems - bridging the gap between Computer Vision, NLP, and Information Retrieval We have a packed schedule featuring foundational keynotes and a deep-dive industry panel focused on moving from frontier research to production-scale agents for @CVPRConf 📅 Date: Wednesday, June 3, 2026 ⏰ Time: 7:30 AM – 12:30 PM 📍 Location: Room 506, Colorado Convention Center, Denver 🔗 Details & Full Schedule: lnkd.in/g4JaU5x6 🔥 Keynotes from: 🌟 Kristen Grauman (@UTAustin) 🌟@mohitban47 (@unc_ai_group) 🌟@DanRothNLP (@_PennAI ) 🌟@scottyih (@Meta @AIatMeta ) 🎤 Industry Panel: @ravisujith(GVP, @Oracle AI) , @krishnanvijay (@turingcom ), @Kenneth_Marino (@UUtah ), @MingHsuanYang (@ucmerced @GoogleDeepMind ) Organizing Team - @amitpinaki @sarahookr @aliceoh @jyotika @Hitesh_LPatel @keviv9 , @karandua Vivek Srikumar, Tao Sheng #CVPR2026 #AI #ComputerVision #LLM #Agents #Multimodal #Research #MachineLearning #ICLR2026 #ICML2026 #VisionLanguage
5
12
543
Turing Community retweeted
Last week we released the Open MM-RL Dataset. A PhD-level multimodal STEM benchmark built for verifiable reasoning across physics, chemistry, biology, and math. Four STEM domains, one dataset -Physics: Quantum and Particle Physics, Condensed Matter and Materials, Electromagnetism, Photonics, and Plasma Systems, Astrophysics and Space Physics -Mathematics: Algebra and Structure, Discrete Mathematics, Analysis and Continuous Mathematics, Probability and Geometry -Biology: Evolutionary Systems, Molecular Mechanisms, Cellular Processes and Neural Biology -Chemistry: Chemical Structure, Reaction Mechanisms, Synthesis, Spectroscopy and Properties The bar is raised. Download below.
1
4
10
711
Turing Community retweeted
Now trending at #1 on @huggingface
Introducing the Open MM-RL Dataset. A PhD-level multimodal STEM benchmark built for verifiable reasoning across physics, chemistry, biology, and math. Four STEM domains, one dataset -Physics: Quantum and Particle Physics, Condensed Matter and Materials, Electromagnetism, Photonics, and Plasma Systems, Astrophysics and Space Physics -Mathematics: Algebra and Structure, Discrete Mathematics, Analysis and Continuous Mathematics, Probability and Geometry -Biology: Evolutionary Systems, Molecular Mechanisms, Cellular Processes and Neural Biology -Chemistry: Chemical Structure, Reaction Mechanisms, Synthesis, Spectroscopy and Properties We're raising the bar.
4
8
31
32,189
Turing Community retweeted
Open MM-RL Dataset is trending on @huggingface. We built something I've wanted for a long time. - PhD-level STEM reasoning across physics, math, biology & chemistry - 100% verifiable, auto-gradable answers - Single-image, multi-panel & multi-image formats - Two-round expert review on every problem - RL-ready reward structure out of the box Most multimodal dataset test perception. This one tests reasoning. The kind that doesn't break under scrutiny. Built by PhD SMEs. Validated for frontier models. Open to the community. Website & Dataset below.
3
11
37
5,797
Turing Community retweeted
Most browser agent benchmarks are already solved. We built ones that aren't. 500 tasks. 100 templates. 50% model-breaking difficulty at delivery. Full case study → Below.
3
6
11
2,680
Turing Community retweeted
Open-MM-RL is trending at #3 on @huggingface! This is a strong signal that the community wants harder, cleaner datasets for frontier model evaluation, training and a sign that the community is actively looking for datasets that make multimodal evaluation more rigorous. Take a look, tell us what you think, below.
Introducing the Open MM-RL Dataset. A PhD-level multimodal STEM benchmark built for verifiable reasoning across physics, chemistry, biology, and math. Four STEM domains, one dataset -Physics: Quantum and Particle Physics, Condensed Matter and Materials, Electromagnetism, Photonics, and Plasma Systems, Astrophysics and Space Physics -Mathematics: Algebra and Structure, Discrete Mathematics, Analysis and Continuous Mathematics, Probability and Geometry -Biology: Evolutionary Systems, Molecular Mechanisms, Cellular Processes and Neural Biology -Chemistry: Chemical Structure, Reaction Mechanisms, Synthesis, Spectroscopy and Properties We're raising the bar.
1
8
16
1,878
Turing Community retweeted
Introducing the Open MM-RL Dataset. A PhD-level multimodal STEM benchmark built for verifiable reasoning across physics, chemistry, biology, and math. Four STEM domains, one dataset -Physics: Quantum and Particle Physics, Condensed Matter and Materials, Electromagnetism, Photonics, and Plasma Systems, Astrophysics and Space Physics -Mathematics: Algebra and Structure, Discrete Mathematics, Analysis and Continuous Mathematics, Probability and Geometry -Biology: Evolutionary Systems, Molecular Mechanisms, Cellular Processes and Neural Biology -Chemistry: Chemical Structure, Reaction Mechanisms, Synthesis, Spectroscopy and Properties We're raising the bar.
2
11
28
65,860