Benjamin Feuer

Tweets

Benjamin Feuer @FeuerBenjamin

6 Dec 2025

So excited to be the SFT lead of this massive collaboration! The OpenThoughts team is 🔥

Negin Raoof @NeginRaoof_

6 Dec 2025

How can we make a better TerminalBench agent? Today, we are announcing the OpenThoughts-Agent project. OpenThoughts-Agent v1 is the first TerminalBench agent trained on fully open curated SFT and RL environments. OpenThinker-Agent-v1 is the strongest model of its size on TerminalBench, and sets a new bar on our newly released OpenThoughts-TB-Dev benchmark. (1/n)

Benjamin Feuer retweeted

Thao Nguyen @thao_nguyen26

17 Jul 2025

If you are attending #ICML2025, check out our DataWorld workshop on Sat July 19. We have updated the website with more info on speakers & accepted papers! dataworldicml2025.github.io/ Also happy to chat offline about all things ✨ data ✨

Benjamin Feuer @FeuerBenjamin

3 Jul 2025

New research paper for you to read over your July 4th break (if you're US-based) -- Vision is a skeleton key! 🗝️ We convert a small VLM into an "everything classifier" by transforming data into visualizations that VLMs can naturally understand and reason about. We call it MARVIS: Modality Adaptive Reasoning over VISualizations.

Our MARVIS-3B model:
- Beats Gemini by 16% on average across 100s of vision and tabular tasks 🏆
- Gets within 2.5% of the best specialized model across 4 modalities ... 🎯
- Using just one 3B model ... 💪
- ... without exposing any P.I.I. (personally identifiable information) to the VLM ... 🔐
- And without requiring any model training! ⚡

Our GitHub: lnkd.in/eqPkzU2m 💻
Our Paper: lnkd.in/esXZEvEE 📄
Research Supported By: oumi.ai

Thanks to @LennartPurucker @Oussama_e

Benjamin Feuer @FeuerBenjamin

18 Jun 2025

So excited to announce the DCVLR (Data Curation for Vision-Language Reasoning) competition at NeurIPS 2025, led by @Oumi_PBC and sponsored by @LambdaAPI! 🌟open-data 🌟 🤖 open-models 🤖 💻 open-source 💻 💪anyone can compete for free 💪 dcvlr-neurips.github.io/ 🧵 1 / n

DCVLR: NeurIPS 2025 Competition

Join the DCVLR NeurIPS 2025 Competition. Advance visual reasoning in VLMs through data curation.

dcvlr-neurips.github.io

Benjamin Feuer @FeuerBenjamin

18 Jun 2025

Thanks to Rohun Tripathi, Oussama Elachqar, @Zhang_Yu_hui, @NimrodShabtay, @NHulkund, Stefan Webb, @thao_nguyen26, Vishaal Udandarao, @XiaohanWang96, @lschmidt3, @sainingxie, Serena Yeung-Levy, Paul Pu Liang, @sarameghanbeery, @georgiagkioxari, Manos Koukoumidis!

Benjamin Feuer @FeuerBenjamin

18 Jun 2025

And also our supporters @natolambert, Thomas Bordes, George Will

Benjamin Feuer retweeted

Ryan Marten @ryanmart3n

5 Jun 2025

Announcing OpenThinker3-7B, the new SOTA open-data 7B reasoning model: improving over DeepSeek-R1-Distill-Qwen-7B by 33% on average over code, science, and math evals. We also release our dataset, OpenThoughts3-1.2M, which is the best open reasoning dataset across all data scales. Full details are in our ✨new paper✨ - below we share the highlights: BTW, it also works on non-Qwen models😉 (1/N)

Benjamin Feuer retweeted

Neha Hulkund @NHulkund

21 May 2025

📣We are extending our deadline to May 31st!📣 Looking forward to seeing everyone's submissions :)

Thao Nguyen @thao_nguyen26

1 May 2025

📢 Announcing our data-centric workshop at ICML 2025 on unifying data curation frameworks across domains! 📅 Deadline: May 24, AoE 🔗 Website: dataworldicml2025.github.io/ We have an amazing lineup of speakers and panelists from various institutions and application areas.

Benjamin Feuer retweeted

Mike A. Merrill @Mike_A_Merrill

19 May 2025

Many agents (Claude Code, Codex CLI) interact with the terminal to do valuable tasks, but do they currently work well enough to deploy en masse? We’re excited to introduce Terminal-Bench: An evaluation environment and benchmark for AI agents on real-world terminal tasks. Tl;dr lots of room for improvement! tbench.ai/

Benjamin Feuer retweeted

Thao Nguyen @thao_nguyen26

1 May 2025

Benjamin Feuer retweeted

Manos Koukoumidis @Koukoumidis

29 Jan 2025

If AI isn’t truly open, it will fail us. We can’t close in a black box our greatest invention yet just so that a few can freely monetize. AI needs its Linux moment, and so we started working towards it. This can only succeed if we all work together! #oumi #opensource #collaboration @rsalakhu @svlevine @larry_heck @karpathy @atalwalkar @prfsanjeevarora @tsvetshop @hhexiy @sainingxie @larry_heck @AnimaAnandkumar @JunjieHu12 @georgiagkioxari @profjoeyg @pliang279 @danqi_chen @ChrisGPotts @BillMacCartney @vinodv @tur_gokhan @dilekhakkanitur @j_foerst @gingsmith23 @kahinish @jamesjoaquin @gan3sh @ethanjb @kirbywinfield @egonzdp

Benjamin Feuer retweeted

Rohan Paul @rohanpaul_ai

4 Feb 2025

Oumi: build state-of-the-art foundation models, end-to-end.

Oumi is a fully open-source platform designed to train, evaluate, and deploy foundation models end-to-end. It supports models from 10M to 405B parameters, enabling fine-tuning using LoRA, QLoRA, DPO, and other techniques. It integrates with popular inference engines (vLLM, SGLang) and works across laptops, clusters, and cloud platforms (AWS, Azure, GCP, Lambda, etc.). Supports multimodal models like Llama, DeepSeek, and Phi.

⚙️ Key Benefits
→ Oumi simplifies model training with a unified API, allowing seamless model fine-tuning, data synthesis, and evaluation. Supports both open-source and commercial APIs like OpenAI, Anthropic, and Vertex AI, making it highly flexible.
→ Enables fast inference with optimized engines such as vLLM and SGLang, ensuring efficient deployment. Installation is straightforward with pip install oumi, supporting both CPU and GPU setups.
→ Features a CLI tool (oumi train, oumi evaluate, oumi infer) for easy model training, evaluation, and inference.
→ Supports cloud-based training with direct job execution on AWS, Azure, GCP, and Lambda.
→ Includes prebuilt ready-to-use training recipes for LLM fine-tuning, distillation, evaluation, and inference.
→ 100% open-source under Apache 2.0 license, with an active community on Discord and GitHub.

Benjamin Feuer retweeted

elvis @omarsar0

3 Feb 2025

Oumi is a fully open-source platform to help you build state-of-the-art foundation models, end-to-end.

Benjamin Feuer @FeuerBenjamin

20 Nov 2024

After careful consideration, I have decided to leave X for Bluesky. I hope to see many of you there with me very soon! @benjaminfeuer.bsky.social

Benjamin Feuer retweeted

Agronomy, Crop, and Soil Science Societies @ASA_CSSA_SSSA

30 Oct 2024

🐞 Check out this zero-shot AI model dataset with 6M images of important species that is vital to farming and environmental research! Learn more: ow.ly/Kzig50TRSaC #ZeroShotLearning #AIModels #AgriculturalTech #EnvironmentalResearch @chegday @FeuerBenjamin @AII4RA
