PhD Candidate in Computer Science, NYU, Deep Learning

Joined May 2022
19 Photos and videos
So excited to be the SFT lead of this massive collaboration! The OpenThoughts team is πŸ”₯
How can we make a better TerminalBench agent? Today, we are announcing the OpenThoughts-Agent project. OpenThoughts-Agent v1 is the first TerminalBench agent trained on fully open curated SFT and RL environments. OpenThinker-Agent-v1 is the strongest model of its size on TerminalBench, and sets a new bar on our newly released OpenThoughts-TB-Dev benchmark. (1/n)
3
1
14
1,163
Benjamin Feuer retweeted
If you are attending #ICML2025, check out our DataWorld workshop on Sat July 19. We have updated the website with more info on speakers & accepted papers! dataworldicml2025.github.io/ Also happy to chat offline about all things ✨ data ✨
24
81
10,960
New research paper for you to read over your July 4th break (if you're US-based) -- Vision is a skeleton key! πŸ—οΈ We convert a small VLM into an "everything classifier" by transforming data into visualizations that VLMs can naturally understand and reason about. We call it MARVIS: Modality Adaptive Reasoning over VISualizations. Our MARVIS-3B model: - Beats Gemini by 16% on average across 100s of vision and tabular tasks πŸ† - Gets within 2.5% of the best specialized model across across 4 modalities ... 🎯 - Using just one 3B model ... πŸ’ͺ - ... without exposing any P.I.I. (personally identifiable information) to the VLM ... πŸ” - And without requiring any model training! ⚑ Our GitHub: lnkd.in/eqPkzU2m πŸ’» Our Paper: lnkd.in/esXZEvEE πŸ“„ Research Supported By: oumi.ai Thanks to @LennartPurucker @Oussama_e
2
4
566
So excited to announce the DCVLR (Data Curation for Vision-Language Reasoning) competition at NeurIPS 2025, led by @Oumi_PBC and sponsored by @LambdaAPI! 🌟open-data 🌟 πŸ€– open-models πŸ€– πŸ’» open-source πŸ’» πŸ’ͺanyone can compete for free πŸ’ͺ dcvlr-neurips.github.io/ 🧡 1 / n
1
13
43
10,829
Thanks to Rohun Tripathi, Oussama Elachqar, @Zhang_Yu_hui , @NimrodShabtay , @NHulkund , Stefan Webb, @thao_nguyen26 , Vishaal Udandarao, @XiaohanWang96 , @lschmidt3 , @sainingxie , Serena Yeung-Levy, Paul Pu Liang, @sarameghanbeery , @georgiagkioxari , Manos Koukoumidis !
1
1
5
649
And also our supporters @natolambert , Thomas Bordes, George Will
4
218
Benjamin Feuer retweeted
Announcing OpenThinker3-7B, the new SOTA open-data 7B reasoning model: improving over DeepSeek-R1-Distill-Qwen-7B by 33% on average over code, science, and math evals. We also release our dataset, OpenThoughts3-1.2M, which is the best open reasoning dataset across all data scales. Full details are in our ✨new paper✨ - below we share the highlights: BTW, it also works on non-Qwen modelsπŸ˜‰ (1/N)
34
192
923
200,643
Benjamin Feuer retweeted
21 May 2025
πŸ“£We are extending our deadline to May 31st!πŸ“£ Looking forward to seeing everyone's submissions :)
πŸ“’ Announcing our data-centric workshop at ICML 2025 on unifying data curation frameworks across domains! πŸ“… Deadline: May 24, AoE πŸ”— Website: dataworldicml2025.github.io/ We have an amazing lineup of speakers panelists from various institutions and application areas.
4
7
828
Benjamin Feuer retweeted
Many agents (Claude Code, Codex CLI) interact with the terminal to do valuable tasks, but do they currently work well enough to deploy en masse? We’re excited to introduce Terminal-Bench: An evaluation environment and benchmark for AI agents on real-world terminal tasks. Tl;dr lots of room for improvement! tbench.ai/
16
63
243
52,223
Benjamin Feuer retweeted
πŸ“’ Announcing our data-centric workshop at ICML 2025 on unifying data curation frameworks across domains! πŸ“… Deadline: May 24, AoE πŸ”— Website: dataworldicml2025.github.io/ We have an amazing lineup of speakers panelists from various institutions and application areas.
2
21
133
25,907
Benjamin Feuer retweeted
If AI isn’t truly open, it will fail us. We can’t close in a black box our greatest invention yet just so that a few can freely monetize. AI needs its Linux moment, and so we started working towards it. This can only succeed if we all work together! #oumi #opensource #collaboration @rsalakhu @svlevine @larry_heck @karpathy @atalwalkar @prfsanjeevarora @tsvetshop @hhexiy @sainingxie @larry_heck @AnimaAnandkumar @JunjieHu12 @georgiagkioxari @profjoeyg @pliang279 @danqi_chen @ChrisGPotts @BillMacCartney @vinodv @tur_gokhan @dilekhakkanitur @j_foerst @gingsmith23 @kahinish @jamesjoaquin @gan3sh @ethanjb @kirbywinfield @egonzdp
7
33
83
22,369
Benjamin Feuer retweeted
Oumi:build state-of-the-art foundation models, end-to-end. Oumi is a fully open-source platform designed to train, evaluate, and deploy foundation models end-to-end. It supports models from 10M to 405B parameters, enabling fine-tuning using LoRA, QLoRA, DPO, and other techniques. It integrates with popular inference engines (vLLM, SGLang) and works across laptops, clusters, and cloud platforms (AWS, Azure, GCP, Lambda, etc.). Supports multimodal models like Llama, DeepSeek, and Phi. βš™οΈ Key Benefits β†’ Oumi simplifies model training with a unified API, allowing seamless model fine-tuning, data synthesis, and evaluation. Supports both open-source and commercial APIs like OpenAI, Anthropic, and Vertex AI, making it highly flexible. β†’ Enables fast inference with optimized engines such as vLLM and SGLang, ensuring efficient deployment. Installation is straightforward with pip install oumi, supporting both CPU and GPU setups. β†’ Features a CLI tool (oumi train, oumi evaluate, oumi infer) for easy model training, evaluation, and inference. β†’ Supports cloud-based training with direct job execution on AWS, Azure, GCP, and Lambda. β†’ Includes prebuilt ready-to-use training recipes for LLM fine-tuning, distillation, evaluation, and inference. β†’ 100% open-source under Apache 2.0 license, with an active community on Discord and GitHub.
3
12
41
3,214
Benjamin Feuer retweeted
3 Feb 2025
Oumi is a fully open-source platform to help you build state-of-the-art foundation models, end-to-end.
15
169
808
58,019
After careful consideration, I have decided to leave X for BlueSky. I hope to see many of you there with me very soon! @benjaminfeuer.bsky.social
9
343
Benjamin Feuer retweeted
🐞 Check out this zero-shot AI model dataset with 6M images of important species that is vital to farming and environmental research! Learn more: ow.ly/Kzig50TRSaC #ZeroShotLearning #AIModels #AgriculturalTech #EnvironmentalResearch @chegday @FeuerBenjamin @AII4RA
3
5
4,669