Tweets

Bowen Zhao retweeted

sunil kumar

@__sunil_kumar_

6 Aug 2025

Excited to share some new work: we show how to efficiently train small vision-language models to use a zoom tool with GRPO. We identify key points for doing small-scale RL for tool use in VLMs generally, and share what works (and what doesn’t) under significant resource constraints. arXiv link below ⬇️

Bowen Zhao retweeted

sunil kumar

@__sunil_kumar_

20 May 2025

We've open-sourced an MCP that allows big models to use Hugging Face computer vision models as tools. This allows Claude to act as a "visual agent", using other task-specific models to help it solve problems. Below is an example of Claude using an open vocab object detector to zoom in on small details to solve a hard problem that it could not solve natively. Additionally, we've written a blog post discussing why outsourcing vision capabilities from large models is something you should consider. MCP Repo: github.com/groundlight/mcp-v… Blog post: groundlight.ai/blog/vision-a…

Bowen Zhao retweeted

sunil kumar

@__sunil_kumar_

14 Mar 2025

We just released an open-source framework that makes it easy to build visual reasoning agents (with GRPO). github.com/groundlight/r1_vl…

Bowen Zhao

@BowenROIM

12 Mar 2025

It was my pleasure to make a minor contribution to this great project! This attention rollout video is my favorite “visualization of the year” so far. Try out our demo if you are interested, and stay tuned for our future research!

Leo Dirac @leopd

10 Mar 2025

We trained a Visual LLM to reason using GRPO, and open sourced the code. Tiny 3B model beats all the big players (GPT, Claude, etc., 0-shot) after RL training on this cryptogram task. Live demo and links: groundlight.ai/blog/visual-r… Vision foundation models have kinda stagnated recently, but now that we have shown how to incorporate reasoning, I think we'll be able to make progress again.

Bowen Zhao retweeted

Groundlight @GroundlightAI

13 Aug 2024

Just dropped ⚡ - our latest YouTube tutorial, running Groundlight's computer vision on a Raspberry Pi! Watch it here: youtube.com/watch?v=YpNKHjuZ… Tim Huff, our Robotics Engineer, walks you step by step through creating an application that notifies you if someone steals your parking spot. Unlike traditional computer vision: ☑ No need for a dataset ☑ Works on day one ☑ Adapts to your real environment

Bowen Zhao

@BowenROIM

22 Jul 2024

Replying to @awk_ai

@awk_ai @HannaHajishirzi Can’t run billion-level LLMs efficiently? Take a look at our work: APT. We are excited to share our #ICML2024 oral paper, “APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference”. Paper: shorturl.at/xabJl

Bowen Zhao

@BowenROIM

22 Jul 2024

Experiments show that APT maintains 98% task performance when pruning 60% of the parameters in small models, and it preserves 86.4% of LLaMA's performance with 70% density. Furthermore, APT speeds up LMs’ finetuning by up to 8x and reduces LLMs’ training memory cost by up to 70%.

Bowen Zhao

@BowenROIM

22 Jul 2024

Our paper has more interesting results; go check it out.

Bowen Zhao retweeted

Groundlight @GroundlightAI

20 Jun 2024

#CVPR2024 down the street gets our science team out of the office! 10,000 computer vision researchers in Seattle figuring out what's next in the field from multimodal foundation models to the most creative applications.

Bowen Zhao retweeted

Groundlight @GroundlightAI

18 Jun 2024

🚀 Exciting News from Groundlight AI! 🚀 Introducing the Groundlight Hub, your easy access to computer vision. youtu.be/u3xPnOnvTcE

Bowen Zhao retweeted

Groundlight @GroundlightAI

29 May 2024

Our Robotics Engineer, Tim Huff, gives a demo of Groundlight AI's Visual Inspections 🔍 and Anomaly Detection solution, compatible with @Universal_Robot 🤖 cobots:

Bowen Zhao retweeted

Node AI | $GPU @NodeAIETH

6 May 2024

Importance of Demand Generation ✍️ 🌟 The number of GPUs we own and the robustness of our L1 integration are crucial, but they're not the sole determinants of our long-term success. It's all about creating demand for our product. 🥅 Our ultimate goal is to stimulate demand for our GPU nodes through diverse integrations and utilities. We're committed to providing solutions that meet the needs of our users and drive adoption. 📈 In just the last 3 days, we've seen a significant uptick, accumulating nearly 400 additional rental hours for our $GPUs. This demonstrates the growing interest and demand for our platform. 🥇 Our primary focus remains clear: to establish #NodeAI as the premier infrastructure provider for high-performance AI. Demand generation is key, and we're dedicated to making it happen. #AI #DEPIN #GPU

Bowen Zhao

@BowenROIM

1 Mar 2024

Super excited to share this paper! Temporal chaos in pretraining corpora greatly impacts language models’ ability to answer time-sensitive questions. We will publish our code for dataset generation and temporal alignment later. Stay tuned!

Yizhong Wang @yizhongwyz

1 Mar 2024

When you use ChatGPT, do you notice that it has a data cutoff date? 🗓️ But as models are pretrained on web text originating from many historical periods, do they have a sense that they should use their latest knowledge to answer questions rather than historical info? Excited to introduce our new work on *Temporal Alignment*! We show that pretrained LMs encode a chaotic sense of time, but it’s possible to align them to a recent time, or some time in history! 🕐🕟🕘 📜 shorturl.at/cfrR8

Bowen Zhao retweeted

Yizhong Wang @yizhongwyz

1 Mar 2024

This is joint work co-led with my awesome mentees @BowenROIM and @zanbrum. Please reach out to them if you have job or collaboration opportunities for them! Finally, as always, many thanks to our fantastic advisors @HannaHajishirzi and @nlpnoah ❤️❤️❤️

Bowen Zhao retweeted

Jiacheng Liu @liujc1998

27 Sep 2023

What if we combine PPO with Monte-Carlo Tree Search – the secret sauce for AlphaGo to reach superhuman performance? Spoiler: MAGIC!! Our inference-time decoding method, PPO-MCTS, achieves impressive results across many text generation tasks. 📜 arxiv.org/abs/2309.15028 🧵(1/n)

Bowen Zhao retweeted

Jiuding Sun @SunJiuding

21 Jun 2023

How robust are the instructions in your instruction-tuned model? In our most recent work (w/ @ChantalShaib and @byron_c_wallace), we show that there is a considerable dip in performance on in-domain tasks when you slightly vary the instruction. arxiv.org/abs/2306.11270

Bowen Zhao retweeted

#ELJARDINERO @J4FN1_

17 Jun 2020

Perisic and Icardi winning league titles and still in the UCL while Conte & Inter........