Research Intern @AdobeResearch, CS Ph.D. student @UWMadison | prev. intern: @AdobeResearch; @Krafton_AI

Joined October 2023
9 Photos and videos
Pinned Tweet
We all knew LLM agents struggle to explore, but we had to eyeball it ๐Ÿ‘€. We couldn't measure exploration errors. Until now. ๐Ÿ—บ๏ธ๐Ÿค– We built a policy-agnostic metric to quantify exploration and exploitation errors in LLM agents. Spoiler: Exploration error is what kills๐Ÿ“‰ agent performance in our setting ๐Ÿ‘‡๐Ÿงต(1/8)
1
17
31
2,145
Jaden Park retweeted
The reversal curse. Edits that don't suppress negations. Multi-hop updates that don't propagate. These look like separate bugs. Our ICML 2026 spotlight argues they may share a common geometric origin, visible only when you study how representations move under updates ๐Ÿงต (1/11)
3
18
81
9,080
Jaden Park retweeted
Almost all "flagship" models are now MoEs. But smaller models still prefer to be dense as they target memory-constrained scenarios where total params matter. So we ask: Can we leverage an MoE to produce dense models without having to train them from scratch? ๐Ÿงต๐Ÿ‘‡
1
7
50
2,942
Lots of good news this week! ๐Ÿš€ 1. My internship project from @AdobeResearch has been accepted to #SIGGRAPH2026! ("MAOAM: Unified Object & Material Selection with Vision-Language Models") Special thanks to my wonderful mentor @michi_fischer who has made this project possible! 2. Paper accepted to #ICML2026! ("DocHop: Benchmarking Out-of-domain Multi-hop Reasoning in Information-Dense Documents") 3. Paper accepted (with minor revisions) at #DMLR! ("Decomposing Complex Visual Comprehension into Atomic Visual Skills for Vision Language Models") In both papers, we generate carefully designed benchmarks to tackle compositional/multi-hop reasoning in VLMs. Proud to have contributed in these projects. More detailed posts soon :) Stay tuned!
1
3
23
684
Jaden Park retweeted
We all knew LLM agents struggle to explore, but we had to eyeball it ๐Ÿ‘€. We couldn't measure exploration errors. Until now. ๐Ÿ—บ๏ธ๐Ÿค– We built a policy-agnostic metric to quantify exploration and exploitation errors in LLM agents. Spoiler: Exploration error is what kills๐Ÿ“‰ agent performance in our setting ๐Ÿ‘‡๐Ÿงต(1/8)
1
17
31
2,145
I will be at #ICLR2026 to present my work on data contamination in VLMs! (Fri, Apr 24, 2026 โ€ข 8:30 AM โ€“ 11:00 AM, Pavilion 3 P3-917) I am currently interested in VLA/physical AI, agents and robustness/generalization. Would love to chat and connect with anyone with similar interests :)
7 Nov 2025
Me: memorize past exams ๐Ÿ“š๐Ÿ’ฏ Also me: fail on a slight tweak ๐Ÿคฆโ€โ™‚๏ธ๐Ÿคฆโ€โ™‚๏ธ Turns out, we can use the same method to ๐—ฑ๐—ฒ๐˜๐—ฒ๐—ฐ๐˜ ๐—ฐ๐—ผ๐—ป๐˜๐—ฎ๐—บ๐—ถ๐—ป๐—ฎ๐˜๐—ฒ๐—ฑ ๐—ฉ๐—Ÿ๐— ๐˜€! ๐Ÿงต(1/10) - Project Page: mm-semantic-perturbation.gitโ€ฆ
2
18
958
We all knew LLM agents struggle to explore, but we had to eyeball it ๐Ÿ‘€. We couldn't measure exploration errors. Until now. ๐Ÿ—บ๏ธ๐Ÿค– We built a policy-agnostic metric to quantify exploration and exploitation errors in LLM agents. Spoiler: Exploration error is what kills๐Ÿ“‰ agent performance in our setting ๐Ÿ‘‡๐Ÿงต(1/8)
1
17
31
2,145
Can we improve exploration failures in LM agents? ๐Ÿ› ๏ธ ๐Ÿ—บ๏ธ Exploration Prompts: Explicitly injecting exploration strategies increases success rate by 17%. ๐Ÿ“ Explicit Harness: Providing the agent with structured summaries of its past observations; success rate boost by 29.4%! ๐Ÿงต(7/8)
1
5
206
This was a joint co-first author work with @jungtaek_kim and @jongwonjeong123, with the guidance from @rdnowak, @Kangwook_Lee and @yong_jae_lee. If this sounds interesting, please check out our paper ๐Ÿ“„ arxiv.org/abs/2604.13151! If you have any questions, feedback, or new ideas, I'd be more than happy to discuss!๐Ÿงต(8/8)
6
297
Excited to be back at @AdobeResearch this summer where I will be working with @Shramanpramani2 :) Would love to connect with anyone who will be around!
1
6
120
Jaden Park retweeted
๐Ÿ”ฅ Upgrade your frozen vision encoders with <10 lines of code! Single-scale inference throws away vital details. Enter MuRF ๐Ÿš€: a simple, training-free plug-in for instant, massive gains in MLLMs, Seg & Depth. ๐Ÿคฏ 1/6
7
25
147
28,381
Jaden Park retweeted
๐ŸšจNew work with @Meta @RealityLabs We introduce EGAgent, an agentic reasoning framework for very long video understanding powered by entity scene graphs Why? With long multimodal data streams, agents must search and reason across multiple modalities! ๐Ÿงต (1/n)
2
8
19
2,060
Jaden Park retweeted
New paper out! ๐Ÿšจ Introducing STTS: Unified Spatio-Temporal Token Scoring for Efficient Video VLMs. We tackle the massive token bottleneck in video models by jointly identifying the tokens that actually matter. The overall figure below breaks down the core problem! ๐Ÿงต๐Ÿ‘‡
1
4
19
4,738
Jaden Park retweeted
Hi ML Twitter! My Summer 2026 internship unfortunately fell through last minute ๐Ÿ˜ตโ€๐Ÿ’ซ If your team is looking for interns, Iโ€™d love to connect - RTs appreciated ๐Ÿ™ My website: aniketrege.github.io/

17
29
266
34,725
There should be a meta-conference where reviewers are Claude Code. (1) Claude Code figures out how to run your code like TerminalBench. (2) Claude Code tries to run your code for 48 hours. If Claude Code can't beat your Table 1 in 2 days of vibe-research, it gets accepted โœ….
1
3
201