Tyler LaBonte

Tyler LaBonte

63 Photos and videos

Tweets

Pinned Tweet

Tyler LaBonte @tmlabonte

1 May 2025

Excited to present at the first #AISTATS2025 poster session on May 3! Ever wondered how LLMs can generalize to new tasks in-context despite only training on token completion? We formalize this phenomenon as "task shift" and investigate a linear version: arxiv.org/abs/2502.13285

2,596

Tyler LaBonte

Tyler LaBonte @tmlabonte

May 24

The updated version of this paper has been accepted at @TmlrOrg 🚨🚀 Very excited about implications of our results for SOTA robustness algorithms & understanding spurious correlations more generally. Journal version link: openreview.net/pdf?id=h81ztb…

Tyler LaBonte @tmlabonte

23 Apr 2025

Heading to #ICLR2025 to present our SCSL workshop paper on understanding how last-layer retraining methods mitigate spurious correlations! openreview.net/pdf?id=B2W51a… Stop by on Monday, April 28 to chat and learn more 🙂

1,818

Tyler LaBonte

Tyler LaBonte @tmlabonte

Mar 22

Cramér on Lindeberg: "When he was reproached for not being sufficiently active in his scientific work, he said 'Well, I am really a farmer.' And if somebody happened to say that his farm was not properly cultivated, his answer was 'Of course my real job is to be a professor.'"

261

Microsoft Research

Tyler LaBonte retweeted

Microsoft Research

@MSFTResearch

Mar 9

Multimodal reasoning with Phi-4-reasoning-vision, new work on scaling LLM inference, benchmarking AI agents in network operations, cinematic video generation, adaptive evaluation for LLMs, and using AI to improve individual and population health. msft.it/6013QiQgx

2:00

11,673

Tyler LaBonte

Tyler LaBonte @tmlabonte

Mar 5

Our Phi-4-reasoning-vision-15B technical report is now available on arxiv: arxiv.org/abs/2603.03975

Phi-4-reasoning-vision-15B Technical Report

We present Phi-4-reasoning-vision-15B, a compact open-weight multimodal reasoning model, and share the motivations, design choices, experiments, and learnings that informed its development. Our...

arxiv.org

408

Tyler LaBonte

Tyler LaBonte @tmlabonte

Mar 4

Some nice coverage on our new model release, highlighting our hybrid approach to multimodal reasoning 🚀

VentureBeat

@VentureBeat

Mar 4

Microsoft built Phi-4-reasoning-vision-15B to know when to think — and when thinking is a waste of time venturebeat.com/ai/microsoft…

408

Tyler LaBonte

Tyler LaBonte @tmlabonte

Mar 4

It's been the privilege of my career to help build the newest Phi series model from @MSFTResearch! Phi-4-reasoning-vision-15B is open-weight & competitive on perf with 10X less compute/tokens. Read the blog for math and CUA case studies, hybrid reasoning, data insights, & more!

Microsoft Research

@MSFTResearch

Mar 4

Vision-language models improve multimodal systems, but can make them slower, costlier, and harder to deploy. Learn how Phi-4-reasoning-vision-15B, a compact and fast multimodal reasoning model, blends strengths of different methods while reducing their limits: msft.it/6014Q5X0u

White line icons against a blue-green gradient background form an architecture flow chart. In the middle of the chart is a three-by-three matrix of circles and lines within a round-edge square. Above the matrix, three icons in a row – an equation, a person using a desktop, and a head with gears flow by dotted lines to the matrix. To the left of the matrix is an icon representing a stack of files with an arrow pointing to the matrix. To the right of the matrix is a graph with a double headed arrow pointing to the matrix and to itself. Below the matrix is an icon representing a document. A dotted line arrow connects this graph to the matrix, showing the direction flowing from the matrix to the document. To the right of the document icon is an hourglass icon and three list icons with a dotted line connecting the hourglass to the lists.

ALT White line icons against a blue-green gradient background form an architecture flow chart. In the middle of the chart is a three-by-three matrix of circles and lines within a round-edge square. Above the matrix, three icons in a row – an equation, a person using a desktop, and a head with gears flow by dotted lines to the matrix. To the left of the matrix is an icon representing a stack of files with an arrow pointing to the matrix. To the right of the matrix is a graph with a double headed arrow pointing to the matrix and to itself. Below the matrix is an icon representing a document. A dotted line arrow connects this graph to the matrix, showing the direction flowing from the matrix to the document. To the right of the document icon is an hourglass icon and three list icons with a dotted line connecting the hourglass to the lists.

887

Tyler LaBonte

Tyler LaBonte @tmlabonte

Jan 14

Over the holidays, I stress-tested the AI coding hype by doing something concrete: I built a college football simulator game from scratch to see if agents actually deliver. Here’s what I learned 👇

185

more replies

Tyler LaBonte

Tyler LaBonte @tmlabonte

Jan 14

Misc takeaways: • Copilot GitHub was far more useful than I expected • Keeping code style consistent across humans agents is painful • Overall: Claude was best for agentic coding; Gemini best for interactive pair-programming

134

Tyler LaBonte

Tyler LaBonte @tmlabonte

Jan 14

Finally, thanks to @Kangwook_Lee's "Tenure Track Simulator" post for inspiring me to make the game public and write this up!

Tyler LaBonte

Tyler LaBonte @tmlabonte

11 Dec 2025

Nice work from GT colleagues about how next-token prediction naturally captures long-range structural dependencies!

Xinyuan Cao @CaoYouki

10 Dec 2025

(1/6) Why does next-token prediction work so well, even for long text? 🤔 Check out “Provable Long-Range Benefits of Next-Token Prediction”. A rigorous explanation for LLM’s long-range coherence/reasoning. Joint work with Santosh Vempala📄 arXiv: arxiv.org/abs/2512.07818

298

Tyler LaBonte

Tyler LaBonte @tmlabonte

25 Nov 2025

Fara has been one of the most exciting projects to watch evolve @MSFTResearch over the last few months. From my perspective, Fara is a real advance towards natively multimodal computer-use agents (e.g., no accessibility trees). Congrats to Corby and the team on the release!

Corbin Rosset

@corby_rosset

24 Nov 2025

Microsoft just dropped Fara-7B, its first on device AI Agent that can use your computer just like you would: it clicks, types, fills out forms and completed tasks just by “seeing” the screen. It’s best-in-class in terms of accuracy and cost from yours truly at Microsoft AI Frontiers and you can use it today

274

Tyler LaBonte

Tyler LaBonte @tmlabonte

20 May 2025

Returning to Building 99 for my second internship @MSFTResearch working on multimodal reasoning. Come say hi!

5,704

Tyler LaBonte

Tyler LaBonte @tmlabonte

6 May 2025

That's a wrap on ICLR/AISTATS! It was a wonderful experience to have deep research discussions in a part of the world I had never been to. Thanks to everyone who stopped by to chat or even just say hi 😎

1,180

Tyler LaBonte

Tyler LaBonte @tmlabonte

5 May 2025

See you this afternoon (May 5) at Poster 42 in Hall A-E!

Tyler LaBonte @tmlabonte

1 May 2025

369