Reshinth

Tweets

Pinned Tweet

Reshinth @reshinth_

6 Apr 2024

How to define Diversity in the context of CodeLMs and Programming Languages? 1. Diversity is positively correlated with Performance in solving a problem. 2. Shortcomings of diversity in small CodeLMs. 3. Code Embedding models don't capture semantics. reshinthadithyan.github.io/b…

4,503

Reshinth retweeted

CiaraRowles @CiaraRowles1

Jun 2

We're excited to share Stable-Layers! We train Qwen-Image-Layered further with RL for improved layerization, using only feedback from a VLM — no paired supervision required! Paper: arxiv.org/abs/2605.30257 Project Page: stability-ai.github.io/stabl…

273

18,074

Reshinth retweeted

Varun Jampani @jampani_varun

1 Oct 2025

🎬 Introducing Stable Cinemetrics, to be presented at NeurIPS 2025. We present the first taxonomy of professional controls to systematically study and control video generative models through the lens of filmmaking. Interactive webpage with paper link: stable-cinemetrics.github.io… 🧵

4,116

Reshinth retweeted

Xiao Liang @MasterVito0601

13 Jun 2025

🙋‍♂️ Can RL training address model weaknesses without external distillation? 🚀 Please check our latest work on RL for LLM reasoning! 💯 TL;DR: We propose augmenting RL training with synthetic problems targeting the model’s reasoning weaknesses. 📊 Qwen2.5-32B: 42.9 → SwS-32B: 68.4

131

12,191

Reshinth @reshinth_

12 Jun 2025

119

Reshinth @reshinth_

7 Jun 2025

111

Reshinth @reshinth_

6 Jun 2025

With CodeLMs, scaling actually solved the problem of models intrinsically learning internal structural, syntactic, and semantic information.

132

Reshinth retweeted

Josh @JoshPurtell

6 Jun 2025

OpenAI gave a talk on writing software through specs today. I thought it was my little secret, but it seems quite a few smart builders in the space have also found it a useful approach. Now that the secret's out: joshuapurtell.com/posts/spec…

Specification Engineering

... is a bet on better code gen and more complexity

joshuapurtell.com

728

76,665

Reshinth @reshinth_

30 May 2025

Reshinth retweeted

Augment Code @augmentcode

31 Mar 2025

🧵 We just released the #1 open-source agent on the SWE-bench Verified leaderboard by assembling the best of Claude Sonnet 3.7 and O1. Open-source repo here: github.com/augmentcode/augme… Here's how we achieved a 65.4% success rate on the hardest coding benchmark in the industry: 🧠👇

276

51,383

Reshinth @reshinth_

23 Feb 2025

ML Twitter lately.

Casey Muratori @cmuratori

22 Feb 2025

Remember folks: if you aren't a subject matter expert, don't know the context, and have nothing valuable to add to a thread, you always have the option of not replying!

139

Reshinth retweeted

Alex Havrilla @Dahoas1

5 Dec 2024

How important is the quality, diversity, and complexity (QDC) of synthetic data for LLM performance? What effect does QDC data composition have on self-improvement? We just released a comprehensive survey discussing these questions (and many more) 🧵

111

16,909

Reshinth retweeted

Nathan Cooper @ncooper57

5 Dec 2024

As R&D staff @answerdotai, I work a lot on boosting productivity with AI. A common theme that always comes up is the combination of human and AI. This combination proved to be powerful in our new project ShellSage, which is an AI terminal buddy that learns and teaches with you. A 🧵

201

69,139

Reshinth retweeted

Anthropic @AnthropicAI

19 Nov 2024

New Anthropic research: Adding Error Bars to Evals. AI model evaluations don’t usually include statistics or uncertainty. We think they should. Read the blog post here: anthropic.com/research/stati…

A statistical approach to model evaluations

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

anthropic.com

297

2,085

756,249

Reshinth retweeted

Nathan Cooper @ncooper57

7 Nov 2024

I'm so excited to be working on this new course from @fastdotai! Education has always been a huge driving factor in my life. It is surreal that I'm getting to do this as part of my job. Really looking forward to working with students again 🤓

Jeremy Howard @jeremyphoward

7 Nov 2024

Today, we're announcing that @fastdotai is joining @AnswerdotAI, marking a new phase in making AI accessible. And we're launching a new kind of "AI-first" educational experience, "How To Solve It With Code". answer.ai/posts/2024-11-07-s…

794

Reshinth retweeted

Charles Sutton @RandomlyWalking

1 Nov 2024

Our team has been working hard to harness the power of AI to make software more secure.✨🔐 Today we are excited to share a major milestone: our AI agent has discovered its first real-world security vulnerability! googleprojectzero.blogspot.c… More info 🧵

120

496

158,683

Reshinth retweeted

Robin Rombach @robrombach

1 Aug 2024

🔥 I am so damn excited to announce the launch of Black Forest Labs. We set ourselves on a mission to advance state-of-the-art, high-quality generative deep learning models for images and video, and make them available to the broadest audience possible. Today, we release FLUX.1

Black Forest Labs @bfl_ai

1 Aug 2024

We are excited to announce the launch of Black Forest Labs. Our mission is to develop and advance state-of-the-art generative deep learning models for media and to push the boundaries of creativity, efficiency and diversity.

153

1,156

386,080

Reshinth retweeted

Carlos Riquelme @rikelhood

30 Jul 2024

It was an absolute pleasure to talk with @jaimenovoa of @kfund about the artificial intelligence industry, how chatbots work and are built in simple terms, the role of data, and the present and potential future of the Spanish AI ecosystem. open.spotify.com/episode/65w…

#228 - Carlos Riquelme: basic (and advanced) concepts of Generative AI and LLMs

PodKast de K Fund · Episode

open.spotify.com

6,475

Reshinth retweeted

evanthebouncy @evanthebouncy

19 Jun 2024

the training set of ARC is already contaminated... we have human-written Python code for most of ARC's training set on GitHub, AND in CoT style prompting that breaks the task down, AND in context with the rendered grids: iprc-dip.github.io/ANPL/ it's been there since 2023

14,054

Reshinth retweeted

Nathan Cooper @ncooper57

17 Jun 2024

Big life update! I'm super excited to announce I have joined the awesome crew at @answerdotai 🤓

34,147

Reshinth @reshinth_

2 Jun 2024

200