Applied Scientist @FutureAGI_ | Prev: @microsoft, @UM_DKE, @UvA_Amsterdam | Projects with @meta, @x

Joined November 2009
4 Photos and videos
Pinned Tweet
31 Mar 2024
Super excited to share our paper "Akal Badi ya Bias: An Exploratory Study of Gender bias in Hindi" has been accepted to #facct2024. Preprint available soon. Really proud of this work. It is a huge group effort in partnership with @karya_inc [1/n]
8
10
101
26,303
Rishav Hada retweeted
@perplexity_ai just brought a major contribution to our repo, launched 4 weeks back! @JamesLiounis_ shipped first-class Sonar Agent API support to our open-source repo. Route Perplexity Sonar through Future AGI gateway with caching, fallback chains, and guardrails. Use it in experiments, prompts, evals too. Real-time web access and citations baked in. (github.com/future-agi/future…)
10
14
213
Rishav Hada retweeted
Today is the biggest day for us at @FutureAGI_ as we go fully open source. Hundreds of teams have trusted us to build self-improving AI agents. And as of today, it's on GitHub for every AI team on earth to use, extend, and build on. Long story short, I have been in AI infra long enough to know that everyone talks about agents that learn and improve. Nobody ships the platform that makes it happen. We did. And this is just the first step towards building truly autonomous AI - infrastructure where your agent gets better every deployment, without ever changing the LLM. Here's why this had to exist. The teams doing the hardest work in AI are burning 6-figure engineering cycles debugging the same hallucinations on loop. That's not a team failure - it's a tooling gap. Non-deterministic systems can't be engineered with tools designed for deterministic software. The math doesn't work That's the gap Future AGI was built to close. And today, it's yours. The entire stack. UI. Backend. Simulation. Evals. Optimization. Guardrails. Gateways. Everything you need to build agents that actually stress test themselves and improve from production data, autonomously. Because the real unlock isn't better observability, it's closing the loop where: Agent fails → system simulates why → runs evals → generates fix → validates on real traffic → deploys → monitors for regressions. Why open source? Because asking you to trust a closed system to autonomously improve your AI is absurd. You need to see the learning mechanisms. Inspect what's changing. Validate the optimization strategies. This is bigger than one company. Self-improving AI will define the next decade. And it starts with infrastructure everyone can build on. → GitHub link below. Star it. Run it. Push it to its limits.
3
20
44
4,559
Rishav Hada retweeted
You’ve curated the sources. You’ve researched everything. You know exactly what you want to say. You just can't get it out of your head and onto the page. We are building Almanac exactly for this. Experience the beta version here: try.almanac.so/ Here are a few things you can do with Almanac👇
17
39
125
46,312
24 Oct 2025
For the last year, friends kept telling me my feed is all product updates, a big shift from my research posts. We’ve been quietly brewing at @FutureAGI_ . Last month, we announced AgentCompass, and the response was incredible. Last week, we introduced Protect. [1/n]
1
1
3
63
24 Oct 2025
Here's a summary: 1. We unify text, image & audio safety under one framework. 2. We adopt Teacher-assisted relabeling for explainable, high-fidelity data. 3. Protect shows SOTA performance on public benchmarks, surpassing WildGuard, LlamaGuard-4, and GPT-4.1. [3/n]
1
1
57
24 Oct 2025
All of this is offered at ultra-low latency for real-time production interception. We have open-sourced our text adapters at HF: huggingface.co/collections/f… We're just getting started! [n/n]

1
35
Rishav Hada retweeted
24 Jun 2025
Most marketing stacks use AI. Few are truly intelligent. Join Bhavneet Kaur (VP, AI @ C5i) in conversation with @itsjustnikhil from Future AGI to explore how to build AI-native marketing platforms that actually deliver: – Reading the MarTech shift – Architecting predictive layers – Scaling with trust & speed 🗓 July 1 | 9:30 AM PT RSVP → lu.ma/nn9m3mo7 #FutureAGI #MarTech #GenAI #ReliableAI
2
5
89
Rishav Hada retweeted
17 Jun 2025
Future AGI lands at @superai_conf Singapore! 🇸🇬 If you're thinking beyond benchmarks and into real-world reliability, come talk to @itsjustnikhil. Don't miss our founder’s talk on "Building Reliable AI" at The Forum. This session will equip you with the essential tools to drive the next generation of AI using evals, observability, and intelligent guardrails to bridge the gap between capability and confidence. Let’s talk about the future and how to build it responsibly.
2
8
88
27 Apr 2025
After amazing participation in our first two sessions where we did a deep dive into how to setup smarter evaluations for your GenAI applications, we're back with a third one. This time we'll be discussing strategies around scaling AI engineering. Looking forward to this one!
25 Apr 2025
Is your AI stack ready for agentic scale? Join Sandeep Kaipu (Engineering Leader @Broadcom) and our founder, @itsjustnikhil, as they share a practical playbook for scaling GenAI infra, aligning with KPIs, and securing compliance. 📅 May 8 | 9:30 AM PT 🔗 RSVP → lu.ma/cb4g9n1e #FutureAGI #AIInfrastructure #EnterpriseAI #AIAgents
4
153
26 Mar 2025
With amazing participation in our 1st webinar, we're back with another one. This time, we're getting our hands dirty diving deep into the many layers of AI evaluation. We'll cover practical techniques to spot issues early, enhance data quality, & build more reliable GenAI apps.
25 Mar 2025
Everyone talks about building AI. Few talk about evaluating it well. Future AGI is hosting a webinar on AI evaluation—catching issues early, refining datasets, and ensuring trustworthy GenAI. 📅 April 4 | 9:30 AM PT | Online 🔗 Register here → lu.ma/5nsxmlxn
1
4
130
Rishav Hada retweeted
Let's talk a bit about authorship order on a paper. Yes, everyone cares about it, and it can become very emotional. Even if you think that big professors don't care, they do (although I know two professors who don't—you can probably guess who).
1
7
156
48,228
12 Feb 2025
At @FutureAGI_, we have developed state-of-the-art evaluation methods tailored for real-world business use cases. Some of my favorite features include error localization in input data, and a robust framework for synthetic data generation, among others. [3/n]
1
47
12 Feb 2025
Excited for everyone to try them out (rb.gy/ymysdc), and always eager to hear your thoughts and feedback. We are just getting started 🚀 . [n/n]

31
Rishav Hada retweeted
Wanted to share a thought I had for training LLMs after reading the Deepseek R1 paper. At a high level, this can help inculcate feedback coming from humans/models/rules into the training process dynamically as and when required. More details here: app.affine.pro/workspace/d2b…

5
4
24
2,740
Rishav Hada retweeted
14 Jan 2025
MSR India is accepting applications for the 2025 Research Fellow program
Microsoft Research India is excited to announce applications are open for our Research Fellow program (deadline 15th Feb 2025). Details of the program and the application are here: 🔗 Research Fellow program: aka.ms/msrirf @MSFTResearch
1
12
103
12,280
Rishav Hada retweeted
Cursor engineers are coming for our jobs
320
1,553
33,152
2,300,044
Rishav Hada retweeted
Ever wondered about inducing long-context abilities in NMT (or any old transformer) model "efficiently"? In our latest work, we pose the same question, and explore the interchangeability of sinusoidal PEs to newer extendable PEs like RoPE and ALiBi.
1
5
10
1,808