Excited to share MyScholarQA - a personalized deep research tool that learns from your papers and lets you customize reports! ๐งโ๐ฌ๐๏ธ
Our #ACL2026 paper built and evaluated it, showing simulated users (LLMs) couldn't mimic what real users wanted ๐
Spicy results a live demo ๐๐งต
Are you a researcher in CS or a CS-adjacent field curious about how an AI agent can help you with your research project? Want to try a new tool for your research support in a paid user study ($100, 2 hr)? Limited spot numbers. See details and sign up here: forms.gle/JzLtkAhe7TtvuiwQ8
๐คGiving complex tasks to AI agents is easyโgetting them to do exactly what you want isnโt. How can human-AI collaboration give us more reliable & steerable agents?
๐ซIntroducing Cocoa, our new interaction paradigm to balance human & AI agency in complex human-AI workflows. ๐งต
We analyzed 250K queries & 430K clickstream interactions from Asta, our AI-powered research assistantโand today we're releasing the full dataset. How do researchers actually use AI science tools? Here's what we found. ๐งต
What is future of reading? ๐
Announcing the 1st Science & Technology of Augmented Reading (STAR) workshop at #CHI2026!
We want your takes on: ๐ค AI & Agents for reading ๐๏ธ Visual Interactions ๐บ๏ธ Domains (Code, Law, Ed, etc.)
๐ Submit a 2-4 pg paper: chi-star-workshop.github.io/
What is future of reading? ๐
Announcing the 1st Science & Technology of Augmented Reading (STAR) workshop at #CHI2026!
We want your takes on: ๐ค AI & Agents for reading ๐๏ธ Visual Interactions ๐บ๏ธ Domains (Code, Law, Ed, etc.)
๐ Submit a 2-4 pg paper: chi-star-workshop.github.io/
What is future of reading? ๐
Announcing the 1st Science & Technology of Augmented Reading (STAR) workshop at #CHI2026!
We want your takes on: ๐ค AI & Agents for reading ๐๏ธ Visual Interactions ๐บ๏ธ Domains (Code, Law, Ed, etc.)
๐ Submit a 2-4 pg paper: chi-star-workshop.github.io/
Introducing Theorizer: Turning thousands of papers into scientific laws ๐โก๏ธ๐
Most automated discovery systems focus on experimentation. Theorizer tackles the other half of science: theory buildingโcompressing scattered findings into structured, testable claims. ๐งต
๐ New in Asta: multi-turn report generation.
You can now have back-and-forth conversations with Asta, our agentic platform for scientific research, to refine long-form, fully cited reports instead of relying on single-shot prompts.
Ever want to ask questions about a paper, including its figures & tables? ๐๐ Want smoother interactions w/papers on desktop & mobile?
Try Paper Figure QA, a new tool from @allen_ai that answers with the original figures, tables, and excerpts from papers: paperfigureqa.allen.ai
ALT A screenshot of a user asking about examples of how a system works in a paper, and the system responding with details about the system along with a relevant figure and caption from the paper.
Update: DataVoyager, which we launched in Preview early this fall, is now available in Asta. ๐
You can upload real datasets, ask complex research questions in natural language, & get back reproducible answers visualizations. ๐๐
Ai2 is hiring Summer interns to work on:
- human-AI interaction and intelligent UIs
- research support systems e.g., lit reviews, ideation
- and more!
We encourage publishing and public deployments. Apply with link below before Dec 15 for the first reviews cycle : )
#chi2026
Apply here: job-boards.greenhouse.io/theโฆ
DM also open if you have general questions or are interested in working on a topic with me specifically : )
Last year I worked at @Adobe@AdobeResearch and @allen_ai, exploring how we can help users read, organize and understand long documents. This piece covers what we learned on modelling user intent and combining LLMs with principled tools when building complex pipelines for it!
CDS PhD alum Vishakh Padmakumar (@vishakh_pk), now at @Stanford, tackled the hard part of summarization โ deciding what matters.
At @Adobe, he built diversity-aware summarizers; at AI2 (@allen_ai), intent-based tools for literature review tables.
nyudatascience.medium.com/suโฆ
Introducing Astaโour bold initiative to accelerate science with trustworthy, capable agents, benchmarks, & developer resources that bring clarity to the landscape of scientific AI agents. ๐งต
Producing reasoning texts boosts the capabilities of AI models, but do we humans correctly understand these texts? Our latest research suggests that we do not.
This highlights a new angle on the "Are they transparent?" debate: they might be, but we misinterpret them. ๐งต
People at #ACL2025, come drop by our poster today & chat with me about how context matters for reliable language model evaluations!
Jul 30, 11:00-12:30 at Hall 4X, board 424.
Excited to share โจ Contextualized Evaluations โจ!
Benchmarks like Chatbot Arena contain underspecified queries, which can lead to arbitrary eval judgments. What happens if we provide evaluators with context (e.g who's the user, what's their intent) when judging LM outputs? ๐งตโ
Ai2 is excited to be at #ACL2025 in Vienna, Austria this week. Come say hello, meet the team, and chat about the future of NLP. See you there! ๐ค๐
In Vienna for #ACL2025NLP this week!
@josephcc, @aps6992 and I will present the Ai2 ScholarQA scientific QA system on Wed. Iโll also be at @sdpworkshop on Thurs!
Hit me up if youโd like to chat about agents for science and post-training, or explore cafes in Vienna ๐ฅ