I’ll be in the Bay Area this week, June 8–12, and would be happy to visit AI labs, companies, startups, or research teams for a seminar, chalk talk, or informal discussion.
My group has been working on three connected directions that may be relevant to teams building foundation models, agents, and embodied AI:
1. Reasoning as control.
How should agents allocate test-time compute across decoding, search, tool use, verification, collaboration, and safety intervention? We view inference-time reasoning as a control problem and study runtime policies for reliable planning agents.
2. Self-improving trustworthy foundation models.
As foundation models become more agentic, failures become more dynamic: hallucinated grounding, brittle refusals, reward-model blind spots, adversarial reasoning traces, and poisoned alignment pipelines. We are building closed-loop systems with auditors, actuators, and amplifiers: systems that detect, intervene, recover, and learn from failure.
3. World models for physical intelligence.
Robotics faces a data bottleneck. High-quality teleoperation data is scarce, human demonstrations are embodiment-mismatched, and internet videos are abundant but noisy and actionless. We study world models as a bridge from imperfect experience to grounded robot behavior.
I’d be especially excited to connect with people working on agents, post-training, alignment, robotics, multimodal models, test-time compute, world models, and physical intelligence.
Short notice is totally fine - happy to do a seminar, chalk talk, small-group discussion, or just meet with research teams. Please DM me if useful.