Inioluwa Deborah Raji kicks off the workshop on Governance at the Technological Frontier: Translating Research into Policy for AI Oversight.
simons.berkeley.edu/workshop… #SimonsLive
Excited by the program of the workshop on "Agency in Collaborative Learning" at the Simons Institute for the Theory of Computing. Thanks to Kate Donahue (@kpaxdonahue) and John Duchi.
Yuval Ishai kicking off the secure computation workshop at Simons Institute, Berkeley. More about Yuval in another post, now back to the workshop. #SimonsLive
Can a diffusion model's generation accuracy be quantified?
arxiv.org/abs/2406.12839 gave the first bound that accounts for *both* the forward (score training) process and the backward (inference) process.
Making this bound smaller optimizes the design of diffusion models!
“Machine learning is linear algebra” - brilliant talk by Andrew Gordon Wilson on designing composable and compute-efficient models with inductive biases. People were studying SSMs over ten years ago in the GP literature. Everything old is new again. @SimonsInstitute #SimonsLive
“The closer you initialize to the edge of chaos, the deeper a network you can train.”—@SuryaGanguli on the connection between chaos in dynamical systems and training deep neural networks at the Simons Institute's workshop on Transformers as a Computational Model. #SimonsLive
“I want a neural net that solves harder problems than it was trained on, AKA…weak to strong generalization.” - Tom Goldstein at the Simons Institute's workshop on Transformers as a Computational Model. @tomgoldsteincs #SimonsLive
"Validity-breadth trade-offs as a general issue for language generation: Hallucination at one extreme; mode collapse at the other." — Jon Kleinberg of Cornell University at the Simons Institute's workshop on Transformers as a Computational Model. #SimonsLive
Eran Malach now giving a mechanistic explanation of how transformers solve copying and retrieval in practice and some examples of length generalization in various algorithmic tasks, using RASP as a programming language. #SimonsLive
Misha Belkin on emergence in neural networks using an analogy (Larva -> Pupa -> Butterfly): Continuity is not reflected externally by what you can observe. There might be continuity internally. At the Simons Institute's workshop on "Transformers as a Computational Model" #SimonsLive
Sampath Kannan, Associate Director of the Simons Institute, kicks off the workshop on "Transformers as a Computational Model" as part of the Special Year on Large Language Models and Transformers, Part 1. #SimonsLive
Loved @mraginsky's thought-provoking talk on Generalization from the Behavioral Perspective at @SimonsInstitute #SimonsLive. Inductive biases in learning systems set the epistemological bounds of any induction they can make, defining limits of generalization. The study of 1/3