We tested several unlearning methods and found none of them really erase knowledge from the model - they simply hide it! 🧐
What does this mean? We must tread carefully with unlearning research in diffusion models 🚨
Here is what we learned 🧵👇 (led by @kevinlu4588)
[Image: title slide reading "When Are Concepts Erased From Diffusion Models?"]
Excited to share our paper “When Are Concepts Erased from Diffusion Models?” at @NeurIPSConf!
We introduce two conceptual models for erasure mechanisms in diffusion models, and a suite of probes to recover supposedly forgotten concepts.
Project website: unerasing.baulab.info/