Tom Sander (Ph.D.)

Tweets

Pinned Tweet

Tom Sander (Ph.D.)@RednasTom

26 Feb 2024

OpenAI may secretly know that you trained on GPT outputs! In our work "Watermarking Makes Language Models Radioactive", we show that training on watermarked text can be easily spotted ☢️ Paper: arxiv.org/abs/2402.14904 @pierrefdz @AIatMeta @Polytechnique @Inria

Tom Sander (Ph.D.) retweeted

Tom Sander (Ph.D.)@RednasTom

May 13

1/9 Excited to share TextSeal, our new state-of-the-art watermark for large language models at FAIR / Meta Superintelligence Labs (@AIatMeta) 🔐 Paper: arxiv.org/abs/2605.12456 Code: github.com/facebookresearch/…

Tom Sander (Ph.D.)@RednasTom

May 13

8/9 Novelty 3: fast localized detection. Real documents are often mixed: some human text, some AI-generated text. TextSeal searches for watermarked regions (previous figure), so detection remains strong even when the signal is diluted (results here)🧭
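To illustrate why a region search helps on mixed documents, here is a minimal sketch. It is not TextSeal's detector: it assumes a toy "green-list" watermark where a keyed hash of the previous token marks roughly half of all candidate tokens as green. The names is_green, green_flags, z_score, best_window, the key "demo-key" and the GAMMA value are all hypothetical.

```python
import hashlib
import math

GAMMA = 0.5  # assumed share of tokens that count as "green" by chance


def is_green(prev_token: str, token: str, key: str = "demo-key") -> bool:
    """Toy keyed hash (an assumption, not TextSeal): is `token` green after `prev_token`?"""
    digest = hashlib.sha256(f"{key}|{prev_token}|{token}".encode()).digest()
    return digest[0] / 256.0 < GAMMA


def green_flags(tokens, key: str = "demo-key"):
    """One boolean per token that has a predecessor."""
    return [is_green(p, t, key) for p, t in zip(tokens, tokens[1:])]


def z_score(hits: int, n: int) -> float:
    """Standardized excess of green tokens over the chance rate GAMMA."""
    return (hits - GAMMA * n) / math.sqrt(n * GAMMA * (1 - GAMMA))


def best_window(flags, width: int):
    """Slide a fixed-width window over the flags; return (start, z) of the best one."""
    if width <= 0 or len(flags) < width:
        raise ValueError("width must be positive and no longer than the flag list")
    hits = sum(flags[:width])
    best_start, best_hits = 0, hits
    for start in range(1, len(flags) - width + 1):
        # Update the running count: add the entering flag, drop the leaving one.
        hits += flags[start + width - 1] - flags[start - 1]
        if hits > best_hits:
            best_start, best_hits = start, hits
    return best_start, z_score(best_hits, width)
```

A score computed over the whole document averages the watermarked span with the unmarked text around it, while the best window isolates the span. Because many windows are scanned, the best window's z-score overstates significance and needs a multiple-testing correction before it is read as evidence.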

Tom Sander (Ph.D.)@RednasTom

May 13

9/9 Beyond provenance, TextSeal is “radioactive”: its signal can transfer through model distillation, helping detect when another model was trained on watermarked outputs. Try it out! Code is Apache 2.0. Paper: arxiv.org/abs/2605.12456 Code: github.com/facebookresearch/…

Tom Sander (Ph.D.)@RednasTom

Apr 7

Delighted to share that last month, I successfully defended my Ph.D. in Mathematics! 🎓 Huge thanks to my incredible advisors, Chuan Guo at @MetaAI (FAIR) and Alain Durmus at @Polytechnique, for their phenomenal mentorship and support throughout this journey.

Tom Sander (Ph.D.)@RednasTom

Apr 7

My research focuses on the intersection of machine learning and security, specifically Privacy, Traceability, Provenance and Watermarking in Deep Learning. It has been incredibly rewarding to work on making AI models more secure, transparent and accountable.

Tom Sander (Ph.D.)@RednasTom

Apr 7

A sincere thank you to my thesis committee, my brilliant colleagues at FAIR and Polytechnique, and everyone who has encouraged me along the way. 🚀 scholar.google.com/citations…

Tom Sander (Ph.D.) retweeted

Paul-Ambroise Duquenne @duquenne_pa

Mar 17

A couple of months after OmniASR, we’re excited to release OmniSONAR alongside OmniMT. OmniSONAR brings new training recipes for cross-lingual and cross-modal sentence encoders, enabling massively multilingual embeddings for text and speech. tinyurl.com/omnisonar 🧵 1/3

Tom Sander (Ph.D.)@RednasTom

Jan 29

Most text watermarking methods focus on generation time. But what about existing text? We explore "Post-Hoc Watermarking," using an LLM to rephrase and watermark copyrighted books, training data, or similar content. 🧵 arxiv.org/abs/2512.16904 github.com/facebookresearch/…

How Good is Post-Hoc Watermarking With Language Model Rephrasing?

Generation-time text watermarking embeds statistical signals into text for traceability of AI-generated content. We explore *post-hoc watermarking* where an LLM rewrites existing text while...

arxiv.org

Tom Sander (Ph.D.)@RednasTom

Jan 29

Why does this matter? "Watermark Radioactivity." If we watermark specific documents post-hoc, we can detect if they are used to train future models or retrieved in RAG systems. It turns passive data into active tracers.
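The tracer idea can be read as a hypothesis test. The sketch below is an assumption-laden illustration, not the procedure from the paper: it supposes each scored token from a suspect model's outputs has already been labeled green or not under the secret key, and that without any watermark exposure a token is green with probability GAMMA. The names binomial_tail and radioactivity_p_value are hypothetical.

```python
import math

GAMMA = 0.5  # assumed chance rate of green tokens for a model never exposed to the watermark


def binomial_tail(hits: int, n: int, p: float = GAMMA) -> float:
    """Exact P[X >= hits] for X ~ Binomial(n, p)."""
    return sum(
        math.comb(n, k) * p**k * (1 - p) ** (n - k)
        for k in range(hits, n + 1)
    )


def radioactivity_p_value(flags) -> float:
    """One-sided p-value for 'more green tokens than chance' in a suspect model's outputs."""
    return binomial_tail(sum(flags), len(flags))
```

A small p-value suggests the suspect model reproduces the watermark's bias more often than chance allows. The test treats scored tokens as independent, so repeated contexts would need to be de-duplicated before counting for the p-value to be trustworthy.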

Tom Sander (Ph.D.)@RednasTom

Jan 29

With the usual suspect @pierrefdz and the rest of the AVSeal Team! We’ve released our code as part of the Meta Seal release. Check out the full paper 📄 Paper: arxiv.org/abs/2512.16904 Website: facebookresearch.github.io/m… 💻 Code: github.com/facebookresearch/…
