What if we could mathematically prove that code does what it's supposed to do, not just test it and hope?
The Caltech AI Alignment Group hosted @ClarkBarrett7 from @Stanford for a talk on CSLib, a platform for AI-assisted formal verification in Lean, and why proving code correct is becoming one of the most urgent problems in AI safety.
1/7
Congratulations to the entire CVC4/cvc5 team for winning the Rance Cleaveland Test-of-Time Tool Award at ETAPS this week!
etaps.org/awards/test-of-tim…
The CSLib steering committee recently announced the official launch of CSLib — an open-source effort to formalize computer science in Lean, inspired by the impact of Mathlib in mathematics.
CS researchers, practitioners, and enthusiasts are invited to get involved to support formalizing essential computer science concepts, and building infrastructure for reasoning about real-world code with Lean.
Learn more at:
🌐 cslib.io
📄 White paper: arxiv.org/abs/2602.04846
🤝 Contribute: github.com/leanprover/cslib/…#LeanLang#LeanProver#CSLib#OpenSource#FormalVerification
🥁And the #cav24 Award goes to...🥁
Clark Barrett @Stanford, David Dill @Stanford, Kyle Julian @Wing, Guy Katz @CseHuji and Mykel Kochenderfer @aiprof_mykel@Stanford for their #cav17 paper “Reluplex: An Efficient SMT Solver for Verifying Deep Neural Networks”
Congratulations! 👏
How can we train a language model to communicate with other agents? We propose informativeness as a training objective, where a sender's message is informative insofar as it increases the receiver's log probabilities over future observations conditional on the message. (1/8)
Are you ready for @eulerfinance's ✨$1.25M✨ audit competition on @cantinaxyz?
We're thrilled to announce that $100k of the total pot is being allocated to formal verification managed by @certora 🔥
We're looking to get to know the users of SMT solvers! Please DM us if you use any SMT solver, and especially if you use cvc5. Reposts for visibility are also appreciated!