We introduce 🌸✨ AlphaEvolve ✨🌸, an evolutionary coding agent using LLMs coupled with automatic evaluators, to tackle open scientific problems 🧑🔬 and optimize critical pieces of compute infra ⚙️
deepmind.google/discover/blo…
📢New Paper on Reward Modelling📢
Ever wondered how to choose the best comparisons when building a preference dataset for LLMs?
Our latest paper revives classic statistical methods to do it optimally!
Here’s a 🧵on how it works 👇
arxiv.org/abs/2502.04354
Image-generation diffusion models can draw arbitrary visual-patterns. What if we finetune Stable Diffusion to 🖌️ draw joint actions 🦾 on RGB observations?
Introducing 𝗚𝗘𝗡𝗜𝗠𝗔
paper, videos, code, ckpts: genima-robot.github.io/
🧵Thread⬇️
🚨Important update from our Robot Learning Lab in London.
Following recent news, we’re moving on after a wonderful 2 years…
Today, we unveil 4 big pieces of research from our incredible team. Check out the compilation video and thread below to see our final work! 📽️👇
We build neural codecs from a *single* image or video, achieving compression performance close to SOTA models trained on large datasets, while requiring ~100x fewer FLOPs for decoding ⚡ #CVPR2024c3-neural-compression.github…
We present #FunSearch in @Nature today - a system combining LLMs with evolutionary search to generate new discoveries in math and computer science! 👩🔬🔬✨
Introducing FunSearch in @Nature: a method using large language models to search for new solutions in mathematics & computer science. 🔍
It pairs the creativity of an LLM with an automated evaluator to guard against hallucinations and incorrect ideas. 🧵 dpmd.ai/x-funsearch
We construct neural processes by iteratively transforming a simple stochastic process into an expressive one, similar to flow/diffusion-based models, but in function space!
Join us at our #NeurIPS2023 poster session: neurips.cc/virtual/2023/post… on Wednesday morning!
Introducing Manifold Diffusion Fields (MDF), our new work on learning generative models over fields defined on curved geometries. This is joint work with our intern @Ahmed_AI035 (who hasn’t even started his PhD yet!) and @jsusskin at @Apple MLR arxiv.org/abs/2305.15586 🧵
Manifold Diffusion Fields
present Manifold Diffusion Fields (MDF), an approach to learn generative models of continuous functions defined over Riemannian manifolds. Leveraging insights from spectral geometry analysis, we define an intrinsic coordinate system on the manifold via the eigen-functions of the Laplace-Beltrami Operator. MDF represents functions using an explicit parametrization formed by a set of multiple input-output pairs. Our approach allows to sample continuous functions on manifolds and is invariant with respect to rigid and isometric transformations of the manifold. Empirical results on several datasets and manifolds show that MDF can capture distributions of such functions with better diversity and fidelity than previous approaches
paper page: huggingface.co/papers/2305.1…
Very happy to announce that our latest paper on Neural data compression with INRs, Meta Learning & Sparse Subnetwork selection has been accepted to #ICML2023 (Scores 7, 7, 7). 1/N
Paper: arxiv.org/abs/2301.09479
Can deep transformers be trained without skip connections nor normalisation layers?
Our ICLR 2023 paper shows you how, using wide NN signal propagation ideas. We hope this can potentially pave the way to more efficient deep LLMs! (1/9)
Paper: arxiv.org/abs/2302.10322
Previously we had introduced *functa*, a framework for representing data as neural functions (aka neural fields, INRs) and doing deep learning on them.
In our recent work *spatial functa* we show how to scale up the approach to ImageNet-1k 256x256.
📝arxiv.org/abs/2302.03130
Ever wondered why deep learning is always done on array data?🤔 Happy to announce our work:
From data to functa: Your data point is a function and you can treat it like one
📝arxiv.org/abs/2201.12204 w/ @emidup@arkitus@DaniloJRezende@danrsm, to appear in ICML22