After LLMs and diffusion, Muon also shines on tabular foundation models!
Also nice to see they used cautious weight decay 🥌
Super excited that TabICLv2 is out 🎉
🚀Beats RealTabPFN-2.5 with no tuning and purely synthetic pre-training data.
👉Introduces QASSMax for long-context generalization, early target embedding, repeated feature grouping, Muon, etc., and a much diversified synthetic data prior.