PyTorch 2.12 introduces major updates across compilation, export, distributed training, and accelerator support.
Highlights include up to 100x faster batched linalg.eigh on CUDA, the new torch.accelerator.Graph API, Microscaling quantization support in torch.export.save, and fused Adagrad.
The release includes 2,926 commits from 457 contributors since PyTorch 2.11.
Have questions? Join @AndreyTalman (@Meta), @albanDesmaison (@Meta), and @joespeez (@reflection_ai), moderated by @Chris_AI_HPC (@Meta), on May 20 at 10:00 AM PT for a live Q&A covering the release and answering questions from the community.
🔗 Read the release blog and register for the webinar: pytorch.org/blog/pytorch-2-1… #PyTorch #OpenSourceAI #MachineLearning #AIInfrastructure
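For readers who have not used it, here is a minimal sketch of a batched `torch.linalg.eigh` call. It runs on CPU so it works anywhere, and the batch size and matrix size are arbitrary assumptions. It only illustrates the call shape and does not demonstrate the CUDA speedup mentioned in the post.

```python
import torch

torch.manual_seed(0)

# A batch of 8 random symmetric 4x4 matrices (sizes chosen only for illustration).
a = torch.randn(8, 4, 4, dtype=torch.float64)
sym = (a + a.transpose(-2, -1)) / 2

# eigh works on the last two dimensions, so the whole batch is a single call.
eigenvalues, eigenvectors = torch.linalg.eigh(sym)

# Rebuild each matrix as V diag(w) V^T to sanity-check the decomposition.
reconstructed = (
    eigenvectors @ torch.diag_embed(eigenvalues) @ eigenvectors.transpose(-2, -1)
)
max_error = (reconstructed - sym).abs().max().item()
print(eigenvalues.shape, eigenvectors.shape, max_error)
```

To try the same call on a GPU, create `a` with `device="cuda"`. Nothing else changes.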
If you are at GTC and you care about AI frameworks, don't miss your chance to talk with the PyTorch experts at the booth. We have a packed schedule of experts on different topics available for you to meet at booth #338.
🗓️ Plan your week: Check out the full "Meet the PyTorch Experts" schedule here: pytorch.org/event/nvidia-gtc…
We'll be posting the daily lineups here in this thread all week. See you at the booth! 🤝 @NVIDIADev
PyTorch 2.10 is now available, with updates focused on performance, determinism, and numerical debugging for modern training and post-training workflows.
Highlights include Python 3.14 support for torch.compile(), reduced kernel launch overhead in TorchInductor, a new varlen_attn() op for variable-length sequences, and improved tools for tracking numerical divergence.
🖇️ 🔥 Read the PyTorch 2.10 release blog and release notes: hubs.la/Q03_NHfT0 #PyTorch #OpenSourceAI #AIInfrastructure
Sharing a little of what I have been up to recently (I help organize the collaborative roadmap process for the PyTorch team at Meta) -- I've done a lot of roadmaps in my career and I like the way PyTorch does it best.
Meta's PyTorch teams published their actual roadmap documents publicly for the first time: dev-discuss.pytorch.org/t/me…
Our engineers' incentives and objectives are laid bare for folks to read 🙂
While all PyTorch development happens publicly on GitHub, the actual planning and roadmap documents that teams at various PyTorch-affiliated companies write weren't public, so we decided to change that for increased transparency.
Introducing ExecuTorch Alpha ⚡
ExecuTorch Alpha is focused on deploying large language models and large ML models to the edge, stabilizing the API surface, and improving installation processes.
Learn more in our latest blog: hubs.la/Q02vzrrW0
Announcing the alpha release of torchtune!
torchtune is a PyTorch-native library for fine-tuning LLMs. It combines hackable memory-efficient fine-tuning recipes with integrations into your favorite tools.
Get started fine-tuning today!
Details: hubs.la/Q02t214F0
Training a model (not something I do all that much these days) and feeling like myself in grad school "my computer is working hard on physics calculations therefore I am working hard, right?"
This year marks the 30th anniversary of the first draft of the MPI (Message Passing Interface) standard, and sadly also the passing of one of its original founders, Rusty Lusk, of @argonne National Lab. Vale. #HPC obituaries.neptunesociety.co…
Big announcement: PyTorch Foundation!
PyTorch has large core investments from many companies. So, we're creating a neutral foundation for securing assets and interests.
Technical Governance is separate & secure in a Maintainer model.
Here's more context:
pytorch.org/blog/PyTorchfoun…
Introducing torch.profiler! The new PyTorch Profiler collects both GPU- and framework-related information, correlates them, automatically detects bottlenecks in the model, generates recommendations on how to resolve them, and visualizes the results.
Read 👉pytorch.org/blog/introducing…
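A minimal sketch of collecting a trace with `torch.profiler`, assuming a CPU-only machine and a toy `Linear` model that is purely illustrative. It only prints the raw per-operator table. The bottleneck detection and recommendations described in the post are not shown here.

```python
import torch
from torch.profiler import profile, ProfilerActivity

# Toy model and input, chosen only to give the profiler something to record.
model = torch.nn.Linear(64, 32)
inputs = torch.randn(16, 64)

# CPU-only so it runs without a GPU; add ProfilerActivity.CUDA on a CUDA machine.
with profile(activities=[ProfilerActivity.CPU], record_shapes=True) as prof:
    with torch.no_grad():
        output = model(inputs)

# Aggregate the recorded events per operator and show the most expensive ones.
events = prof.key_averages()
print(events.table(sort_by="cpu_time_total", row_limit=5))
```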
PyTorch 1.8 supports ROCm wheels, providing an easy onboarding for using AMD GPUs. You can simply go to the standard PyTorch installation selector and choose ROCm as an installation option and execute the provided command. Learn more below: pytorch.org/blog/pytorch-for…
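After installing, a quick way to check which kind of build you ended up with might look like the sketch below. It relies on `torch.version.hip`, which is a version string on ROCm builds and `None` otherwise, and on ROCm builds exposing AMD GPUs through the `torch.cuda` namespace. It falls back to CPU when no GPU is visible.

```python
import torch

# torch.version.hip is set on ROCm builds and is None on CUDA or CPU-only builds.
is_rocm_build = torch.version.hip is not None

# ROCm builds reuse the torch.cuda namespace, so this also reports AMD GPUs.
gpu_available = torch.cuda.is_available()

# Use the GPU when one is visible, otherwise stay on CPU.
device = torch.device("cuda" if gpu_available else "cpu")
x = torch.ones(3, device=device)
print(f"ROCm build: {is_rocm_build}, GPU available: {gpu_available}, device: {x.device}")
```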
This is one of the best pieces on this issue that I've seen so far. It highlights the direction of the research Dr. Gebru was attempting to get published, which is at the heart of the issue. My 2c is that she and her coauthors were raising potentially legit issues with important models.
New blog post! "Object Detection at 1840 FPS with TorchScript, TensorRT and DeepStream". In which I go deeper into production machine-learning technologies for PyTorch.
paulbridger.com/posts/video-…
A friend of mine, a person that I have been admiring for years, is dying because of Covid19.
She is currently in the ICU but has just undergone a stroke and doctors say her condition is irreversible.
People die because of Covid19; if you do not comply, you implicitly kill others
Two things I should always try to remember:
1. I have never regretted a single instance of kindness
2. I have regretted, sometimes for years, every cruel impulse I have indulged