deep Manifold

deep Manifold

Users
Tweets

deep Manifold

@BetaTomorrow

Jun 14

Title: Preserving Plasticity in Continual Learning via Dynamical Isometry Authors: Andries Rosseau, Robert Müller (@deepqlearning), Ann Nowé From the Deep Manifold view, dynamical isometry helps preserve plasticity, but it is not the source of plasticity. The deeper source comes from high-order nonlinear data forcing stacked piecewise manifolds to form ring / torus-like stationary structures. These coupled stationary structures create elastic directions where weak perturbations can move the solution without destroying previously learned geometry. In this sense, layer-wise isometry is an important preservation mechanism, while Deep Manifold places plasticity inside a broader geometric picture: node-cover reorientation, accumulated curvature, interconnected toroidal geometry, and eventual manifold rigidity. #DeepManifoldInterpretation

Agentic Systems Lab | ETH Zurich

@ETH_agent_lab

Jun 8

Replying to @ETH_agent_lab

📄 Paper 6/6 Preserving Plasticity in Continual Learning via Dynamical Isometry Rosseau A, Müller R, Nowe A. Accepted at ICML 2026 Main Track ICML: icml.cc/virtual/2026/poster/… @deepqlearning

3,360

Agentic Systems Lab | ETH Zurich

Agentic Systems Lab | ETH Zurich

@ETH_agent_lab

Jun 8

📄 Paper 6/6 Preserving Plasticity in Continual Learning via Dynamical Isometry Rosseau A, Müller R, Nowe A. Accepted at ICML 2026 Main Track ICML: icml.cc/virtual/2026/poster/… @deepqlearning

3,548

Agentic Systems Lab | ETH Zurich

Agentic Systems Lab | ETH Zurich

@ETH_agent_lab

Jun 8

📄 Paper 5/6⁠ Reinforcement Learning for Tool-Calling Agents in Fast Healthcare Interoperability Resources (FHIR) Knorr M*, Müller R*, Bremer JP, Schweingruber N. Accepted at ICML 2026 Main Track arXiv: arxiv.org/pdf/2605.14126 ICML: icml.cc/virtual/2026/poster/… @deepqlearning

Agentic Systems Lab | ETH Zurich

Agentic Systems Lab | ETH Zurich

@ETH_agent_lab

Jun 8

📄 Paper 3/6 Multi-Agent Reinforcement Learning of Karma Bidding Strategies Riehl K, Psarou A, Müller R, Wu F, Langer P, Jakob R, Hollbeck G, Kouvelas A, Kucharski R, Makridis MA. Accepted at ICML 2026 NExT-Game Workshop OpenReview: openreview.net/forum?id=HaUU… @DerRiehl @deepqlearning @PatrickLanger20 @robertjakob @gaborhollbeck

Multi-Agent Reinforcement Learning of Karma Bidding Strategies

Capacity-constrained shared infrastructure systems require demand management mechanisms that balance efficiency and fairness. Karma mechanisms address this challenge using an artificial...

openreview.net

105

Agentic Systems Lab | ETH Zurich

Agentic Systems Lab | ETH Zurich

@ETH_agent_lab

Jun 8

Excited to share that members of our lab co-authored 6 papers accepted at #ICML2026, including three Main Track and three Workshop papers 🔥🚀 📄 Accepted papers: ▪️ OpenTSLM: Time-Series Language Models for Reasoning over Multivariate Medical Text- and Time-Series Data [Main Track] ▪️ Auditing Emotion-Vector-Steered Political Bias in Open-Weight LLMs [AI4GOOD Workshop] ▪️ Reinforcement Learning of Karma Bidding Strategies [NExT-Game Workshop] ▪️ Cinematic Source Separation with Dialogue-Driven Sidechain Ducking [Workshop on Machine Learning for Audio] ▪️ Reinforcement Learning for Tool-Calling Agents in Fast Healthcare Interoperability Resources (FHIR) [Main Track] ▪️ Preserving Plasticity in Continual Learning via Dynamical Isometry [Main Track] @robertjakob @PatrickLanger20 @gaborhollbeck @f14wn @DerRiehl @atoof_sh @deepqlearning @kev_osull @cs06thegreat @nzuma0 @maxrosenblattl Huge congratulations to everyone involved. We are looking forward to presenting these works, reconnecting with colleagues, and meeting new friends in Seoul 🇰🇷 🔗 Full links in the comments.

1,779

阿空

阿空 @RemptyGame

Mar 25

【 Ren'Py 深度學習 (Deep Q-Learning)】空師傅又來啦！當 Galgame 引擎遇上神經網路，在 Ren'Py 做深度學習！這AI走位是真的燒，難道人工AI要被輾壓了嗎？(っ °Д °;)っ youtu.be/A9WWtILHTp4 #遊戲開發 #renpy #遊戲引擎 #格鬥遊戲 #人工智慧 #強化學習 #深度學習 #deepqlearning

當 Galgame 引擎遇上神經網路，在 Ren'Py 裡養出了一個格鬥 AI。

見識咱們 Renpy 的無限潛力！【🚩粉絲作品集、遊戲募集中】https://forms.gle/F2u5GMCd4qUUVVex7...

youtube.com

187

Bioengineering MDPI

Bioengineering MDPI @Bioeng_MDPI

Feb 17

💥Highly recommended publication: "On Automated Object Grasping for Intelligent Prosthetic Hands Using Machine Learning" 🔗shorturl.at/WZY01 📌#DeepQLearning #AIinRobotics #ProstheticHands

Applied Sciences MDPI

Applied Sciences MDPI

@Applsci

24 Jul 2025

🔥 Read our Highly Cited Paper 📚 #DeepQLearning-Based Smart Scheduling of EVs for #DemandResponse in #SmartGrids 🔗 mdpi.com/2076-3417/14/4/1421 👨‍🔬 Viorica Rozina Chifu et al. 🏫 @utcluj #EVscheduling #reinforcementlearning

Glen Berseth

Glen Berseth @GlenBerseth

18 Mar 2025

#DeepQlearning for continuous actions is key for controlling #robots, but it has been tricky to train #largeModels to get those performance gains. In these lectures I cover the fundamentals and explain how new research is bending the rules of #thedeadlytriad to advance #scaling.

106

5,491

Surya Prakash

Surya Prakash @SuryaMadasi

30 Oct 2023

Deep Q Learning with PyTorch. #PyTorch #development #python #deepqlearning @ThePracticalDev @akshayballal95 dev.to/akshayballal/deep-q-l…

Deep Q Learning with PyTorch

Introduction This blog is going to be my second one on Reinforcement Learning. You can...

dev.to

105

enlightwise

enlightwise @enlightwise

8 Apr 2023

Although training a pendulum to swing up is a difficult task in itself, the real challenge lies in defining the reward system to guide the learning process. #screenshotsaturday #madewithunity #MachineLearning #DeepQLearning #PendulumProblems

0:07

Cobra 𐤊

Cobra 𐤊

@entropia_acc

12 Mar 2023

#MPO #Robotics #AI #ReinforcementLearning #RL #MachineLearning #LibTorch #PyTorch #DeepQLearning algorithm in C that sees the #actorcritic and their targets as distributions and updates with a #KLDivergence. Networks can be adjusted to make it #MOMPO github.com/MotorCityCobra/C_…

GitHub - MotorCityCobra/C_plusplus_mpo

Contribute to MotorCityCobra/C_plusplus_mpo development by creating an account on GitHub.

github.com

130

Reluctant Quant

Reluctant Quant

@DrMattCrowson

4 Jan 2023

RT Applied Reinforcement Learning III: Deep Q-Networks (DQN) dlvr.it/SgNpVj #machinelearning #deepqlearning #dqn #artificialintelligence

117

Gulshan Yadav 🥇

Dr. Ganapathi Pulipaka 🇺🇸

Dr. Ganapathi Pulipaka 🇺🇸

@gp_pulipaka

3 Nov 2022

#ReinforcementLearning: #DeepQLearning. #BigData #Analytics #DataScience #AI #MachineLearning #IoT #IIoT #PyTorch #Python #RStats #TensorFlow #Java #JavaScript #ReactJS #CloudComputing #Serverless #DataScientist #Linux #Programming #Coding #100DaysofCode geni.us/RLearning-DeepQ

Dr. Ganapathi Pulipaka 🇺🇸

Dr. Ganapathi Pulipaka 🇺🇸

@gp_pulipaka

10 Nov 2021

Exploring #DeepQLearning for #ReinforcementLearning. #BigData #Analytics #DataScience #AI #MachineLearning #IoT #IIoT #Python #RStats #TensorFlow #JavaScript #ReactJS #CloudComputing #Serverless #DataScientist #Linux #Programming #Coding #100DaysofCode go.nature.com/3cfZObH

Dr. Ganapathi Pulipaka 🇺🇸

Dr. Ganapathi Pulipaka 🇺🇸

@gp_pulipaka

4 Nov 2021

Dr. Ganapathi Pulipaka 🇺🇸

Dr. Ganapathi Pulipaka 🇺🇸

@gp_pulipaka

7 Oct 2021

#DeepQLearning to Solve (Banana Collector). #BigData #Analytics #DataScience #AI #MachineLearning #ReinforcementLearning #IoT #IIoT #Python #RStats #TensorFlow #JavaScript #ReactJS #Serverless #DataScientist #Linux #Programming #Coding #100DaysofCode bit.ly/3iB8Tzm

0:42

Institute for TMDT

Institute for TMDT @TMDTWuppertal

15 Sep 2021

Die Uhr läuft, in 60 Sekunden erklären wir @TMDTWuppertal , @Uni_Wuppertal, wieder einen #Forschungsbegriff. Diesmal: Was ist Deep Q-Learning? Das erklärt Euch @ja_r_pe auf unserem YouTube-Kanal ➡youtu.be/gN02oEdOzpI #DeepQLearning #ReinforcementLearning #60secondspitches

AISulyman

AISulyman @AiSulyman

13 Sep 2021

I accidentally commented out this line of code and it caused issues with my #AI training Took me months to realize and now I get PTSD whenever I see it... --- #100DaysOfCode #MachineLearning #NeuralNetwork #QLearning #DeepQLearning #ArtificialIntelligence

0:03