Fourth-year Ph.D. student

Baptiste Debes

Speech and Image Processing Department (PSI - ESAT), KU Leuven

My research focuses on value-gradient methods in reinforcement learning: using learned environment models to provide richer training signals than scalar rewards or value targets alone. More broadly, I am interested in decoupled training of neural networks, with the goal of reducing the memory footprint of learning and making training less dependent on end-to-end backpropagation.

Baptiste Debes

Research Interests

Distributional reinforcement learning

Learning distributions over returns, rather than only expected values, to better represent uncertainty in reinforcement learning problems.

Pointers

  • A Distributional Perspective on Reinforcement Learning, Bellemare, Dabney & Munos
  • Distributional Value Gradients for Stochastic Environments, Debes & Tuytelaars

Value-gradient methods

Using learned environment models to estimate gradients of return targets with respect to actions, giving policy learning access to richer local information.

Pointers

  • Learning Continuous Control Policies by Stochastic Value Gradients, Heess et al.
  • How to Learn a Useful Critic? Model-Based Action-Gradient-Estimator Policy Optimization, D'Oro & Jaśkowski
  • Distributional Value Gradients for Stochastic Environments, Debes & Tuytelaars

Decoupled training of neural networks

Training neural-network modules with local or predicted learning signals, aiming to reduce update locking and memory requirements during optimization.

Pointers

  • Decoupled Neural Interfaces using Synthetic Gradients, Jaderberg et al.
  • Sobolev Training for Neural Networks, Czarnecki et al.

Selected Publications

Distributional Value Gradients for Stochastic Environments

Baptiste Debes and Tinne Tuytelaars

The Fourteenth International Conference on Learning Representations (ICLR), Rio de Janeiro, Brazil, 2026

Multivariate Distributional Reinforcement Learning Using Sliced Divergences

Baptiste Debes and Tinne Tuytelaars

Proceedings of the 43rd International Conference on Machine Learning (ICML), Seoul, South Korea, 2026

Recent Posts