holds a professorship for Physics-Enhanced Machine Learning at TU Munich.
His research focuses on the analysis and development of numerical algorithms for machine learning. This covers algorithms that enable, accelerate, and optimize the simulation and analysis of complex dynamical systems, as well as nonlinear manifold learning techniques, including data-driven approximations of Koopman and Laplace operators. Recently, his group has also worked on energy-efficient training of neural networks inspired by random feature modeling.
Discovering a suitable neural network architecture for modeling complex dynamical systems poses a formidable challenge, often involving extensive trial and error and navigation through a high-dimensional hyperparameter space. In this paper, we discuss a systematic approach to constructing neural architectures for modeling a subclass of dynamical systems, namely, linear time-invariant (LTI) systems. We use a variant of continuous-time neural networks in which the output of each neuron evolves continuously as the solution of a first-order or second-order ordinary differential equation. Instead of deriving the network architecture and parameters from data, we propose a gradient-free algorithm to compute a sparse architecture and the network parameters directly from the given LTI system, leveraging its properties. We bring forth a novel neural architecture paradigm featuring horizontal hidden layers and provide insights into why employing conventional neural architectures with vertical hidden layers may not be favorable. We also provide an upper bound on the numerical errors of our neural networks. Finally, we demonstrate the high accuracy of our constructed networks on three numerical examples.
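To make the construction-based idea concrete, the following is a minimal sketch, not the paper's algorithm or its horizontal-layer architecture: it assumes an LTI system dx/dt = Ax + Bu, y = Cx + Du with hypothetical matrices, reads the parameters of a continuous-time hidden layer directly off (A, B, C, D) instead of training them, and integrates the resulting neuron ODEs.

```python
# Hedged sketch: illustrates constructing continuous-time network parameters
# directly from an LTI system (A, B, C, D) rather than learning them from data.
# All matrices and the step input are illustrative assumptions, not from the paper.
import numpy as np
from scipy.integrate import solve_ivp

# A small stable LTI system (damped oscillator), chosen arbitrarily for the demo.
A = np.array([[0.0, 1.0], [-2.0, -0.5]])
B = np.array([[0.0], [1.0]])
C = np.array([[1.0, 0.0]])
D = np.array([[0.0]])

def u(t):
    """Step input, used only to drive the example."""
    return np.array([1.0])

# "Continuous-time neurons": each hidden state z_i obeys a first-order ODE.
# The weights are copied directly from (A, B, C, D), i.e., a gradient-free,
# construction-based parameterization with no training loop.
W_hidden, W_input = A, B      # recurrent and input weights of the hidden layer
W_readout, W_skip = C, D      # linear readout

def neuron_dynamics(t, z):
    return W_hidden @ z + (W_input @ u(t)).ravel()

t_span = (0.0, 10.0)
t_eval = np.linspace(*t_span, 200)
sol = solve_ivp(neuron_dynamics, t_span, y0=np.zeros(2), t_eval=t_eval)

# Network output y(t) = W_readout z(t) + W_skip u(t); by construction it matches
# the reference LTI response up to ODE-solver error, without any optimization.
y = (W_readout @ sol.y).ravel() + np.array([(W_skip @ u(t)).item() for t in t_eval])
print("output at final time:", y[-1])
```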
Learning dynamical systems that respect physical symmetries and constraints remains a fundamental challenge in data-driven modeling. Integrating physical laws with graph neural networks facilitates principled modeling of complex N-body dynamics and yields accurate and permutation-invariant models. However, training graph neural networks with iterative, gradient-based optimization algorithms (e.g., Adam, RMSProp, LBFGS) is often slow, especially for large, complex systems. Comparing against 15 different optimizers, we demonstrate that Hamiltonian Graph Networks (HGN) can be trained up to 600x faster, with comparable accuracy, by replacing iterative optimization with random feature-based parameter construction. We show robust performance in diverse simulations, including N-body mass-spring systems in up to 3 dimensions with different geometries, while retaining essential physical invariances with respect to permutation, rotation, and translation. We reveal that even when trained on minimal 8-node systems, the model can generalize in a zero-shot manner to systems as large as 4096 nodes without retraining. Our work challenges the dominance of iterative gradient-descent-based optimization algorithms for training neural network models of physical systems.
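For readers unfamiliar with random feature-based parameter construction, here is a minimal generic sketch of that idea, under the assumption of a plain feed-forward regression setting: hidden-layer weights are sampled once and frozen, and only a linear readout is obtained by a direct least-squares solve. It is not the HGN construction from the talk; the graph structure, Hamiltonian form, and physical invariances are omitted.

```python
# Hedged sketch of the generic random-feature idea: hidden parameters are
# sampled (not optimized) and only a linear readout is fitted by a linear solve.
# Data, sizes, and the target function are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)

# Toy regression data standing in for (state -> time-derivative) pairs.
X = rng.uniform(-1.0, 1.0, size=(500, 4))      # e.g., positions/momenta of a tiny system
y = np.sin(X @ rng.normal(size=(4, 2)))        # surrogate target dynamics

# Random features: weights and biases are drawn once and never updated.
n_features = 300
W = rng.normal(size=(4, n_features))
b = rng.uniform(0.0, 2.0 * np.pi, size=n_features)
Phi = np.tanh(X @ W + b)                       # fixed nonlinear feature map

# The only "training" is a ridge-regression solve for the readout weights,
# replacing the iterative gradient-descent loop entirely.
lam = 1e-6
beta = np.linalg.solve(Phi.T @ Phi + lam * np.eye(n_features), Phi.T @ y)

pred = Phi @ beta
print("train MSE:", np.mean((pred - y) ** 2))
```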
Physics-enhanced Machine Learning