Research Group Matthias Althoff

Matthias Althoff

Prof. Dr.

Principal Investigator

Cyber Physical Systems

Matthias Althoff is Professor of Cyber Physical Systems at TU Munich.

His research interests lie in systems whose computations are closely connected with their physical behavior. Referred to as cyber-physical systems, these systems require an integrated approach applying methods from computer science and engineering. Examples of such systems are autonomous vehicles, smart grids, intelligent production systems, and medical robotics. His research primarily focuses on formal methods for guaranteeing safety and correct operation, as well as on the model-based design of cyber-physical systems.

Team members @MCML

Michael Eichelbeck

Cyber Physical Systems

Philipp Gassert

Cyber Physical Systems

Hanna Krasowski

Dr.

Cyber Physical Systems

Jonathan Külz

Cyber Physical Systems

Tobias Ladner

Cyber Physical Systems

Laura Lützow

Cyber Physical Systems

Publications @MCML

[7]
R. Stolz, H. Krasowski, J. Thumm, M. Eichelbeck, P. Gassert and M. Althoff.
Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action Masking.
38th Conference on Neural Information Processing Systems (NeurIPS 2024). Vancouver, Canada, Dec 10-15, 2024. To be published. Preprint at arXiv.
Abstract

Continuous action spaces in reinforcement learning (RL) are commonly defined as multidimensional intervals. While intervals usually reflect the action boundaries for tasks well, they can be challenging for learning because the typically large global action space leads to frequent exploration of irrelevant actions. Yet, little task knowledge can be sufficient to identify significantly smaller state-specific sets of relevant actions. Focusing learning on these relevant actions can significantly improve training efficiency and effectiveness. In this paper, we propose to focus learning on the set of relevant actions and introduce three continuous action masking methods for exactly mapping the action space to the state-dependent set of relevant actions. Thus, our methods ensure that only relevant actions are executed, enhancing the predictability of the RL agent and enabling its use in safety-critical applications. We further derive the implications of the proposed methods on the policy gradient. Using proximal policy optimization (PPO), we evaluate our methods on four control tasks, where the relevant action set is computed based on the system dynamics and a relevant state set. Our experiments show that the three action masking methods achieve higher final rewards and converge faster than the baseline without action masking.
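
The exact mapping from the global action space to the relevant set can be illustrated with a minimal sketch (hypothetical function and variable names; the paper's three masking methods differ in detail): an element-wise affine rescaling of the policy's normalized action into a state-dependent interval of relevant actions.

```python
import numpy as np

def mask_action(raw_action, relevant_low, relevant_high):
    """Map a raw policy action from the global box [-1, 1]^n onto a
    state-dependent interval of relevant actions (element-wise affine map).
    A simplified illustration, not the paper's exact construction."""
    raw_action = np.clip(np.asarray(raw_action, dtype=float), -1.0, 1.0)
    return relevant_low + 0.5 * (raw_action + 1.0) * (relevant_high - relevant_low)
```

For example, with a relevant interval [2, 4], the normalized midpoint action 0.0 maps to 3.0, so the agent can only ever execute actions inside the relevant set.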

MCML Authors
Hanna Krasowski

Dr.

Cyber Physical Systems

Michael Eichelbeck

Cyber Physical Systems

Philipp Gassert

Cyber Physical Systems

Matthias Althoff

Prof. Dr.

Cyber Physical Systems


[6]
P. Gassert and M. Althoff.
Stepping Out of the Shadows: Reinforcement Learning in Shadow Mode.
Preprint at arXiv (Oct. 2024).
Abstract

Reinforcement learning (RL) is not yet competitive for many cyber-physical systems, such as robotics, process automation, and power systems, as training on a system with physical components cannot be accelerated, and simulation models do not exist or suffer from a large simulation-to-reality gap. During the long training time, expensive equipment cannot be used and might even be damaged due to inappropriate actions of the reinforcement learning agent. Our novel approach addresses exactly this problem: We train the reinforcement learning agent in a so-called shadow mode with the assistance of an existing conventional controller, which does not have to be trained and instantaneously performs reasonably well. In shadow mode, the agent relies on the controller to provide action samples and guidance towards favourable states to learn the task, while simultaneously estimating for which states the learned agent will receive a higher reward than the conventional controller. The RL agent then controls the system in these states, while all other regions remain under the control of the existing controller. Over time, the RL agent takes over for an increasing number of states, while leaving control to the baseline where it cannot surpass its performance. Thus, we keep regret during training low and improve the performance compared to only using conventional controllers or reinforcement learning. We present and evaluate two mechanisms for deciding whether to use the RL agent or the conventional controller. The usefulness of our approach is demonstrated for a reach-avoid task, for which we are able to effectively train an agent, where standard approaches fail.
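
The switching rule described in the abstract can be sketched as a simple value-comparison dispatch (all names here are hypothetical; the paper evaluates two concrete mechanisms for this decision):

```python
def shadow_mode_step(state, rl_policy, baseline_controller, rl_value, baseline_value):
    """Return the RL agent's action only in states where its estimated
    return exceeds the baseline controller's; otherwise defer to the
    conventional controller. A sketch of one possible switching mechanism."""
    if rl_value(state) > baseline_value(state):
        return rl_policy(state)
    return baseline_controller(state)
```

As the RL agent's value estimates improve during training, the comparison naturally hands over more of the state space to the agent while the baseline retains control everywhere else, keeping regret low.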

MCML Authors
Philipp Gassert

Cyber Physical Systems

Matthias Althoff

Prof. Dr.

Cyber Physical Systems


[5]
D. Ostermeier, J. Külz and M. Althoff.
Automatic Geometric Decomposition for Analytical Inverse Kinematics.
Preprint at arXiv (Sep. 2024).
Abstract

Calculating the inverse kinematics (IK) is fundamental for motion planning in robotics. Compared to numerical or learning-based approaches, analytical IK provides higher efficiency and accuracy. However, existing analytical approaches require manual intervention, are ill-conditioned, or rely on time-consuming symbolic manipulation. In this paper, we propose a fast and stable method that enables automatic online derivation and computation of analytical inverse kinematics. Our approach is based on remodeling the kinematic chain of a manipulator to automatically decompose its IK into pre-solved geometric subproblems. We exploit intersecting and parallel joint axes to assign a given manipulator to a certain kinematic class and the corresponding subproblem decomposition. In numerical experiments, we demonstrate that our decomposition is orders of magnitude faster in deriving the IK than existing tools that employ symbolic manipulation. Following this one-time derivation, our method matches and even surpasses baselines, such as IKFast, in terms of speed and accuracy during the online computation of explicit IK solutions. Finally, we provide a C++ toolbox with Python wrappers that, for the first time, enables plug-and-play analytical IK within less than a millisecond.
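
To give a flavor of the pre-solved geometric subproblems such decompositions reduce to, here is the classic closed-form solution for a planar two-link arm via the law of cosines (an illustrative textbook subproblem, not the paper's decomposition):

```python
import math

def planar_2r_ik(x, y, l1, l2):
    """Closed-form IK for a planar two-link arm with link lengths l1, l2.
    Returns the elbow-down and elbow-up joint-angle solutions (q1, q2),
    or an empty list if the target (x, y) is out of reach."""
    c2 = (x * x + y * y - l1 * l1 - l2 * l2) / (2.0 * l1 * l2)
    if abs(c2) > 1.0:
        return []  # target outside the reachable workspace
    solutions = []
    for sign in (1.0, -1.0):  # the two elbow configurations
        q2 = sign * math.acos(c2)
        q1 = math.atan2(y, x) - math.atan2(l2 * math.sin(q2),
                                           l1 + l2 * math.cos(q2))
        solutions.append((q1, q2))
    return solutions
```

Because every branch is an explicit formula, all solutions are enumerated exactly and in constant time, which is what makes analytical IK so much faster and more accurate than iterative numerical solvers.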

MCML Authors
Jonathan Külz

Cyber Physical Systems

Matthias Althoff

Prof. Dr.

Cyber Physical Systems


[4]
H. Krasowski and M. Althoff.
Provable Traffic Rule Compliance in Safe Reinforcement Learning on the Open Sea.
IEEE Transactions on Intelligent Vehicles, Early Access (May 2024).
Abstract

For safe operation, autonomous vehicles have to obey traffic rules that are set forth in legal documents formulated in natural language. Temporal logic is a suitable concept to formalize such traffic rules. Still, temporal logic rules often result in constraints that are hard to solve using optimization-based motion planners. Reinforcement learning (RL) is a promising method to find motion plans for autonomous vehicles. However, vanilla RL algorithms are based on random exploration and do not automatically comply with traffic rules. Our approach accomplishes guaranteed rule-compliance by integrating temporal logic specifications into RL. Specifically, we consider the application of vessels on the open sea, which must adhere to the Convention on the International Regulations for Preventing Collisions at Sea (COLREGS). To efficiently synthesize rule-compliant actions, we combine predicates based on set-based prediction with a statechart representing our formalized rules and their priorities. Action masking then restricts the RL agent to this set of verified rule-compliant actions. In numerical evaluations on critical maritime traffic situations, our agent always complies with the formalized legal rules and never collides while achieving a high goal-reaching rate during training and deployment. In contrast, vanilla and traffic rule-informed RL agents frequently violate traffic rules and collide even after training.
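
The final masking step can be illustrated for a discrete action set (hypothetical names; in the paper, the verified set is produced by set-based prediction combined with the rule statechart):

```python
import numpy as np

def rule_compliant_action(q_values, verified_mask):
    """Restrict action selection to verified rule-compliant actions by
    assigning -inf to the scores of all other actions. A minimal sketch
    of discrete action masking."""
    if not np.any(verified_mask):
        raise ValueError("no verified rule-compliant action available")
    masked = np.where(verified_mask, q_values, -np.inf)
    return int(np.argmax(masked))
```

Even if the unmasked agent would prefer a rule-violating action, only verified actions can ever be executed, which is what yields the compliance guarantee during both training and deployment.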

MCML Authors
Hanna Krasowski

Dr.

Cyber Physical Systems

Matthias Althoff

Prof. Dr.

Cyber Physical Systems


[3]
H. Krasowski.
Guaranteeing Complex Safety Specifications for Autonomous Vehicles via Reinforcement Learning with Formal Methods.
Dissertation, 2024.
Abstract

Reinforcement learning (RL) solves complicated motion planning tasks for autonomous vehicles. Current RL methods lack safety guarantees. This dissertation combines RL with formal methods that verify safety specifications so that only verified actions are executed. The safe RL approaches are developed for autonomous vehicles and their complex safety specifications. The evaluation confirms the safety guarantees and real-time capability.

MCML Authors
Hanna Krasowski

Dr.

Cyber Physical Systems


[2]
J. Külz, M. Mayer and M. Althoff.
Timor Python: A Toolbox for Industrial Modular Robotics.
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2023). Detroit, MI, USA, Oct 01-05, 2023.
Abstract

Modular Reconfigurable Robots (MRRs) represent an exciting path forward for industrial robotics, opening up new possibilities for robot design. Compared to monolithic manipulators, they promise greater flexibility, improved maintainability, and cost-efficiency. However, there is no tool or standardized way to model and simulate assemblies of modules in the same way it has been done for robotic manipulators for decades. We introduce the Toolbox for Industrial Modular Robotics (Timor), a Python toolbox to bridge this gap and integrate modular robotics into existing simulation and optimization pipelines. Our open-source library offers model generation and task-based configuration optimization for MRRs. It can easily be integrated with existing simulation tools - not least by offering URDF export of arbitrary modular robot assemblies. Moreover, our experimental study demonstrates the effectiveness of Timor as a tool for designing modular robots optimized for specific use cases.

MCML Authors
Jonathan Külz

Cyber Physical Systems

Matthias Althoff

Prof. Dr.

Cyber Physical Systems


[1]
T. Ladner and M. Althoff.
Automatic Abstraction Refinement in Neural Network Verification Using Sensitivity Analysis.
26th ACM International Conference on Hybrid Systems: Computation and Control (HSCC 2023). San Antonio, TX, USA, May 09-12, 2023.
Abstract

The formal verification of neural networks is essential for their application in safety-critical environments. However, the set-based verification of neural networks using linear approximations often obtains overly conservative results, while nonlinear approximations quickly become computationally infeasible in deep neural networks. We address this issue for the first time by automatically balancing between precision and computation time without splitting the propagated set. Our work introduces a novel automatic abstraction refinement approach using sensitivity analysis to iteratively reduce the abstraction error at the neuron level until either the specifications are met or a maximum number of iterations is reached. Our evaluation shows that we can tightly over-approximate the output sets of deep neural networks and that our approach is up to a thousand times faster than a naive approach. We further demonstrate the applicability of our approach in closed-loop settings.
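
The coarse set-based enclosure that such verifiers start from can be sketched with plain interval propagation (a simplified illustration; the paper's method uses tighter set representations and refines the abstraction per neuron based on sensitivity):

```python
import numpy as np

def affine_bounds(lo, hi, W, b):
    """Propagate an interval box [lo, hi] through an affine layer y = Wx + b
    using the center-radius form of interval arithmetic."""
    center = 0.5 * (lo + hi)
    radius = 0.5 * (hi - lo)
    c = W @ center + b
    r = np.abs(W) @ radius
    return c - r, c + r

def relu_bounds(lo, hi):
    """Interval abstraction of ReLU. A refinement loop would tighten exactly
    those neurons whose abstraction error the sensitivity analysis ranks
    highest, until the specification is met."""
    return np.maximum(lo, 0.0), np.maximum(hi, 0.0)
```

Alternating these two steps layer by layer yields a sound over-approximation of the network's output set; the contribution of the paper is to refine this abstraction only where it pays off, balancing precision against computation time.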

MCML Authors
Tobias Ladner

Cyber Physical Systems

Matthias Althoff

Prof. Dr.

Cyber Physical Systems