Home | Research | Groups | Gitta Kutyniok

Research Group Gitta Kutyniok

Gitta Kutyniok

Prof. Dr.

Principal Investigator

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

holds the Bavarian AI Chair for Mathematical Foundations of Artificial Intelligence at LMU Munich.

The chair’s research focus on the intersection of mathematics and artificial intelligence, aiming for both a mathematical understanding of artificial intelligence and artificial intelligence for mathematical problems.

Team members @MCML

PostDocs

Juan Esteban Suarez

Dr.

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

PhD Students

Christopher Bülte

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Vit Fojtik

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Adalbert Fono

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Sohir Maskey

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Maria Matveev

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Raffaele Paolino

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Philipp Scholl

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Manjot Singh

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Jonas von Berg

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Publications @MCML

2025

[47]

J. von Berg, A. Fono, M. Datres, S. Maskey and G. Kutyniok.
The Price of Robustness: Stable Classifiers Need Overparameterization.
HiLD @ICML 2025 - Workshop on High-dimensional Learning Dynamics at the 42nd International Conference on Machine Learning (ICML 2025). Vancouver, Canada, Jul 13-19, 2025. To be published. Preprint available. URL

Abstract

In this work, we show that class stability, the expected distance of an input to the decision boundary, captures what classical capacity measures, such as weight norms, fail to explain. We prove a generalization bound that improves inversely with the class stability, interpreted as a quantifiable notion of robustness. As a corollary, we derive a law of robustness for classification: any interpolating model with parameters must be unstable, so high stability requires significant overparameterization. Crucially, our results extend beyond smoothness assumptions and apply to discontinuous classifiers. Preliminary experiments support our theory: empirical stability increases with model size, while norm-based measures remain uninformative.

MCML Authors

Jonas von Berg

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Adalbert Fono

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Sohir Maskey

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[46]

D. A. Nguyen, E. Araya, A. Fono and G. Kutyniok.
Time to Spike? Understanding the Representational Power of Spiking Neural Networks in Discrete Time.
ICML 2025 - 42nd International Conference on Machine Learning. Vancouver, Canada, Jul 13-19, 2025. To be published. Preprint available. arXiv URL

Abstract

Recent years have seen significant progress in developing spiking neural networks (SNNs) as a potential solution to the energy challenges posed by conventional artificial neural networks (ANNs). However, our theoretical understanding of SNNs remains relatively limited compared to the ever-growing body of literature on ANNs. In this paper, we study a discrete-time model of SNNs based on leaky integrate-and-fire (LIF) neurons, referred to as discrete-time LIF-SNNs, a widely used framework that still lacks solid theoretical foundations. We demonstrate that discrete-time LIF-SNNs with static inputs and outputs realize piecewise constant functions defined on polyhedral regions, and more importantly, we quantify the network size required to approximate continuous functions. Moreover, we investigate the impact of latency (number of time steps) and depth (number of layers) on the complexity of the input space partitioning induced by discrete-time LIF-SNNs. Our analysis highlights the importance of latency and contrasts these networks with ANNs employing piecewise linear activation functions. Finally, we present numerical experiments to support our theoretical findings.

MCML Authors

Adalbert Fono

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[45]

J. Li and G. Kutyniok.
Expressivity of deep neural networks.
Preprint (Jul. 2025). PDF

Abstract

This chapter focuses on the approximation theory of deep ReLU neural networks, analyzing their ability to approximate various target functions with different network architectures. We begin by introducing the universal approximation theory of deep neural networks, stating that given enough neurons, neural networks can approximate general functions. We then delve into the fundamental properties of ReLU neural networks and explore the role of width and depth of neural networks, highlighting that increasing layers could be more effective than increasing width in improving approximation accuracy. Next, we discuss the approximation rates for Sobolev functions using fully connected and convolutional neural networks. To alleviate the curse of dimensionality, we further consider Korobov functions. Finally, we focus on the approximation properties of self-attention and transformers, which have become increasingly important in modern deep learning. These results shed light on the expressivity and reliability of deep learning models, providing valuable insights into networks’ behavior and performance.

MCML Authors

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[44]

H. Boche, A. Fono and G. Kutyniok.
Mathematical Algorithm Design for Deep Learning under Societal and Judicial Constraints: The Algorithmic Transparency Requirement.
Applied and Computational Harmonic Analysis 77.101763 (Jun. 2025). DOI

Abstract

Deep learning still has drawbacks in terms of trustworthiness, which describes a comprehensible, fair, safe, and reliable method. To mitigate the potential risk of AI, clear obligations associated to trustworthiness have been proposed via regulatory guidelines, e.g., in the European AI Act. Therefore, a central question is to what extent trustworthy deep learning can be realized. Establishing the described properties constituting trustworthiness requires that the factors influencing an algorithmic computation can be retraced, i.e., the algorithmic implementation is transparent. Motivated by the observation that the current evolution of deep learning models necessitates a change in computing technology, we derive a mathematical framework which enables us to analyze whether a transparent implementation in a computing model is feasible. We exemplarily apply our trustworthiness framework to analyze deep learning approaches for inverse problems in digital and analog computing models represented by Turing and Blum-Shub-Smale Machines, respectively. Based on previous results, we find that Blum-Shub-Smale Machines have the potential to establish trustworthy solvers for inverse problems under fairly general conditions, whereas Turing machines cannot guarantee trustworthiness to the same degree.

MCML Authors

Adalbert Fono

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[43]

S. Maskey, G. Kutyniok and R. Levie.
Generalization Bounds for Message Passing Networks on Mixture of Graphons.
SIAM Journal on Mathematics of Data Science 7.2 (Jun. 2025). DOI

Abstract

We study the generalization capabilities of Message Passing Neural Networks (MPNNs), a prevalent class of Graph Neural Networks (GNN). We derive generalization bounds specifically for MPNNs with normalized sum aggregation and mean aggregation. Our analysis is based on a data generation model incorporating a finite set of template graphons. Each graph within this framework is generated by sampling from one of the graphons with a certain degree of perturbation. In particular, we extend previous MPNN generalization results to a more realistic setting, which includes the following modifications: 1) we analyze simple random graphs with Bernoulli-distributed edges instead of weighted graphs; 2) we sample both graphs and graph signals from perturbed graphons instead of clean graphons; and 3) we analyze sparse graphs instead of dense graphs. In this more realistic and challenging scenario, we provide a generalization bound that decreases as the average number of nodes in the graphs increases. Our results imply that MPNNs with higher complexity than the size of the training set can still generalize effectively, as long as the graphs are sufficiently large.

MCML Authors

Sohir Maskey

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[42]

H. Boche, V. Fojtik, A. Fono and G. Kutyniok.
Computability of Classification and Deep Learning: From Theoretical Limits to Practical Feasibility through Quantization.
Journal of Fourier Analysis and Applications 31.35 (May. 2025). DOI

Abstract

The unwavering success of deep learning in the past decade led to the increasing prevalence of deep learning methods in various application fields. However, the downsides of deep learning, most prominently its lack of trustworthiness, may not be compatible with safety-critical or high-responsibility applications requiring stricter performance guarantees. Recently, several instances of deep learning applications have been shown to be subject to theoretical limitations of computability, undermining the feasibility of performance guarantees when employed on real-world computers. We extend the findings by studying computability in the deep learning framework from two perspectives: From an application viewpoint in the context of classification problems and a general limitation viewpoint in the context of training neural networks. In particular, we show restrictions on the algorithmic solvability of classification problems that also render the algorithmic detection of failure in computations in a general setting infeasible. Subsequently, we prove algorithmic limitations in training deep neural networks even in cases where the underlying problem is well-behaved. Finally, we end with a positive observation, showing that in quantized versions of classification and deep network training, computability restrictions do not arise or can be overcome to a certain degree.

MCML Authors

Vit Fojtik

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Adalbert Fono

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[41]

V. Fojtik, M. Matveev, H.-H. Chou, G. Kutyniok and J. Maly.
Conflicting Biases at the Edge of Stability: Norm versus Sharpness Regularization.
Preprint (May. 2025). arXiv

Abstract

A widely believed explanation for the remarkable generalization capacities of overparameterized neural networks is that the optimization algorithms used for training induce an implicit bias towards benign solutions. To grasp this theoretically, recent works examine gradient descent and its variants in simplified training settings, often assuming vanishing learning rates. These studies reveal various forms of implicit regularization, such as ℓ1-norm minimizing parameters in regression and max-margin solutions in classification. Concurrently, empirical findings show that moderate to large learning rates exceeding standard stability thresholds lead to faster, albeit oscillatory, convergence in the so-called Edge-of-Stability regime, and induce an implicit bias towards minima of low sharpness (norm of training loss Hessian). In this work, we argue that a comprehensive understanding of the generalization performance of gradient descent requires analyzing the interaction between these various forms of implicit regularization. We empirically demonstrate that the learning rate balances between low parameter norm and low sharpness of the trained model. We furthermore prove for diagonal linear networks trained on a simple regression task that neither implicit bias alone minimizes the generalization error. These findings demonstrate that focusing on a single implicit bias is insufficient to explain good generalization, and they motivate a broader view of implicit regularization that captures the dynamic trade-off between norm and sharpness induced by non-negligible learning rates.

MCML Authors

Vit Fojtik

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Maria Matveev

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

Johannes Maly

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Data Science and Artificial Intelligence

[40]

S. Maskey, R. Paolino, F. Jogl, G. Kutyniok and J. Lutzeyer.
Graph Representational Learning: When Does More Expressivity Hurt Generalization?
Preprint (May. 2025). arXiv

Abstract

Graph Neural Networks (GNNs) are powerful tools for learning on structured data, yet the relationship between their expressivity and predictive performance remains unclear. We introduce a family of premetrics that capture different degrees of structural similarity between graphs and relate these similarities to generalization, and consequently, the performance of expressive GNNs. By considering a setting where graph labels are correlated with structural features, we derive generalization bounds that depend on the distance between training and test graphs, model complexity, and training set size. These bounds reveal that more expressive GNNs may generalize worse unless their increased complexity is balanced by a sufficiently large training set or reduced distance between training and test graphs. Our findings relate expressivity and generalization, offering theoretical insights supported by empirical results.

MCML Authors

Sohir Maskey

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Raffaele Paolino

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[39]

P. Scholl, A. Dietrich, S. Wolf, J. Lee, A.-A. Schäffer, G. Kutyniok and M. Iskandar.
Interpretable Robotic Friction Learning via Symbolic Regression.
Preprint (May. 2025). arXiv

Abstract

Accurately modeling the friction torque in robotic joints has long been challenging due to the request for a robust mathematical description. Traditional model-based approaches are often labor-intensive, requiring extensive experiments and expert knowledge, and they are difficult to adapt to new scenarios and dependencies. On the other hand, data-driven methods based on neural networks are easier to implement but often lack robustness, interpretability, and trustworthiness–key considerations for robotic hardware and safety-critical applications such as human-robot interaction. To address the limitations of both approaches, we propose the use of symbolic regression (SR) to estimate the friction torque. SR generates interpretable symbolic formulas similar to those produced by model-based methods while being flexible to accommodate various dynamic effects and dependencies. In this work, we apply SR algorithms to approximate the friction torque using collected data from a KUKA LWR-IV+ robot. Our results show that SR not only yields formulas with comparable complexity to model-based approaches but also achieves higher accuracy. Moreover, SR-derived formulas can be seamlessly extended to include load dependencies and other dynamic factors.

MCML Authors

Philipp Scholl

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[38]

C. Bülte, S. Maskey, P. Scholl, J. von Berg and G. Kutyniok.
Graph Neural Networks for Enhancing Ensemble Forecasts of Extreme Rainfall.
Climate Change AI @ICLR 2025 - Workshop on Tackling Climate Change with Machine Learning at the 13th International Conference on Learning Representations (ICLR 2025). Singapore, Apr 24-28, 2025. To be published. Preprint available. arXiv

Abstract

Climate change is increasing the occurrence of extreme precipitation events, threatening infrastructure, agriculture, and public safety. Ensemble prediction systems provide probabilistic forecasts but exhibit biases and difficulties in capturing extreme weather. While post-processing techniques aim to enhance forecast accuracy, they rarely focus on precipitation, which exhibits complex spatial dependencies and tail behavior. Our novel framework leverages graph neural networks to post-process ensemble forecasts, specifically modeling the extremes of the underlying distribution. This allows to capture spatial dependencies and improves forecast accuracy for extreme events, thus leading to more reliable forecasts and mitigating risks of extreme precipitation and flooding.

MCML Authors

Christopher Bülte

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Sohir Maskey

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Philipp Scholl

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Jonas von Berg

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[37]

P. Scholl, K. Bieker, H. Hauger and G. Kutyniok.
ParFam -- (Neural Guided) Symbolic Regression Based on Continuous Global Optimization.
ICLR 2025 - 13th International Conference on Learning Representations. Singapore, Apr 24-28, 2025. To be published. Preprint available. arXiv GitHub

Abstract

The problem of symbolic regression (SR) arises in many different applications, such as identifying physical laws or deriving mathematical equations describing the behavior of financial markets from given data. Various methods exist to address the problem of SR, often based on genetic programming. However, these methods are usually complicated and involve various hyperparameters. In this paper, we present our new approach ParFam that utilizes parametric families of suitable symbolic functions to translate the discrete symbolic regression problem into a continuous one, resulting in a more straightforward setup compared to current state-of-the-art methods. In combination with a global optimizer, this approach results in a highly effective method to tackle the problem of SR. We theoretically analyze the expressivity of ParFam and demonstrate its performance with extensive numerical experiments based on the common SR benchmark suit SRBench, showing that we achieve state-of-the-art results. Moreover, we present an extension incorporating a pre-trained transformer network DL-ParFam to guide ParFam, accelerating the optimization process by up to two magnitudes.

MCML Authors

Philipp Scholl

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[36]

H. Hauger, P. Scholl and G. Kutyniok.
Robust identifiability for symbolic recovery of differential equations.
ICASSP 2025 - IEEE International Conference on Acoustics, Speech and Signal Processing. Hyderabad, India, Apr 06-11, 2025. DOI

Abstract

Recent advancements in machine learning have transformed the discovery of physical laws, moving from manual derivation to data-driven methods that simultaneously learn both the structure and parameters of governing equations. This shift introduces new challenges regarding the validity of the discovered equations, particularly concerning their uniqueness and, hence, identifiability. While the issue of non-uniqueness has been well-studied in the context of parameter estimation, it remains underexplored for algorithms that recover both structure and parameters simultaneously. Early studies have primarily focused on idealized scenarios with perfect, noise-free data. In contrast, this paper investigates how noise influences the uniqueness and identifiability of physical laws governed by partial differential equations (PDEs). We develop a comprehensive mathematical framework to analyze the uniqueness of PDEs in the presence of noise and introduce new algorithms that account for noise, providing thresholds to assess uniqueness and identifying situations where excessive noise hinders reliable conclusions. Numerical experiments demonstrate the effectiveness of these algorithms in detecting uniqueness despite the presence of noise.

MCML Authors

Philipp Scholl

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[35]

G. Kutyniok.
How Can Reliability of Artificial Intelligence Be Ensured?
Harvard Data Science Review 7.2 (Apr. 2025). DOI

Abstract

Column Editor’s Note: Artificial intelligence (AI) is having a profound impact across many areas of science and society. However, there remain important gaps in our understanding of the deep neural networks that underpin these developments, and in many cases AI models lack robustness and reliability. In this Diving into Data column, Professor Kutyniok explores these issues from a mathematical perspective, highlighting open theoretical questions that will need to be resolved in order to develop AIs that are truly reliable, generalizable, and trustworthy.

MCML Authors

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[34]

C. Bülte, Y. Sale, T. Löhr, P. Hofman, G. Kutyniok and E. Hüllermeier.
An Axiomatic Assessment of Entropy- and Variance-based Uncertainty Quantification in Regression.
Preprint (Apr. 2025). arXiv

Abstract

Uncertainty quantification (UQ) is crucial in machine learning, yet most (axiomatic) studies of uncertainty measures focus on classification, leaving a gap in regression settings with limited formal justification and evaluations. In this work, we introduce a set of axioms to rigorously assess measures of aleatoric, epistemic, and total uncertainty in supervised regression. By utilizing a predictive exponential family, we can generalize commonly used approaches for uncertainty representation and corresponding uncertainty measures. More specifically, we analyze the widely used entropy- and variance-based measures regarding limitations and challenges. Our findings provide a principled foundation for UQ in regression, offering theoretical insights and practical guidelines for reliable uncertainty assessment.

MCML Authors

Christopher Bülte

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Yusuf Sale

A3 | Computational Models
→ Group Eyke Hüllermeier

Artificial Intelligence and Machine Learning

Paul Hofman

A3 | Computational Models
→ Group Eyke Hüllermeier

Artificial Intelligence and Machine Learning

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

Eyke Hüllermeier

Prof. Dr.

A3 | Computational Models

Artificial Intelligence and Machine Learning

[33]

C. Bülte, P. Scholl and G. Kutyniok.
Probabilistic neural operators for functional uncertainty quantification.
Transactions on Machine Learning Research (Mar. 2025). URL

Abstract

Neural operators aim to approximate the solution operator of a system of differential equations purely from data. They have shown immense success in modeling complex dynamical systems across various domains. However, the occurrence of uncertainties inherent in both model and data has so far rarely been taken into accounttextemdash{}a critical limitation in complex, chaotic systems such as weather forecasting. In this paper, we introduce the probabilistic neural operator (PNO), a framework for learning probability distributions over the output function space of neural operators. PNO extends neural operators with generative modeling based on strictly proper scoring rules, integrating uncertainty information directly into the training process. We provide a theoretical justification for the approach and demonstrate improved performance in quantifying uncertainty across different domains and with respect to different baselines. Furthermore, PNO requires minimal adjustment to existing architectures, shows improved performance for most probabilistic prediction tasks, and leads to well-calibrated predictive distributions and adequate uncertainty representations even for long dynamical trajectories. Implementing our approach into large-scale models for physical applications can lead to improvements in corresponding uncertainty quantification and extreme event identification, ultimately leading to a deeper understanding of the prediction of such surrogate models.

MCML Authors

Christopher Bülte

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Philipp Scholl

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[32]

A. Fono, M. Singh, E. Araya, P. C. Petersen, H. Boche and G. Kutyniok.
Sustainable AI: Mathematical Foundations of Spiking Neural Networks.
Preprint (Mar. 2025). arXiv

Abstract

Deep learning’s success comes with growing energy demands, raising concerns about the long-term sustainability of the field. Spiking neural networks, inspired by biological neurons, offer a promising alternative with potential computational and energy-efficiency gains. This article examines the computational properties of spiking networks through the lens of learning theory, focusing on expressivity, training, and generalization, as well as energy-efficient implementations while comparing them to artificial neural networks. By categorizing spiking models based on time representation and information encoding, we highlight their strengths, challenges, and potential as an alternative computational paradigm.

MCML Authors

Adalbert Fono

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Manjot Singh

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

2024

[31]

K. Bieker, H. T. Kussaba, P. Scholl, J. Jung, A. Swikir, S. Haddadin and G. Kutyniok.
Compositional Construction of Barrier Functions for Switched Impulsive Systems.
CDC 2024 - 63rd IEEE Conference on Decision and Control. Milan, Italy, Dec 16-19, 2024. DOI

Abstract

Many systems occurring in real-world applications, such as controlling the motions of robots or modeling the spread of diseases, are switched impulsive systems. To ensure that the system state stays in a safe region (e.g., to avoid collisions with obstacles), barrier functions are widely utilized. As the system dimension increases, deriving suitable barrier functions becomes extremely complex. Fortunately, many systems consist of multiple subsystems, such as different areas where the disease occurs. In this work, we present sufficient conditions for interconnected switched impulsive systems to maintain safety by constructing local barrier functions for the individual subsystems instead of a global one, allowing for much easier and more efficient derivation. To validate our results, we numerically demonstrate its effectiveness using an epidemiological model.

MCML Authors

Philipp Scholl

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[30]

C. Bülte, P. Scholl and G. Kutyniok.
Probabilistic predictions with Fourier neural operators.
BDU @NeurIPS 2024 - Workshop Bayesian Decision-making and Uncertainty: from probabilistic and spatiotemporal modeling to sequential experiment design at the 38th Conference on Neural Information Processing Systems (NeurIPS 2024). Vancouver, Canada, Dec 10-15, 2024. URL

Abstract

Neural networks have been successfully applied in modeling partial differential equations, especially in dynamical systems. Commonly used models, such as neural operators, are performing well at deterministic prediction tasks, but lack a quantification of the uncertainty inherent in many complex systems, for example weather forecasting. In this paper, we explore a new approach that combines Fourier neural operators with generative modeling based on strictly proper scoring rules in order to create well-calibrated probabilistic predictions of dynamical systems. We demonstrate improved predictive uncertainty for our approach, especially in settings with very high inherent uncertainty.

MCML Authors

Christopher Bülte

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Philipp Scholl

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[29]

R. Paolino, S. Maskey, P. Welke and G. Kutyniok.
Weisfeiler and Leman Go Loopy: A New Hierarchy for Graph Representational Learning.
NeurIPS 2024 - 38th Conference on Neural Information Processing Systems. Vancouver, Canada, Dec 10-15, 2024. URL GitHub

Abstract

We introduce r-loopy Weisfeiler-Leman (r-ℓWL), a novel hierarchy of graph isomorphism tests and a corresponding GNN framework, r-ℓMPNN, that can count cycles up to length r+2. Most notably, we show that r-ℓWL can count homomorphisms of cactus graphs. This strictly extends classical 1-WL, which can only count homomorphisms of trees and, in fact, is incomparable to k-WL for any fixed k. We empirically validate the expressive and counting power of the proposed r-ℓMPNN on several synthetic datasets and present state-of-the-art predictive performance on various real-world datasets.

MCML Authors

Raffaele Paolino

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Sohir Maskey

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[28]

Y. N. Böck, H. Boche, F. H. P. Fitzek and G. Kutyniok.
Computing-Model and Computing-Hardware Selection for ICT Under Societal and Judicial Constraints.
IEEE Access 12 (Dec. 2024). DOI

Abstract

This article discusses a formalization of aspects of Cyber-Sovereignty (CyS) for information and communication technology (ICT), linking them to technological trustworthiness and deriving an associated paradigm for hard- and software design. The upcoming 6G ICT standard is considered a keystone within modern society’s increasing interconnectedness and automatization, as it provides the necessary technological infrastructure for applications such as the Metaverse or large-scale digital twinning. Since emerging technological systems increasingly affect sensitive human goods, hard- and software manufacturers must consider a new dimension of societal and judicial constraints in the context of technological trustworthiness. This article aims to establish a formalized theory of specific aspects of CyS, providing a paradigm for hard- and software engineering in ICT. This paradigm is directly applicable in formal technology assessment and ensures that the relevant facets of CyS – specifically, the principle of Algorithmic Transparency (AgT) – are satisfied. The framework follows an axiomatic approach. Particularly, the formal basis of our theory consists of four fundamental assumptions about the general nature of physical problems and algorithmic implementations. This formal basis allows for drawing general conclusions on the relation between CyS and technological trustworthiness and entails a formal meta-thesis on AgT in digital computing.

MCML Authors

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[27]

P. Scholl, M. Iskandar, S. Wolf, J. Lee, A. Bacho, A. Dietrich, A. Albu-Schäffer and G. Kutyniok.
Learning-based adaption of robotic friction models.
Robotics and Computer-Integrated Manufacturing 89 (Oct. 2024). DOI

Abstract

In the Fourth Industrial Revolution, wherein artificial intelligence and the automation of machines occupy a central role, the deployment of robots is indispensable. However, the manufacturing process using robots, especially in collaboration with humans, is highly intricate. In particular, modeling the friction torque in robotic joints is a longstanding problem due to the lack of a good mathematical description. This motivates the usage of data-driven methods in recent works. However, model-based and data-driven models often exhibit limitations in their ability to generalize beyond the specific dynamics they were trained on, as we demonstrate in this paper. To address this challenge, we introduce a novel approach based on residual learning, which aims to adapt an existing friction model to new dynamics using as little data as possible. We validate our approach by training a base neural network on a symmetric friction data set to learn an accurate relation between the velocity and the friction torque. Subsequently, to adapt to more complex asymmetric settings, we train a second network on a small dataset, focusing on predicting the residual of the initial network’s output. By combining the output of both networks in a suitable manner, our proposed estimator outperforms the conventional model-based approach, an extended LuGre model, and the base neural network significantly. Furthermore, we evaluate our method on trajectories involving external loads and still observe a substantial improvement, approximately 60%–70%, over the conventional approach. Our method does not rely on data with external load during training, eliminating the need for external torque sensors. This demonstrates the generalization capability of our approach, even with a small amount of data – less than a minute – enabling adaptation to diverse scenarios based on prior knowledge about friction in different settings.

MCML Authors

Philipp Scholl

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[26]

P. Scholl, A. Bacho, H. Boche and G. Kutyniok.
Symbolic Recovery of Differential Equations: The Identifiability Problem.
Preprint (Oct. 2024). arXiv

Abstract

Symbolic recovery of differential equations is the ambitious attempt at automating the derivation of governing equations with the use of machine learning techniques. In contrast to classical methods which assume the structure of the equation to be known and focus on the estimation of specific parameters, these algorithms aim to learn the structure and the parameters simultaneously. While the uniqueness and, therefore, the identifiability of parameters of governing equations are a well-addressed problem in the field of parameter estimation, it has not been investigated for symbolic recovery. However, this problem should be even more present in this field since the algorithms aim to cover larger spaces of governing equations. In this paper, we investigate under which conditions a solution of a differential equation does not uniquely determine the equation itself. For various classes of differential equations, we provide both necessary and sufficient conditions for a function to uniquely determine the corresponding differential equation. We then use our results to devise numerical algorithms aiming to determine whether a function solves a differential equation uniquely. Finally, we provide extensive numerical experiments showing that our algorithms can indeed guarantee the uniqueness of the learned governing differential equation, without assuming any knowledge about the analytic form of function, thereby ensuring the reliability of the learned equation.

MCML Authors

Philipp Scholl

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[25]

Ç. Yapar, R. Levie, G. Kutyniok and G. Caire.
Dataset of Pathloss and ToA Radio Maps With Localization Application.
Preprint (Sep. 2024). arXiv

Abstract

In this article, we present a collection of radio map datasets in dense urban setting, which we generated and made publicly available. The datasets include simulated pathloss/received signal strength (RSS) and time of arrival (ToA) radio maps over a large collection of realistic dense urban setting in real city maps. The two main applications of the presented dataset are 1) learning methods that predict the pathloss from input city maps (namely, deep learning-based simulations), and, 2) wireless localization. The fact that the RSS and ToA maps are computed by the same simulations over the same city maps allows for a fair comparison of the RSS and ToA-based localization methods.

MCML Authors

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[24]

H. Boche, A. Fono and G. Kutyniok.
A Mathematical Framework for Computability Aspects of Algorithmic Transparency.
ISIT 2024 - IEEE International Symposium on Information Theory. Athens, Greece, Jul 07-12, 2024. DOI

Abstract

The lack of trustworthiness is a major downside of deep learning. To mitigate the associated risks clear obligations of deep learning models have been proposed via regulatory guidelines. Therefore, a crucial question is to what extent trustworthy deep learning can be realized. Establishing trust-worthiness requires that the factors influencing an algorithmic computation can be retraced, i.e., the algorithmic implementation is transparent. Motivated by the observation that the current evolution of deep learning models necessitates a change in computing technology, we derive a mathematical framework that enables us to analyze whether a transparent implementation in a given computing model is feasible. We exemplarily apply our trustworthiness framework to analyze deep learning approaches for inverse problems in digital and analog computing models represented by Turing and Blum-Shub-Smale Machines, respectively. Based on previous results, we find that Blum-Shub-Smale Machines have the potential to establish trustworthy solvers for inverse problems under fairly general conditions, whereas, Turing machines cannot guarantee trustworthiness to the same degree. For a longer version of this paper with more details and proofs, we refer to [1].

MCML Authors

Adalbert Fono

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[23]

Ç. Yapar, F. Jaensch, R. Levie, G. Kutyniok and G. Caire.
Overview of the First Pathloss Radio Map Prediction Challenge.
IEEE Open Journal of Signal Processing 5 (Jun. 2024). DOI

Abstract

Pathloss quantifies the reduction in power density of a signal radiated from a transmitter. The attenuation is due to large-scale effects such as free-space propagation loss and interactions (e.g., penetration, reflection, and diffraction) of the signal with objects such as buildings, vehicles, trees, and pedestrians in the propagation environment. Many current or planned wireless communications applications require the knowledge (or a reliable approximation) of the pathloss on a dense grid (radio map) of the environment of interest. Deterministic simulation methods such as ray tracing are known to provide very good estimates of pathloss values. However, their high computational complexity makes them unsuitable for most of the applications envisaged. To promote research and facilitate a fair comparison among the recently proposed fast and accurate deep learning-based pathloss radio map prediction methods, we have organized the ICASSP 2023 First Pathloss Radio Map Prediction Challenge. In this overview paper, we describe the pathloss radio map prediction problem, provide a literature survey of the current state of the art, describe the challenge datasets, the challenge task, and the challenge evaluation methodology. Finally, we provide a brief overview of the submitted methods and present the results of the challenge.

MCML Authors

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[22]

Y. Lee, H. Boche and G. Kutyniok.
Computability of Optimizers.
IEEE Transactions on Information Theory 70.4 (Apr. 2024). DOI

Abstract

Optimization problems are a staple of today’s scientific and technical landscape. However, at present, solvers of such problems are almost exclusively run on digital hardware. Using Turing machines as a mathematical model for any type of digital hardware, in this paper, we analyze fundamental limitations of this conceptual approach of solving optimization problems. Since in most applications, the optimizer itself is of significantly more interest than the optimal value of the corresponding function, we will focus on computability of the optimizer. In fact, we will show that in various situations the optimizer is unattainable on Turing machines and consequently on digital computers. Moreover, even worse, there does not exist a Turing machine, which approximates the optimizer itself up to a certain constant error. We prove such results for a variety of well-known problems from very different areas, including artificial intelligence, financial mathematics, and information theory, often deriving the even stronger result that such problems are not Banach-Mazur computable, also not even in an approximate sense.

MCML Authors

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[21]

B. Lorenz, A. Bacho and G. Kutyniok.
Error Estimation for Physics-informed Neural Networks Approximating Semilinear Wave Equations.
Preprint (Mar. 2024). arXiv

Abstract

This paper provides rigorous error bounds for physics-informed neural networks approximating the semilinear wave equation. We provide bounds for the generalization and training error in terms of the width of the network’s layers and the number of training points for a tanh neural network with two hidden layers. Our main result is a bound of the total error in the H1([0,T];L2(Ω))-norm in terms of the training error and the number of training points, which can be made arbitrarily small under some assumptions. We illustrate our theoretical bounds with numerical experiments.

MCML Authors

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[20]

M. Singh, A. Fono and G. Kutyniok.
Expressivity of Spiking Neural Networks.
Preprint (Mar. 2024). arXiv

Abstract

The synergy between spiking neural networks and neuromorphic hardware holds promise for the development of energy-efficient AI applications. Inspired by this potential, we revisit the foundational aspects to study the capabilities of spiking neural networks where information is encoded in the firing time of neurons. Under the Spike Response Model as a mathematical model of a spiking neuron with a linear response function, we compare the expressive power of artificial and spiking neural networks, where we initially show that they realize piecewise linear mappings. In contrast to ReLU networks, we prove that spiking neural networks can realize both continuous and discontinuous functions. Moreover, we provide complexity bounds on the size of spiking neural networks to emulate multi-layer (ReLU) neural networks. Restricting to the continuous setting, we also establish complexity bounds in the reverse direction for one-layer spiking neural networks.

MCML Authors

Manjot Singh

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Adalbert Fono

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

2023

[19]

S. Maskey, R. Paolino, A. Bacho and G. Kutyniok.
A Fractional Graph Laplacian Approach to Oversmoothing.
NeurIPS 2023 - 37th Conference on Neural Information Processing Systems. New Orleans, LA, USA, Dec 10-16, 2023. URL GitHub

Abstract

Graph neural networks (GNNs) have shown state-of-the-art performances in various applications. However, GNNs often struggle to capture long-range dependencies in graphs due to oversmoothing. In this paper, we generalize the concept of oversmoothing from undirected to directed graphs. To this aim, we extend the notion of Dirichlet energy by considering a directed symmetrically normalized Laplacian. As vanilla graph convolutional networks are prone to oversmooth, we adopt a neural graph ODE framework. Specifically, we propose fractional graph Laplacian neural ODEs, which describe non-local dynamics. We prove that our approach allows propagating information between distant nodes while maintaining a low probability of long-distance jumps. Moreover, we show that our method is more flexible with respect to the convergence of the graph’s Dirichlet energy, thereby mitigating oversmoothing. We conduct extensive experiments on synthetic and real-world graphs, both directed and undirected, demonstrating our method’s versatility across diverse graph homophily levels.

MCML Authors

Sohir Maskey

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Raffaele Paolino

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[18]

M. Seleznova, D. Weitzner, R. Giryes, G. Kutyniok and H.-H. Chou.
Neural (Tangent Kernel) Collapse.
NeurIPS 2023 - 37th Conference on Neural Information Processing Systems. New Orleans, LA, USA, Dec 10-16, 2023. URL

Abstract

This work bridges two important concepts: the Neural Tangent Kernel (NTK), which captures the evolution of deep neural networks (DNNs) during training, and the Neural Collapse (NC) phenomenon, which refers to the emergence of symmetry and structure in the last-layer features of well-trained classification DNNs. We adopt the natural assumption that the empirical NTK develops a block structure aligned with the class labels, i.e., samples within the same class have stronger correlations than samples from different classes. Under this assumption, we derive the dynamics of DNNs trained with mean squared (MSE) loss and break them into interpretable phases. Moreover, we identify an invariant that captures the essence of the dynamics, and use it to prove the emergence of NC in DNNs with block-structured NTK. We provide large-scale numerical experiments on three common DNN architectures and three benchmark datasets to support our theory.

MCML Authors

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[17]

M. Singh, A. Fono and G. Kutyniok.
Are Spiking Neural Networks more expressive than Artificial Neural Networks?
UniReps @NeurIPS 2023 - 1st Workshop on Unifying Representations in Neural Models at the 37th Conference on Neural Information Processing Systems (NeurIPS 2023). New Orleans, LA, USA, Dec 10-16, 2023. URL

Abstract

This article studies the expressive power of spiking neural networks with firing-time-based information encoding, highlighting their potential for future energy-efficient AI applications when deployed on neuromorphic hardware. The computational power of a network of spiking neurons has already been studied via their capability of approximating any continuous function. By using the Spike Response Model as a mathematical model of a spiking neuron and assuming a linear response function, we delve deeper into this analysis and prove that spiking neural networks generate continuous piecewise linear mappings. We also show that they can emulate any multi-layer (ReLU) neural network with similar complexity. Furthermore, we prove that the maximum number of linear regions generated by a spiking neuron scales exponentially with respect to the input dimension, a characteristic that distinguishes it significantly from an artificial (ReLU) neuron. Our results further extend the understanding of the approximation properties of spiking neural networks and open up new avenues where spiking neural networks can be deployed instead of artificial neural networks without any performance loss.

MCML Authors

Manjot Singh

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Adalbert Fono

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[16]

H. Boche, A. Fono and G. Kutyniok.
Limitations of Deep Learning for Inverse Problems on Digital Hardware.
IEEE Transactions on Information Theory 69.12 (Dec. 2023). DOI

Abstract

Deep neural networks have seen tremendous success over the last years. Since the training is performed on digital hardware, in this paper, we analyze what actually can be computed on current hardware platforms modeled as Turing machines, which would lead to inherent restrictions of deep learning. For this, we focus on the class of inverse problems, which, in particular, encompasses any task to reconstruct data from measurements. We prove that finite-dimensional inverse problems are not Banach-Mazur computable for small relaxation parameters. Even more, our results introduce a lower bound on the accuracy that can be obtained algorithmically.

MCML Authors

Adalbert Fono

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[15]

Ç. Yapar, R. Levie, G. Kutyniok and G. Caire.
Real-Time Outdoor Localization Using Radio Maps: A Deep Learning Approach.
IEEE Transactions on Wireless Communications 22.12 (Dec. 2023). DOI

Abstract

Global Navigation Satellite Systems typically perform poorly in urban environments, where the likelihood of line-of-sight conditions between devices and satellites is low. Therefore, alternative location methods are required to achieve good accuracy. We present LocUNet: A convolutional, end-to-end trained neural network (NN) for the localization task, which is able to estimate the position of a user from the received signal strength (RSS) of a small number of Base Stations (BS). Using estimations of pathloss radio maps of the BSs and the RSS measurements of the users to be localized, LocUNet can localize users with state-of-the-art accuracy and enjoys high robustness to inaccuracies in the estimations of radio maps. The proposed method does not require generating RSS fingerprints of each specific area where the localization task is performed and is suitable for real-time applications. Moreover, two novel datasets that allow for numerical evaluations of RSS and ToA methods in realistic urban environments are presented and made publicly available for the research community. By using these datasets, we also provide a fair comparison of state-of-the-art RSS and ToA-based methods in the dense urban scenario and show numerically that LocUNet outperforms all the compared methods.

MCML Authors

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[14]

S. Kolek, A. Chattopadhyay, K. H. R. Chan, H. Andrade-Loarca, G. Kutyniok and R. Vidal.
Learning Interpretable Queries for Explainable Image Classification with Information Pursuit.
Preprint (Dec. 2023). arXiv

Abstract

Information Pursuit (IP) is an explainable prediction algorithm that greedily selects a sequence of interpretable queries about the data in order of information gain, updating its posterior at each step based on observed query-answer pairs. The standard paradigm uses hand-crafted dictionaries of potential data queries curated by a domain expert or a large language model after a human prompt. However, in practice, hand-crafted dictionaries are limited by the expertise of the curator and the heuristics of prompt engineering. This paper introduces a novel approach: learning a dictionary of interpretable queries directly from the dataset. Our query dictionary learning problem is formulated as an optimization problem by augmenting IP’s variational formulation with learnable dictionary parameters. To formulate learnable and interpretable queries, we leverage the latent space of large vision and language models like CLIP. To solve the optimization problem, we propose a new query dictionary learning algorithm inspired by classical sparse dictionary learning. Our experiments demonstrate that learned dictionaries significantly outperform hand-crafted dictionaries generated with large language models.

MCML Authors

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[13]

H. Andrade-Loarca, J. Hege, D. Cremers and G. Kutyniok.
Neural Poisson Surface Reconstruction: Resolution-Agnostic Shape Reconstruction from Point Clouds.
Preprint (Nov. 2023). arXiv

Abstract

We introduce Neural Poisson Surface Reconstruction (nPSR), an architecture for shape reconstruction that addresses the challenge of recovering 3D shapes from points. Traditional deep neural networks face challenges with common 3D shape discretization techniques due to their computational complexity at higher resolutions. To overcome this, we leverage Fourier Neural Operators to solve the Poisson equation and reconstruct a mesh from oriented point cloud measurements. nPSR exhibits two main advantages: First, it enables efficient training on low-resolution data while achieving comparable performance at high-resolution evaluation, thanks to the resolution-agnostic nature of FNOs. This feature allows for one-shot super-resolution. Second, our method surpasses existing approaches in reconstruction quality while being differentiable and robust with respect to point sampling rates. Overall, the neural Poisson surface reconstruction not only improves upon the limitations of classical deep neural networks in shape reconstruction but also achieves superior results in terms of reconstruction quality, running time, and resolution agnosticism.

MCML Authors

Daniel Cremers

Prof. Dr.

B1 | Computer Vision

Computer Vision & Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[12]

Ç. Yapar, F. Jaensch, R. Ron, G. Kutyniok and G. Caire.
Overview of the Urban Wireless Localization Competition.
MLSP 2023 - IEEE Workshop on Machine Learning for Signal Processing. Rome, Italy, Sep 17-20, 2023. DOI

Abstract

In dense urban environments, Global Navigation Satellite Systems do not provide good accuracy due to the low probability of line-of-sight (LOS) between the user equipment (UE) to be located and the satellites due to the presence of obstacles such as buildings. As a result, it is necessary to resort to other technologies that can operate reliably under non-line-of-sight (NLOS) conditions. To promote research in the reviving field of radio map-based wireless localization, we have launched the MLSP 2023 Urban Wireless Localization Competition. In this short overview paper, we describe the urban wireless localization problem, the provided datasets and baseline methods, the challenge task, and the challenge evaluation methodology. Finally, we present the results of the challenge.

MCML Authors

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[11]

A. Bacho, H. Boche and G. Kutyniok.
Complexity Blowup for Solutions of the Laplace and the Diffusion Equation.
Preprint (Sep. 2023). arXiv

Abstract

In this paper, we investigate the computational complexity of solutions to the Laplace and the diffusion equation. We show that for a certain class of initial-boundary value problems of the Laplace and the diffusion equation, the solution operator is #P1/#P-complete in the sense that it maps polynomial-time computable functions to the set of #P1/#P-complete functions. Consequently, there exists polynomial-time (Turing) computable input data such that the solution is not polynomial-time computable, unless FP=#P or FP1=#P1. In this case, we can, in general, not simulate the solution of the Laplace or the diffusion equation on a digital computer without having a complexity blowup, i.e., the computation time for obtaining an approximation of the solution with up to a finite number of significant digits grows non-polynomially in the number of digits. This indicates that the computational complexity of the solution operator that models a physical phenomena is intrinsically high, independent of the numerical algorithm that is used to approximate a solution.

MCML Authors

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[10]

S. Alberti, N. Dern, L. Thesing and G. Kutyniok.
Sumformer: Universal Approximation for Efficient Transformers.
TAG-ML @ICML 2023 - 2nd Annual Workshop on Topology, Algebra, and Geometry in Machine Learning at the 40th International Conference on Machine Learning (ICML 2023). Honolulu, Hawaii, Jul 23-29, 2023. URL

Abstract

Natural language processing (NLP) made an impressive jump with the introduction of Transformers. ChatGPT is one of the most famous examples, changing the perception of the possibilities of AI even outside the research community. However, besides the impressive performance, the quadratic time and space complexity of Transformers with respect to sequence length pose significant limitations for handling long sequences. While efficient Transformer architectures like Linformer and Performer with linear complexity have emerged as promising solutions, their theoretical understanding remains limited. In this paper, we introduce Sumformer, a novel and simple architecture capable of universally approximating equivariant sequence-to-sequence functions. We use Sumformer to give the first universal approximation results for Linformer and Performer. Moreover, we derive a new proof for Transformers, showing that just one attention layer is sufficient for universal approximation.

MCML Authors

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[9]

G. Kutyniok.
An introduction to the mathematics of deep learning.
European Congress of Mathematics (Jul. 2023). DOI

Abstract

Despite the outstanding success of deep neural networks in real-world applications, ranging from science to public life, most of the related research is empirically driven and a comprehensive mathematical foundation is still missing. At the same time, these methods have already shown their impressive potential in mathematical research areas such as imaging sciences, inverse problems, or numerical analysis of partial differential equations, sometimes by far outperforming classical mathematical approaches for particular problem classes. The goal of this paper, which is based on a plenary lecture at the 8th European Congress of Mathematics in 2021, is to first provide an introduction into this new vibrant research area. We will then showcase some recent advances in two directions, namely the development of a mathematical foundation of deep learning and the introduction of novel deep learning-based approaches to solve inverse problems and partial differential equations.

MCML Authors

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[8]

A. Bacho, H. Boche and G. Kutyniok.
Reliable AI: Does the Next Generation Require Quantum Computing?
Preprint (Jul. 2023). arXiv

Abstract

In this survey, we aim to explore the fundamental question of whether the next generation of artificial intelligence requires quantum computing. Artificial intelligence is increasingly playing a crucial role in many aspects of our daily lives and is central to the fourth industrial revolution. It is therefore imperative that artificial intelligence is reliable and trustworthy. However, there are still many issues with reliability of artificial intelligence, such as privacy, responsibility, safety, and security, in areas such as autonomous driving, healthcare, robotics, and others. These problems can have various causes, including insufficient data, biases, and robustness problems, as well as fundamental issues such as computability problems on digital hardware. The cause of these computability problems is rooted in the fact that digital hardware is based on the computing model of the Turing machine, which is inherently discrete. Notably, our findings demonstrate that digital hardware is inherently constrained in solving problems about optimization, deep learning, or differential equations. Therefore, these limitations carry substantial implications for the field of artificial intelligence, in particular for machine learning. Furthermore, although it is well known that the quantum computer shows a quantum advantage for certain classes of problems, our findings establish that some of these limitations persist when employing quantum computing models based on the quantum circuit or the quantum Turing machine paradigm. In contrast, analog computing models, such as the Blum-Shub-Smale machine, exhibit the potential to surmount these limitations.

MCML Authors

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[7]

P. Scholl, A. Bacho, H. Boche and G. Kutyniok.
The Uniqueness Problem of Physical Law Learning.
ICASSP 2023 - IEEE International Conference on Acoustics, Speech and Signal Processing. Rhode Island, Greece, Jun 04-10, 2023. DOI

Abstract

Physical law learning is the ambiguous attempt at automating the derivation of governing equations with the use of machine learning techniques. This paper shall serve as a first step to build a comprehensive theoretical framework for learning physical laws, aiming to provide reliability to according algorithms. One key problem consists in the fact that the governing equations might not be uniquely determined by the given data. We will study this problem in the common situation that a physical law is described by an ordinary or partial differential equation. For various different classes of differential equations, we provide both necessary and sufficient conditions for a function from a given function class to uniquely determine the differential equation which is governing the phenomenon. We then use our results to determine in extensive numerical experiments whether a function solves a differential equation uniquely.

MCML Authors

Philipp Scholl

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[6]

Ç. Yapar, F. Jaensch, R. Levie, G. Kutyniok and G. Caire.
The First Pathloss Radio Map Prediction Challenge.
ICASSP 2023 - IEEE International Conference on Acoustics, Speech and Signal Processing. Rhode Island, Greece, Jun 04-10, 2023. DOI

Abstract

To foster research and facilitate fair comparisons among recently proposed pathloss radio map prediction methods, we have launched the ICASSP 2023 First Pathloss Radio Map Prediction Challenge. In this short overview paper, we briefly describe the pathloss prediction problem, the provided datasets, the challenge task and the challenge evaluation methodology. Finally, we present the results of the challenge.

MCML Authors

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[5]

R. Paolino, A. Bojchevski, S. Günnemann, G. Kutyniok and R. Levie.
Unveiling the Sampling Density in Non-Uniform Geometric Graphs.
ICLR 2023 - 11th International Conference on Learning Representations. Kigali, Rwanda, May 01-05, 2023. URL

Abstract

A powerful framework for studying graphs is to consider them as geometric graphs: nodes are randomly sampled from an underlying metric space, and any pair of nodes is connected if their distance is less than a specified neighborhood radius. Currently, the literature mostly focuses on uniform sampling and constant neighborhood radius. However, real-world graphs are likely to be better represented by a model in which the sampling density and the neighborhood radius can both vary over the latent space. For instance, in a social network communities can be modeled as densely sampled areas, and hubs as nodes with larger neighborhood radius. In this work, we first perform a rigorous mathematical analysis of this (more general) class of models, including derivations of the resulting graph shift operators. The key insight is that graph shift operators should be corrected in order to avoid potential distortions introduced by the non-uniform sampling. Then, we develop methods to estimate the unknown sampling density in a self-supervised fashion. Finally, we present exemplary applications in which the learnt density is used to 1) correct the graph shift operator and improve performance on a variety of tasks, 2) improve pooling, and 3) extract knowledge from networks. Our experimental findings support our theory and provide strong evidence for our model.

MCML Authors

Raffaele Paolino

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Stephan Günnemann

Prof. Dr.

A3 | Computational Models

Data Analytics & Machine Learning

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

2022

[4]

H. Boche, A. Fono and G. Kutyniok.
Non-Computability of the Pseudoinverse on Digital Computers.
Preprint (Dec. 2022). arXiv

Abstract

The pseudoinverse of a matrix, a generalized notion of the inverse, is of fundamental importance in linear algebra. However, there does not exist a closed form representation of the pseudoinverse, which can be straightforwardly computed. Therefore, an algorithmic computation is necessary. An algorithmic computation can only be evaluated by also considering the underlying hardware, typically digital hardware, which is responsible for performing the actual computations step by step. In this paper, we analyze if and to what degree the pseudoinverse actually can be computed on digital hardware platforms modeled as Turing machines. For this, we utilize the notion of an effective algorithm which describes a provably correct computation: upon an input of any error parameter, the algorithm provides an approximation within the given error bound with respect to the unknown solution. We prove that an effective algorithm for computing the pseudoinverse of any matrix can not exist on a Turing machine, although provably correct algorithms do exist for specific classes of matrices. Even more, our results introduce a lower bound on the accuracy that can be obtained algorithmically when computing the pseudoinverse on Turing machines.

MCML Authors

Adalbert Fono

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[3]

C. Koke and G. Kutyniok.
Graph Scattering beyond Wavelet Shackles.
NeurIPS 2022 - 36th Conference on Neural Information Processing Systems. New Orleans, LA, USA, Nov 28-Dec 09, 2022. URL

Abstract

This work develops a flexible and mathematically sound framework for the design and analysis of graph scattering networks with variable branching ratios and generic functional calculus filters.Spectrally-agnostic stability guarantees for node- and graph-level perturbations are derived; the vertex-set non-preserving case is treated by utilizing recently developed mathematical-physics based tools. Energy propagation through the network layers is investigated and related to truncation stability. New methods of graph-level feature aggregation are introduced and stability of the resulting composite scattering architectures is established. Finally, scattering transforms are extended to edge- and higher order tensorial input. Theoretical results are complemented by numerical investigations: Suitably chosen scattering networks conforming to the developed theory perform better than traditional graph-wavelet based scattering approaches in social network graph classification tasks andsignificantly outperform other graph-based learning approaches to regression of quantum-chemical energies on QM7.

MCML Authors

Christian Koke

B1 | Computer Vision
→ Group Daniel Cremers

Computer Vision & Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[2]

S. Maskey, R. Levie, Y. Lee and G. Kutyniok.
Generalization Analysis of Message Passing Neural Networks on Large Random Graphs.
NeurIPS 2022 - 36th Conference on Neural Information Processing Systems. New Orleans, LA, USA, Nov 28-Dec 09, 2022. URL

Abstract

Message passing neural networks (MPNN) have seen a steep rise in popularity since their introduction as generalizations of convolutional neural networks to graph-structured data, and are now considered state-of-the-art tools for solving a large variety of graph-focused problems. We study the generalization error of MPNNs in graph classification and regression. We assume that graphs of different classes are sampled from different random graph models. We show that, when training a MPNN on a dataset sampled from such a distribution, the generalization gap increases in the complexity of the MPNN, and decreases, not only with respect to the number of training samples, but also with the average number of nodes in the graphs. This shows how a MPNN with high complexity can generalize from a small dataset of graphs, as long as the graphs are large. The generalization bound is derived from a uniform convergence result, that shows that any MPNN, applied on a graph, approximates the MPNN applied on the geometric model that the graph discretizes.

MCML Authors

Sohir Maskey

A2 | Mathematical Foundations
→ Group Gitta Kutyniok

Mathematical Foundations of Artificial Intelligence

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

[1]

Y. Zhou, G. Kutyniok and B. Ribeiro.
OOD Link Prediction Generalization Capabilities of Message-Passing GNNs in Larger Test Graphs.
NeurIPS 2022 - 36th Conference on Neural Information Processing Systems. New Orleans, LA, USA, Nov 28-Dec 09, 2022. URL

Abstract

This work provides the first theoretical study on the ability of graph Message Passing Neural Networks (gMPNNs) —such as Graph Neural Networks (GNNs)— to perform inductive out-of-distribution (OOD) link prediction tasks, where deployment (test) graph sizes are larger than training graphs. We first prove non-asymptotic bounds showing that link predictors based on permutation-equivariant (structural) node embeddings obtained by gMPNNs can converge to a random guess as test graphs get larger. We then propose a theoretically-sound gMPNN that outputs structural pairwise (2-node) embeddings and prove non-asymptotic bounds showing that, as test graphs grow, these embeddings converge to embeddings of a continuous function that retains its ability to predict links OOD. Empirical results on random graphs show agreement with our theoretical results.

MCML Authors

Gitta Kutyniok

Prof. Dr.

A2 | Mathematical Foundations

Mathematical Foundations of Artificial Intelligence

Research Group Gitta Kutyniok

Gitta Kutyniok

Team members @MCML

PostDocs

PhD Students

Recent News @MCML

MCML Researchers With 24 Papers at ICML 2025

MCML Researchers With 51 Papers at ICLR 2025

Gitta Kutyniok Joins National Academy of Artificial Intelligence

AI Competition: Where Does Europe Stand?

Gitta Kutyniok on Supporting Companies With the EU’s AI Act

Publications @MCML

2025

2024

2023

2022