31.07.2025

From Vulnerable to Verified: Exact Certificates Shield Models From Label‑Flipping

MCML Research Insight - With Lukas Gosch, Stephan Günnemann and Debarghya Ghoshdastidar

Machine‑learning models can be undermined before training even starts. By silently altering a small share of training labels - marking “spam” as “not‑spam,” for instance - an attacker can cut accuracy by double‑digit percentages.

The paper “Exact Certification of (Graph) Neural Networks Against Label Poisoning”, by MCML Junior Member Lukas Gosch, PIs Stephan Günnemann and Debarghya Ghoshdastidar, and collaborator Mahalakshmi Sabanayagam, introduces the first exact certificates guaranteeing that a neural network’s predictions will not change under a prescribed number of label flips. Although demonstrated on graph neural networks (GNNs), the method applies to any sufficiently wide neural network.


How the Certification Works

Figure 1: Illustration of the label-flipping certificate

  • Neural‑tangent view. In the wide‑network limit, a trained network behaves like a support‑vector machine whose kernel is the network’s neural tangent kernel (NTK).
  • Single‑level reformulation. Substituting this NTK model turns the attacker‑versus‑learner game behind certification (the attacker picks label flips, the learner then retrains) into a single optimization problem.
  • Mixed‑integer linear program. That problem is expressed as a mixed‑integer linear program whose solution yields (i) sample‑wise certificates for individual test nodes and (ii) collective certificates for the entire test set.
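
To make the notion of an exact certificate concrete, the following Python sketch checks robustness to label flips for a toy kernel model. It is not the paper's method: kernel ridge regression is swapped in for the NTK‑based support‑vector machine, an RBF kernel stands in for the NTK, and brute‑force enumeration replaces the mixed‑integer linear program. All function names (rbf_kernel, label_weights, certified_brute_force, certified_closed_form) are hypothetical. Because kernel ridge regression is linear in the training labels, the worst‑case flips can also be found by sorting, which the sketch uses as a cross‑check.

```python
import itertools
import numpy as np


def rbf_kernel(A, B, gamma=1.0):
    """Stand-in kernel (RBF). Assumption: the paper uses the network's NTK."""
    sq_dist = ((A[:, None, :] - B[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-gamma * sq_dist)


def label_weights(X_train, x_test, lam=0.1):
    """Kernel ridge regression predicts f(x_test) = w @ y; return w."""
    K = rbf_kernel(X_train, X_train)
    k = rbf_kernel(X_train, x_test[None, :])[:, 0]
    return np.linalg.solve(K + lam * np.eye(len(X_train)), k)


def certified_brute_force(w, y, budget):
    """True if no set of at most `budget` label flips changes sign(w @ y)."""
    clean_sign = np.sign(w @ y)
    for r in range(1, budget + 1):
        for idx in itertools.combinations(range(len(y)), r):
            y_poisoned = y.copy()
            y_poisoned[list(idx)] *= -1  # flip the chosen labels
            if np.sign(w @ y_poisoned) != clean_sign:
                return False
    return True


def certified_closed_form(w, y, budget):
    """Same answer without enumeration, valid only for this linear model."""
    clean = w @ y
    s = np.sign(clean)
    # Flipping label i lowers the signed margin s * f by 2 * s * w_i * y_i.
    loss = 2 * s * w * y
    worst = np.sort(loss)[::-1][:budget]
    worst = worst[worst > 0]  # the attacker skips flips that would help
    return bool(s * clean - worst.sum() > 0)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X_train = rng.normal(size=(8, 2))            # toy training inputs
    y_train = np.where(X_train[:, 0] > 0, 1.0, -1.0)  # labels in {-1, +1}
    x_test = np.array([1.0, 0.0])
    w = label_weights(X_train, x_test)
    for budget in range(4):
        print(budget,
              certified_brute_force(w, y_train, budget),
              certified_closed_form(w, y_train, budget))
```

Enumeration grows combinatorially with the budget, which is why an exact certificate for a realistic model needs a structured solver such as the mixed‑integer linear program described above. The sorting shortcut exists only because the stand‑in model is linear in the labels.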

What Experiments Show

Figure 2: Certified ratios of selected architectures on the Cora-MLb dataset, computed with the sample-wise and collective certificates. The certified ratio is the share of test‑set predictions that the certificate proves cannot be overturned even if an attacker flips up to a given fraction of the training labels.
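
The sample‑wise certified ratio from the caption can be computed in a few lines of Python. This is a minimal sketch, assuming each test sample comes with a certified radius, i.e. the largest number of label flips for which its sample‑wise certificate holds. The function name certified_ratio and the example radii are hypothetical and are not values from the paper. A collective certified ratio cannot be derived this way, since the collective certificate reasons about one poisoning shared by all test samples.

```python
import numpy as np


def certified_ratio(certified_radius, budgets):
    """Share of test samples whose prediction is certified at each budget.

    certified_radius[i] is the largest number of label flips that test
    sample i provably tolerates (hypothetical input format).
    """
    radius = np.asarray(certified_radius)
    return {b: float(np.mean(radius >= b)) for b in budgets}


# Made-up radii for five test samples, for illustration only.
example_radii = [0, 1, 2, 2, 5]
ratios = certified_ratio(example_radii, budgets=[0, 1, 2, 3])
print(ratios)
```

To express the budget as a fraction of the training labels, as in the figure, convert each fraction to a flip count before calling the function.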

  • No universal best architecture. The most robust GNN depends on the data set.
  • Design choices matter. Linear activations improve robustness, while deeper architectures often weaken it.
  • A robustness plateau. Collective certificates reveal that robustness flattens out at medium attack budgets - an effect not reported before (see Figure 2).

«Machine learning models are highly vulnerable to label flipping, i.e., the adversarial modification (poisoning) of training labels to compromise performance.»


Lukas Gosch et al.

MCML Junior Members

Practical Implications

Because the approach relies only on the NTK, it extends to standard (non‑graph) wide neural networks, giving practitioners the first provable defence against label poisoning in deep learning.


«There is no silver bullet: robustness hierarchies of GNNs are strongly data dependent.»


Lukas Gosch et al.

MCML Junior Members

Key Takeaway

Exact certification shifts robustness from a best‑effort practice to a provable property. For anyone concerned about poisoned training data, this work provides a clear path toward verifiably trustworthy machine‑learning models.


Interested in Exploring Further?

The paper was published as a spotlight presentation at the A* conference ICLR 2025. You can explore the full paper, including proofs, algorithmic details, and additional experiments, and find the open-source code on GitHub.

A* Conference
M. Sabanayagam • L. Gosch • S. Günnemann • D. Ghoshdastidar
Exact Certification of (Graph) Neural Networks Against Label Poisoning.
ICLR 2025 - 13th International Conference on Learning Representations. Singapore, Apr 24-28, 2025. Spotlight Presentation.

