Detecting Brittle Decisions for Free: Leveraging Margin Consistency in Deep Robust Classifiers.
Published in Neurips 2024, 2024
A novel property of deep robust classifiers that allows to use the logit margin as a proxy score for input margin and efficiently detect non-robust samples, vulnerable to adversarial attacks.
Recommended citation: Ngnawé, J., Sahoo, S., Pequignot, Y., Precioso, F., & Gagné, C. (2024). Detecting Brittle Decisions for Free: Leveraging Margin Consistency in Deep Robust Classifiers. arXiv preprint arXiv:2406.18451.
Download Paper | Download Slides