FACADE: A Framework for Adversarial Circuit Anomaly Detection and Evaluation.
Published in ICML, 2023
We present FACADE, a novel probabilistic and geometric framework designed for unsupervised mechanistic anomaly detection in deep neural networks.
Recommended citation: Pai, DB, Carranza, A, Tandon, A, Schaeffer, R, Koyejo, S. “FACADE: A Framework for Adversarial Circuit Anomaly Detection and Evaluation.” Adversarial ML Frontiers, ICML Workshops, Jun 20, 2023. https://openreview.net/forum?id=4j8KuZOmQH