Presentation + Paper
22 April 2020 Countermeasure against backdoor attacks using epistemic classifiers
Zhaoyuan Yang, Nurali Virani, Naresh S. Iyer
Author Affiliations +
Abstract
In machine learning, backdoor or trojan attacks during model training can cause the targeted model to deceptively learn to misclassify in the presence of specific triggers. This mechanism of deception enables the attacker to exercise full control on when the model behavior becomes malicious through use of a trigger. In this paper, we introduce Epistemic Classifiers as a new category of defense mechanism and show their effectiveness in detecting backdoor attacks, which can be used to trigger default mechanisms, or solicit human intervention, on occasions where an untrustworthy model prediction can adversely impact the system within which it operates. We show experimental results with multiple public datasets and explain the reasons with visualization for effectiveness of the proposed approach. This empowers the war fighter to trust the AI on the tactical edge to be reliable and to become sensitive to scenarios with deception and noise where reliability cannot be provided.
Conference Presentation
© (2020) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Zhaoyuan Yang, Nurali Virani, and Naresh S. Iyer "Countermeasure against backdoor attacks using epistemic classifiers", Proc. SPIE 11413, Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications II, 114130P (22 April 2020); https://doi.org/10.1117/12.2558255
Lens.org Logo
CITATIONS
Cited by 2 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Data modeling

Reliability

Machine learning

Neural networks

Artificial intelligence

Defense and security

Image classification

RELATED CONTENT

COVID 19 fake news detection based on long short term...
Proceedings of SPIE (April 22 2022)
Obstacle detection and recognition using SSD
Proceedings of SPIE (October 29 2018)
Few-shot learning for defence and security
Proceedings of SPIE (April 21 2020)

Back to Top