Geometrically Guided Saliency Maps

Authors: Md Mahfuzur Rahman, Noah Lewis, Sergey Plis

Publication date: 2022/3/1

Conference: ICLR 2022 Workshop on PAIR^2Struct

Description

Interpretability methods for deep neural networks mainly focus on modifying the rules of automatic differentiation, or on perturbing the input and observing the score drop, to determine the most relevant features. Among them, gradient-based attribution methods, such as saliency maps, are arguably the most popular. Still, the produced saliency maps often lack intelligibility. We address this problem by building on recent discoveries about the geometric properties of deep neural networks' loss landscapes, which reveal a multiplicity of local minima in the vicinity of a trained model's loss surface. We introduce two methods that leverage the geometry of the loss landscape to improve interpretability: 1) "Geometrically Guided Integrated Gradients", which applies gradient ascent to each interpolation point of the linear path as a guide, and 2) "Geometric Ensemble Gradients", which generates ensemble saliency maps by sampling proximal iso-loss models. Compared to vanilla and integrated gradients, these methods significantly improve saliency maps both quantitatively and visually. We verify our findings on the MNIST and ImageNet datasets across convolutional, ResNet, and Inception V3 architectures.
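The description covers both methods only at a high level, so the PyTorch sketch below shows one plausible reading rather than the paper's actual procedure. The helper names `geometrically_guided_ig` and `geometric_ensemble_gradients` are hypothetical, the ascent objective (the target-class score with respect to the interpolated input) and the iso-loss sampling scheme (Gaussian weight noise filtered by a loss tolerance on a reference batch) are assumptions made for illustration, and inputs are assumed to carry a leading batch dimension.

```python
import copy
import torch


def geometrically_guided_ig(model, x, baseline, target,
                            steps=32, ascent_steps=5, ascent_lr=0.01):
    """Integrated gradients with gradient ascent guiding each interpolation point.

    Sketch only: the ascent objective (target-class score w.r.t. the
    interpolated input) is an assumed reading of the abstract.
    """
    model.eval()
    accumulated = torch.zeros_like(x)
    for i in range(1, steps + 1):
        point = (baseline + (i / steps) * (x - baseline)).detach().requires_grad_(True)
        # Guide the interpolation point with a few gradient-ascent steps.
        for _ in range(ascent_steps):
            score = model(point)[0, target]
            grad, = torch.autograd.grad(score, point)
            point = (point + ascent_lr * grad).detach().requires_grad_(True)
        # Accumulate the input gradient at the guided point.
        score = model(point)[0, target]
        grad, = torch.autograd.grad(score, point)
        accumulated += grad.detach()
    # Riemann approximation of the path integral, scaled by the input delta.
    return (x - baseline) * accumulated / steps


def geometric_ensemble_gradients(model, x, target, loss_fn, ref_batch,
                                 n_samples=10, sigma=0.01, loss_tol=0.05):
    """Average vanilla saliency maps over proximal, near-iso-loss weight samples.

    Sketch only: Gaussian weight perturbations filtered by a loss tolerance
    on a reference batch stand in for the paper's iso-loss sampling.
    """
    xb, yb = ref_batch
    with torch.no_grad():
        base_loss = loss_fn(model(xb), yb).item()
    maps, kept = torch.zeros_like(x), 0
    while kept < n_samples:
        sampled = copy.deepcopy(model)
        sampled.eval()
        with torch.no_grad():
            for p in sampled.parameters():
                p.add_(sigma * torch.randn_like(p))
            # Keep only samples that stay in the iso-loss neighbourhood.
            if abs(loss_fn(sampled(xb), yb).item() - base_loss) > loss_tol:
                continue
        inp = x.clone().detach().requires_grad_(True)
        score = sampled(inp)[0, target]
        grad, = torch.autograd.grad(score, inp)
        maps += grad.detach()
        kept += 1
    return maps / n_samples
```

If this reading is roughly right, the ensemble variant plays the usual ensemble role: attributions that persist across proximal, similar-loss models are reinforced, while artifacts tied to one particular set of weights are averaged away.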

