[161] MM-SHAP: A Performance-agnostic Metric for Measuring Multimodal Contributions in Vision and Language Models & Tasks

July 9, 2024 ยท 1 min ยท long8v ยท 

[158] A Mathematical Framework for Transformer Circuits

May 9, 2024 ยท 4 min ยท long8v ยท 

[156] Interpreting CLIP's Image Representation via Text-Based Decomposition

May 6, 2024 ยท 2 min ยท long8v ยท 

[157] LeGrad: An Explainability Method for Vision Transformers via Feature Formation Sensitivity

May 6, 2024 ยท 2 min ยท long8v ยท 

[154] Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment

April 3, 2024 ยท 2 min ยท long8v ยท 

[153] Contrastive Explanations for Model Interpretability

April 1, 2024 ยท 2 min ยท long8v ยท 

[151] FOIL it! Find One mismatch between Image and Language caption

March 3, 2024 ยท 3 min ยท long8v ยท 

[150] Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization

February 13, 2024 ยท 3 min ยท long8v ยท 

[147] Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers

February 7, 2024 ยท 3 min ยท long8v ยท 

[146] Transformer Interpretability Beyond Attention Visualization

February 6, 2024 ยท 4 min ยท long8v ยท 

[77] Interpretable Image Classification with Differentiable Prototype Assignment

November 9, 2022 ยท 3 min ยท long8v ยท 

[75] SESS: Saliency Enhancing with Scaling and Sliding

November 8, 2022 ยท 2 min ยท long8v ยท