[161] MM-SHAP: A Performance-agnostic Metric for Measuring Multimodal Contributions in Vision and Language Models & Tasks

July 9, 2024 · 1 min · long8v · 

[158] A Mathematical Framework for Transformer Circuits

May 9, 2024 · 4 min · long8v · 

[156] Interpreting CLIP's Image Representation via Text-Based Decomposition

May 6, 2024 · 2 min · long8v · 

[157] LeGrad: An Explainability Method for Vision Transformers via Feature Formation Sensitivity

May 6, 2024 · 2 min · long8v · 

[154] Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment

April 3, 2024 · 2 min · long8v · 

[153] Contrastive Explanations for Model Interpretability

April 1, 2024 · 2 min · long8v · 

[151] FOIL it! Find One mismatch between Image and Language caption

March 3, 2024 · 3 min · long8v · 

[150] Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization

February 13, 2024 · 3 min · long8v · 

[147] Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers

February 7, 2024 · 3 min · long8v · 

[146] Transformer Interpretability Beyond Attention Visualization

February 6, 2024 · 4 min · long8v · 

[77] Interpretable Image Classification with Differentiable Prototype Assignment

November 9, 2022 · 3 min · long8v · 

[75] SESS: Saliency Enhancing with Scaling and Sliding

November 8, 2022 · 2 min · long8v ·