[190] Solving math word problems with process and outcome-based feedback

December 16, 2024 · 4 min · long8v

[161] MM-SHAP: A Performance-agnostic Metric for Measuring Multimodal Contributions in Vision and Language Models & Tasks

July 9, 2024 · 1 min · long8v

[149] Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning

February 12, 2024 · 1 min · long8v

[89] Relational Attention: Generalizing Transformers for Graph-Structured Tasks

December 15, 2022 · 2 min · long8v

[71] Large Models are Parsimonious Learners: Activation Sparsity in Trained Transformers

October 17, 2022 · 1 min · long8v