[190] Solving math word problems with process and outcome-based feedback

December 16, 2024 · 3 min · long8v

[161] MM-SHAP: A Performance-agnostic Metric for Measuring Multimodal Contributions in Vision and Language Models & Tasks

July 9, 2024 · 1 min · long8v

[149] Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning

February 12, 2024 · 1 min · long8v

[97] Contrastive Language-Image Pre-Training with Knowledge Graph

January 12, 2023 · 2 min · long8v

[89] Relational Attention: Generalizing Transformers for Graph-Structured Tasks

December 15, 2022 · 2 min · long8v

[71] Large Models are Parsimonious Learners: Activation Sparsity in Trained Transformers

October 17, 2022 · 1 min · long8v