[170] Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback

August 27, 2024 · 2 min · long8v

[153] Contrastive Explanations for Model Interpretability

April 1, 2024 · 2 min · long8v

[148] I Can't Believe There's No Images! Learning Visual Tasks Using only Language Supervision

February 11, 2024 · 2 min · long8v

[145] CLIPScore: A Reference-free Evaluation Metric for Image Captioning

February 5, 2024 · 2 min · long8v