[182] Calibrated Self-Rewarding Vision Language Models

October 10, 2024 · 2 min · long8v · 

[178] RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness

September 23, 2024 · 3 min · long8v · 

[172] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

August 30, 2024 · 2 min · long8v · 

[170] Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback

August 27, 2024 · 2 min · long8v · 

[160] ALOHa: A New Measure for Hallucination in Captioning Models

June 15, 2024 · 2 min · long8v · 

[157] LeGrad: An Explainability Method for Vision Transformers via Feature Formation Sensitivity

May 6, 2024 · 2 min · long8v · 

feat: add LeGrad

May 6, 2024 · 1 min · long8v · 

[155] Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings

May 3, 2024 · 2 min · long8v · 

[154] Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment

April 3, 2024 · 2 min · long8v ·