[174] Evaluations for Object Hallucinations

2024λ…„ 9μ›” 2일 Β· 2 λΆ„ Β· long8v Β· 

[165] Rich Human Feedback for Text-to-Image Generation

2024λ…„ 7μ›” 19일 Β· 2 λΆ„ Β· long8v Β· 

[163] What You See is What You Read? Improving Text-Image Alignment Evaluation

2024λ…„ 7μ›” 18일 Β· 1 λΆ„ Β· long8v Β· 

[164] TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

2024λ…„ 7μ›” 18일 Β· 1 λΆ„ Β· long8v Β· 

[160] ALOHa: A New Measure for Hallucination in Captioning Models

2024λ…„ 6μ›” 15일 Β· 2 λΆ„ Β· long8v Β· 

[155] Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings

2024λ…„ 5μ›” 3일 Β· 1 λΆ„ Β· long8v Β· 

[154] Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment

2024λ…„ 4μ›” 3일 Β· 2 λΆ„ Β· long8v Β· 

[151] FOIL it! Find One mismatch between Image and Language caption

2024λ…„ 3μ›” 3일 Β· 2 λΆ„ Β· long8v Β· 

[145] CLIPScore: A Reference-free Evaluation Metric for Image Captioning

2024λ…„ 2μ›” 5일 Β· 2 λΆ„ Β· long8v Β· 

[139] Davidsonian Scene Graph: Improving Reliability in Fine-Grained Evaluation for Text-to-Image Generation

2023λ…„ 12μ›” 11일 Β· 2 λΆ„ Β· long8v Β·