[174] Evaluations for Object Hallucinations

2024๋…„ 9์›” 2์ผ ยท 2 ๋ถ„ ยท long8v ยท 

[165] Rich Human Feedback for Text-to-Image Generation

2024๋…„ 7์›” 19์ผ ยท 2 ๋ถ„ ยท long8v ยท 

[163] What You See is What You Read? Improving Text-Image Alignment Evaluation

2024๋…„ 7์›” 18์ผ ยท 1 ๋ถ„ ยท long8v ยท 

[164] TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

2024๋…„ 7์›” 18์ผ ยท 1 ๋ถ„ ยท long8v ยท 

[160] ALOHa: A New Measure for Hallucination in Captioning Models

2024๋…„ 6์›” 15์ผ ยท 2 ๋ถ„ ยท long8v ยท 

[155] Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings

2024๋…„ 5์›” 3์ผ ยท 1 ๋ถ„ ยท long8v ยท 

[154] Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment

2024๋…„ 4์›” 3์ผ ยท 2 ๋ถ„ ยท long8v ยท 

[151] FOIL it! Find One mismatch between Image and Language caption

2024๋…„ 3์›” 3์ผ ยท 2 ๋ถ„ ยท long8v ยท 

[145] CLIPScore: A Reference-free Evaluation Metric for Image Captioning

2024๋…„ 2์›” 5์ผ ยท 2 ๋ถ„ ยท long8v ยท 

[139] Davidsonian Scene Graph: Improving Reliability in Fine-Grained Evaluation for Text-to-Image Generation

2023๋…„ 12์›” 11์ผ ยท 2 ๋ถ„ ยท long8v ยท