[171] CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs

2024๋…„ 8์›” 30์ผ ยท 2 ๋ถ„ ยท long8v ยท 

[77] Interpretable Image Classification with Differentiable Prototype Assignment

2022๋…„ 11์›” 9์ผ ยท 2 ๋ถ„ ยท long8v ยท 

[75] SESS: Saliency Enhancing with Scaling and Sliding

2022๋…„ 11์›” 8์ผ ยท 2 ๋ถ„ ยท long8v ยท 

[76] Long-tail Detection with Effective Class-Margins

2022๋…„ 11์›” 8์ผ ยท 2 ๋ถ„ ยท long8v ยท 

[74] โ€œThis is my unicorn, Fluffyโ€: Personalizing frozen vision-language representations

2022๋…„ 11์›” 4์ผ ยท 2 ๋ถ„ ยท long8v ยท 

[65] Margin Calibration for Long-Tailed Visual Recognition

2022๋…„ 9์›” 19์ผ ยท 1 ๋ถ„ ยท long8v ยท 

[64] Open-Vocabulary DETR with Conditional Matching

2022๋…„ 9์›” 16์ผ ยท 1 ๋ถ„ ยท long8v ยท 

[62] What to Hide from Your Students: Attention-Guided Masked Image Modeling

2022๋…„ 9์›” 6์ผ ยท 1 ๋ถ„ ยท long8v ยท 

[37] Relationformer: A Unified Framework for Image-to-Graph Generation

2022๋…„ 7์›” 21์ผ ยท 2 ๋ถ„ ยท long8v ยท