[171] CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs

August 30, 2024 · 2 min · long8v

[77] Interpretable Image Classification with Differentiable Prototype Assignment

November 9, 2022 · 3 min · long8v

[75] SESS: Saliency Enhancing with Scaling and Sliding

November 8, 2022 · 2 min · long8v

[76] Long-tail Detection with Effective Class-Margins

November 8, 2022 · 2 min · long8v

[74] “This is my unicorn, Fluffy”: Personalizing frozen vision-language representations

November 4, 2022 · 2 min · long8v

[65] Margin Calibration for Long-Tailed Visual Recognition

September 19, 2022 · 2 min · long8v

[62] What to Hide from Your Students: Attention-Guided Masked Image Modeling

September 6, 2022 · 1 min · long8v

[37] Relationformer: A Unified Framework for Image-to-Graph Generation

July 21, 2022 · 2 min · long8v