[172] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

August 30, 2024 ยท 2 min ยท long8v ยท 

[165] Rich Human Feedback for Text-to-Image Generation

July 19, 2024 ยท 2 min ยท long8v ยท 

[146] Transformer Interpretability Beyond Attention Visualization

February 6, 2024 ยท 4 min ยท long8v ยท 

[131] Re-labeling ImageNet: from Single to Multi-Labels, from Global to Localized Labels

September 13, 2023 ยท 2 min ยท long8v ยท 

[125] RILS: Masked Visual Reconstruction in Language Semantic Space

August 2, 2023 ยท 2 min ยท long8v ยท 

feat: add sparse rcnn

July 24, 2023 ยท 1 min ยท long8v ยท 

[108] Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language Structures via Dependency Relationships

April 4, 2023 ยท 4 min ยท long8v ยท 

[90] Neural Collaborative Graph Machines for Table Structure Recognition

December 22, 2022 ยท 1 min ยท long8v ยท 

[87] Bipartite Graph Network with Adaptive Message Passing for Unbiased Scene Graph Generation

December 8, 2022 ยท 2 min ยท long8v ยท 

[51] Structured Sparse R-CNN for Direct Scene Graph Generation

August 19, 2022 ยท 3 min ยท long8v ยท 

[36] SGTR: End-to-end Scene Graph Generation with Transformer

July 19, 2022 ยท 3 min ยท long8v ยท 

[28] Learning to Compare: Relation Network for Few-Shot Learning

May 31, 2022 ยท 2 min ยท long8v ยท