[195] STaR: Self-Taught Reasoner Bootstrapping Reasoning With Reasoning

January 9, 2025 ยท 1 min ยท long8v ยท 

[120] Large-scale Bilingual Language-Image Contrastive Learning

June 19, 2023 ยท 3 min ยท long8v ยท 

[110] Understanding the Role of Self Attention for Efficient Speech Recognition

April 17, 2023 ยท 2 min ยท long8v ยท 

[108] Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language Structures via Dependency Relationships

April 4, 2023 ยท 4 min ยท long8v ยท 

[96] Vision GNN: An Image is Worth Graph of Nodes

January 5, 2023 ยท 3 min ยท long8v ยท 

[64] Open-Vocabulary DETR with Conditional Matching

September 16, 2022 ยท 2 min ยท long8v ยท 

[62] What to Hide from Your Students: Attention-Guided Masked Image Modeling

September 6, 2022 ยท 1 min ยท long8v ยท 

[37] Relationformer: A Unified Framework for Image-to-Graph Generation

July 21, 2022 ยท 2 min ยท long8v ยท 

RelTR code reading

July 21, 2022 ยท 1 min ยท long8v ยท 

[36] SGTR: End-to-end Scene Graph Generation with Transformer

July 19, 2022 ยท 3 min ยท long8v ยท 

[35] RelTR: Relation Transformer for Scene Graph Generation

July 18, 2022 ยท 5 min ยท long8v ยท 

MoEBERT code reading

May 23, 2022 ยท 1 min ยท long8v ยท 

[21] cosFormer: Rethinking Softmax in Attention

April 20, 2022 ยท 3 min ยท long8v ยท 

[15] Quantifying Memorization Across Neural Language Models

March 24, 2022 ยท 3 min ยท long8v ยท