[195] STaR: Self-Taught Reasoner Bootstrapping Reasoning With Reasoning

January 9, 2025 · 1 min · long8v

[120] Large-scale Bilingual Language-Image Contrastive Learning

June 19, 2023 · 3 min · long8v

[110] Understanding the Role of Self Attention for Efficient Speech Recognition

April 17, 2023 · 2 min · long8v

[108] Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language Structures via Dependency Relationships

April 4, 2023 · 3 min · long8v

[96] Vision GNN: An Image is Worth Graph of Nodes

January 5, 2023 · 3 min · long8v

[64] Open-Vocabulary DETR with Conditional Matching

September 16, 2022 · 1 min · long8v

[62] What to Hide from Your Students: Attention-Guided Masked Image Modeling

September 6, 2022 · 1 min · long8v

[37] Relationformer: A Unified Framework for Image-to-Graph Generation

July 21, 2022 · 2 min · long8v

RelTR code reading

July 21, 2022 · 1 min · long8v

[36] SGTR: End-to-end Scene Graph Generation with Transformer

July 19, 2022 · 2 min · long8v

[35] RelTR: Relation Transformer for Scene Graph Generation

July 18, 2022 · 4 min · long8v

MoEBERT code reading

May 23, 2022 · 1 min · long8v

[21] cosFormer: Rethinking Softmax in Attention

April 20, 2022 · 2 min · long8v

[20] Memorizing Transformers

April 7, 2022 · 3 min · long8v

[15] Quantifying Memorization Across Neural Language Models

March 24, 2022 · 3 min · long8v