[189] Training Verifiers to Solve Math Word Problems

December 9, 2024 · 1 min · long8v

[158] A Mathematical Framework for Transformer Circuits

May 9, 2024 · 4 min · long8v

[124] LiT: Zero-Shot Transfer with Locked-image text Tuning

July 6, 2023 · 4 min · long8v

[90] Neural Collaborative Graph Machines for Table Structure Recognition

December 22, 2022 · 1 min · long8v

[87] Bipartite Graph Network with Adaptive Message Passing for Unbiased Scene Graph Generation

December 8, 2022 · 2 min · long8v

[65] Margin Calibration for Long-Tailed Visual Recognition

September 19, 2022 · 2 min · long8v

[63] Masked Autoencoders Are Scalable Vision Learners

September 7, 2022 · 2 min · long8v

[58] MetaFormer Is Actually What You Need for Vision

August 31, 2022 · 1 min · long8v

[45] BGT-Net: Bidirectional GRU Transformer Network for Scene Graph Generation

August 3, 2022 · 1 min · long8v

[44] Context-Aware Scene Graph Generation With Seq2Seq Transformers

August 2, 2022 · 3 min · long8v

[16] Counterfactual Memorization in Neural Language Models

March 25, 2022 · 3 min · long8v

[7] SLIP: Self-supervision meets Language-Image Pre-training

January 20, 2022 · 1 min · long8v

[6] Crossing the Format Boundary of Text and Boxes: Towards Unified Vision-Language Modeling

January 18, 2022 · 1 min · long8v

[2] ELSA: Enhanced Local Self-Attention for Vision Transformer

January 7, 2022 · 1 min · long8v

[1] Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

January 5, 2022 · 1 min · long8v