[110] Understanding the Role of Self Attention for Efficient Speech Recognition2022Q1 ICLR 25min transformer
[108] Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language Structures via Dependency Relationships2022Q1 dataset CVPR graph
[37] Relationformer: A Unified Framework for Image-to-Graph Generation2022Q1 SGG graph one-stage ECCV