2022Q3 | 🍎 Paper Today I Read 🦔

[162] CALIP: Zero-Shot Enhancement of CLIP with Parameter-free Attention

AAAI 2022Q3 25min CLIP

[128] Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding

ICML google 2022Q3 document

[123] Robust fine-tuning of zero-shot models

openAI google CVPR 2022Q3 CLIP domainshift

[98] Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection

NeurIPS object detection 2022Q3 CLIP

[77] Interpretable Image Classification with Differentiable Prototype Assignment

2022Q3 ECCV XAI

[75] SESS: Saliency Enhancing with Scaling and Sliding

2022Q3 25min ECCV XAI

[76] Long-tail Detection with Effective Class-Margins

2022Q3 imbalance ECCV

[74] “This is my unicorn, Fluffy”: Personalizing frozen vision-language representations

dataset 2022Q3 25min ECCV nvidia CLIP

[68] Iterative Scene Graph Generation

SGG 2022Q3 one-stage

[60] Efficient Sparsely Activated Transformers

MoE 2022Q3 25min AutoML

[54] Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models

LM MoE 2022Q3 25min

[41] Panoptic Scene Graph Generation

dataset SGG 2022Q3 25min

[42] DETRs with Hybrid Matching

object detection 2022Q3 25min DETR