image arxiv , code image

problem : Self-supervised-Learning์ด CLIP ๋ชจ๋ธ ๊ตฌ์กฐ์—์„œ๋„ ์ž˜ ์ž‘๋™ํ•˜๋Š”์ง€ ์‹คํ—˜ solution : ์ด๋ฏธ์ง€์™€ ํ…์ŠคํŠธ์— ๋Œ€ํ•ด์„œ๋Š” contrastive learning ์„ ํ•˜๊ณ , ์ด๋ฏธ์ง€์— ๋Œ€ํ•ด์„œ๋Š” self-supervision learning ์„ ํ•˜์—ฌ ๋กœ์Šค๋ฅผ ํ•ฉํ•˜์—ฌ ํ•™์Šต. result : linear prediction(=representation์— FCN ๋ถ™์—ฌ์„œ ๋ถ„๋ฅ˜ํ•˜๋Š”๊ฒƒ. linear๋งŒ ํ•™์Šตํ•จ. BERT์—์„œ ์–ธ๊ธ‰ํ•˜๋Š” ‘feature-based learning’), zero-shot learning, end-to-end finetuning์—์„œ SOTA