image

paper

TL;DR

  • I read this because.. : ResNet50๊ณผ 101์˜ ์ฐจ์ด๋ฅผ ๋ชจ๋ฆ„ ^^
  • task : image classification, object detection
  • problem : ๋ ˆ์ด์–ด๊ฐ€ ๋‚ฎ์€ ๋„คํŠธ์›Œํฌ๊ฐ€ ์žˆ๊ณ  ๊ฑฐ๊ธฐ์— identity mapping๋งŒ ์ถ”๊ฐ€ํ•œ ๊นŠ์€ ๋„คํŠธ์›Œํฌ๊ฐ€ ์žˆ์„ ๋•Œ ์‚ฌ์‹ค์ƒ ๊ฐ™์€ ๋„คํŠธ์›Œํฌ์ธ๋ฐ๋„ ๋ถˆ๊ตฌํ•˜๊ณ  ๊นŠ์€ ๋„คํŠธ์›Œํฌ์˜ training error๊ฐ€ ๋” ๋†’์€ ํ˜„์ƒ. ์ฆ‰ ๊นŠ์„ ์ˆ˜๋ก ํ•™์Šต์ด ๋ถˆ์•ˆ์ •ํ•˜๊ฒŒ ์ตœ์ ํ•ด๋ฅผ ์ฐพ์Œ.
  • idea : residual connection. f(x) + x๋ฅผ ํ•˜์ž. ์ด๋ ‡๊ฒŒ ๋˜๋ฉด ๊นŠ์€ ๋ ˆ์ด์–ด๊ฐ€ ํ•„์š”์—†์œผ๋ฉด f(x)=0์ด ๋˜์–ด์„œ identity mapping์„ ํ•˜๋Š” ๊ฒƒ๊ณผ ๊ฐ™์€ ์—ญํ• ์„ ํ•  ๊ฒƒ.
  • architecture : VGG์˜ ์›์น™์„ ๋”ฐ๋ผ 1) ๋งค ๋ ˆ์ด์–ด์˜ ํ•„ํ„ฐ ๊ฐœ์ˆ˜๋ฅผ ๊ฐ™๊ฒŒ ์„ค์ • 2) feature mapํฌ๊ธฐ๊ฐ€ ๋ฐ˜์œผ๋กœ ์ค„๋ฉด filter ๊ฐœ์ˆ˜๋ฅผ ๋‘๋ฐฐ๋กœ ํ–ˆ์ง€๋งŒ ํ•„ํ„ฐ ๊ฐœ์ˆ˜๊ฐ€ VGG๋ณด๋‹ค ์ž‘์€ ๋Œ€์‹  ๋” ๊นŠ๊ฒŒ ์Œ“์•„์„œ ํŒŒ๋ผ๋ฏธํ„ฐ์ˆ˜๋‚˜ FLOPS๋Š” VGG๋ณด๋‹ค ๋‚ฎ์Œ.
  • objective : CE loss for classification, object detection loss
  • baseline : VGG-16, GoogLeNet, plain(ResNet์— residual connection ๋บ€๊ฑฐ)
  • data : CIFAR-10, COCO 2015m ILSVRC 2015
  • evaluation : accuracy, mAP, # params, FLOPS
  • result : ์ด๋ฏธ์ง€ ๋ถ„๋ฅ˜์—์„œ sota. object detection์—์„œ ์„ฑ๋Šฅ 28% ๊ฐœ์„ 
  • contribution : residual connection

Details

Motivation

image

degration์ด๋ผ๋Š” ํ˜„์ƒ. ๊นŠ์œผ๋ฉด training error๊ฐ€ ๋” ๋†’์Œ. ์ฆ‰ overfitting์ด ๋ฌธ์ œ๊ฐ€ ์•„๋‹ˆ๋ผ ํ•™์Šต ์ž์ฒด๊ฐ€ ์ž˜ ์•ˆ๋œ ์ƒํ™ฉ

Residual learning

image

residualํ•˜๋Š” block์€ ์ตœ์†Œ 2๊ฐœ ์ด์ƒ์ด์–ด์•ผ(1๊ฐœ๋ฉด ๊ทธ๋ƒฅ linearํ•˜๋Š” ํšจ๊ณผ), ์ฐจ์›๋„ ๊ฐ™์•„์•ผ ํ•จ.

Network architecture

image

Network variants

image

๊ถ๊ธˆ์ฆ ํ•ด๊ฒฐ ^^ 101๊ฐœ ๋ ˆ์ด์–ด ์Œ“์€ ๊ฒƒ์ž„

training error on ImageNet

image

๊ธฐํƒ€

์ดˆ๊ธฐ ๋…ผ๋ฌธ๋“ค ์ฝ์œผ๋ฉด ์žฌ๋ฐŒ์„ ๋“ฏ