image

paper

TL;DR

  • task : instance segmentation
  • problem : ์„ธ๊ทธ๋ฉ˜ํ…Œ์ด์…˜ ์–ด๋…ธํ…Œ์ด์…˜ ๋น„์šฉ ๋„ˆ๋ฌด ์„ธ๋‹ค! weakly-supervised๋Š” supervised์˜ 85%์ •๋„ ๋ฐ–์— ์„ฑ๋Šฅ์ด ์•ˆ๋‚˜์˜จ๋‹ค
  • idea : point level์˜ ์–ด๋…ธํ…Œ์ด์…˜์„ ํ•˜์ž! bbox๋ฅผ ๋จผ์ € ์–ด๋…ธํ…Œ์ด์…˜์„ ํ•˜๊ณ  ๊ทธ ์ค‘์— ๋žœ๋ค 10๊ฐœ์˜ ์ ์„ ์ฐ์–ด์„œ ์–ด๋…ธํ…Œ์ดํ„ฐ๊ฐ€ ์ด๊ฒŒ background์ธ์ง€ object์ธ์ง€ binary ๋ ˆ์ด๋ธ”๋ง์„ ํ•จ.
  • architecture : mask RCNN
  • objective : 10๊ฐœ์˜ ์ ์— ๋Œ€ํ•ด์„œ ๋‚˜์˜จ prediction์— ๋Œ€ํ•ด bi-linear interpolate๋ฅผ ํ•œ ๋’ค cross entropy loss
  • baseline : fully supervised mask RCNN
  • data : ImageNet, COCO
  • result : ImageNet์€ supervised์˜ 97% ์ •๋„ ์„ฑ๋Šฅ, COCO๋Š” 99% ์„ฑ๋Šฅ
  • contribution : ์›๋ž˜ ์„ธ๊ทธ๋ฉ˜ํ…Œ์ด์…˜์„ ํ•˜๋Š”๋ฐ ๊ฐœ๋‹น 79์ดˆ ์ •๋„ ๊ฑธ๋ฆฌ๋Š”๋ฐ ์ด ๋ฐฉ๋ฒ•๋ก ์œผ๋กœ๋Š” 7์ดˆ๋ฉด ์–ด๋…ธํ…Œ์ด์…˜ ๊ฐ€๋Šฅ.
  • limitation or ์ดํ•ด ์•ˆ๋˜๋Š” ๋ถ€๋ถ„ : PointRend model ๋ถ€๋ถ„ ์•ˆ ์ฝ์Œ

Details

image

image

  • augmentation ๋ณดํ†ต ์‚ฌ์šฉํ•˜๋Š” ์ด๋ฏธ์ง€ ์–ด๊ทธ๋ฉ˜ํ…Œ์ด์…˜ ์‚ฌ์šฉ + ํ•™์Šต epoch ๋•Œ๋งˆ๋‹ค 10 ๊ฐœ์ค‘ 5๊ฐœ ๋žœ๋ค์ƒ˜ํ”Œ๋งํ•ด์„œ ๊ทธ๊ฒƒ๋งŒ ์‚ฌ์šฉํ•ด์„œ ํ•™์Šต.

  • dice loss์™€ IoU์˜ ์ฐจ์ด https://stackoverflow.com/questions/60268728/why-dice-coefficient-and-not-iou-for-segmentation-tasks image

segmentation์—๋Š” dice, object detection์—๋Š” iou์“ฐ๋Š” ๋“ฏ. ๋”ฑํžˆ ๊ทธ ์ด์œ ๋Š” ์—†๋Š”๋“ฏ?