image

paper

TL;DR

  • task : dense object detection with localization score
  • problem : ๊ธฐ์กด์˜ localization score๋ฅผ ๋ฝ‘๋Š” work๋“ค์€ 1) ํ•™์Šตํ•  ๋•Œ IoU score ๋ธŒ๋žœ์น˜๋ฅผ ๋”ฐ๋กœ ํ•™์Šตํ•˜๋‹ค๊ฐ€ inferํ•  ๋•Œ ๊ฒฐํ•ฉํ•ด์„œ train - infer ๊ฐ„์˜ ๊ดด๋ฆฌ๊ฐ€ ์ƒ๊ฒผ๊ณ  2) localization quality๊ฐ€ positive์—๋งŒ ๋ถ€๊ณผ๋˜์–ด์„œ negative sample๋„ IoU score๊ฐ€ ๋งค์šฐ ๋†’๊ฒŒ ๋‚˜์˜ค๋Š” ๊ฒฝ์šฐ๊ฐ€ ์žˆ์—ˆ์œผ๋ฉฐ 3) bbox ๋ถ„ํฌ์— ๋Œ€ํ•œ ๊ฐ€์ •์ด Dirac-Delta์ด๊ฑฐ๋‚˜ gaussian์œผ๋กœ ๋„ˆ๋ฌด ๋‹จ์ˆœํ–ˆ๋‹ค.
  • idea : ํ•™์Šต ๋•Œ category์™€ IoU score๋ฅผ ๊ฒฐํ•ฉํ•˜์—ฌ smoothํ•œ target์„ ์ฃผ์–ด ํ•™์Šต-์ถ”๋ก  ๊ดด๋ฆฌ๋ฅผ ์—†์• ๊ณ  bbox์— ๋Œ€ํ•œ ๋ถ„ํฌ๋„ ํ•™์Šตํ•˜๊ฒŒ ํ•˜์—ฌ ๋ถ„ํฌ์˜ ๊ฐ•ํ•œ ์ œ์•ฝ์„ ์—†์• ์ž
  • architecture : ResNet with FPN + ???
  • objective : 1) focal loss์˜ $(1-p_t)^\gamma$ ๋Œ€์‹ ์— target ๊ณผ์˜ ๊ฑฐ๋ฆฌ term์ธ $|y-\sigma|^\beta$๋ฅผ ๊ณฑํ•ด์ฃผ๊ณ  2) ํ•™์Šตํ•œ discrete ๋ถ„ํฌ์— ๋Œ€ํ•œ ๊ฐ’๋„ ๋ฐ˜์˜ํ•ด์คŒ => Generalized Focal Loss
  • baseline : w/o quality branch, IoU branch, centerness-guided, IoU guided
  • data : COCO
  • result : quality branch ์—†๋Š” ๊ฒƒ๋ณด๋‹จ ์„ฑ๋Šฅ์ด ์ข‹๊ณ  ๋‚˜๋จธ์ง€๋Š” IoU-branch์— ๋’ค์ง€๋Š”๊ฒŒ ๋ช‡๊ฐœ ์žˆ๋Š”๋ฐ ๋Œ€๋ถ€๋ถ„ ๊ฐœ์„ 
  • contribution :
  • limitation or ์ดํ•ด ์•ˆ๋˜๋Š” ๋ถ€๋ถ„ : Distribution Focal Loss ๋ถ€๋ถ„์€ ์ž˜ ์ดํ•ด ์•ˆ๋˜๊ณ  ์—ฌ๊ธฐ์„œ ์‹คํ—˜ํ•œ ์•„ํ‚คํ…์ณ๊ฐ€ ์–ด๋–ป๊ฒŒ ๋˜๋Š”์ง€๋„ ์ž˜ ๋ชจ๋ฅด๊ฒ ์Œ. ๊ทธ๋ƒฅ ResNet + FPN ์—๋‹ค๊ฐ€ ๋ชจ๋“  ํ”ฝ์…€์— bbox ์˜ˆ์ธกํ•˜๋Š”๊ฑด๊ฐ€? ATSS๋Š” ๋ฌด์—‡์ธ๊ฐ€

Details

๊ธฐ์กด ๋ฐฉ๋ฒ•์˜ ๋ฌธ์ œ์ 

image

image

Generalized Focal Loss์˜ ์ฃผ์š” ์•„์ด๋””์–ด

image

  • focal loss image

  • quality focal loss image

  • distribution focal loss image

image

  • generalized focal loss image

Result

image