image

paper , code

TL;DR

  • task : representation learning / generative model
  • problem : ๋ฐ์ดํ„ฐ์˜ ์ค‘์š”ํ•œ ๋ถ„ํฌ๋ฅผ ์ž˜ ์„ค๋ช…ํ•˜๋Š” representation์„ ๋งŒ๋“ค๊ณ  ์‹ถ์€๋ฐ, ์ข‹์€ representation์ด๋ž€ ๋ชจ๋ธ๋งํ•˜๊ธฐ ์‰ฌ์›Œ์•ผ ํ•˜๊ณ , factorize ๊ฐ€๋Šฅ ํ•ด์•ผํ•œ๋‹ค.
  • idea : change of variable rule์„ ์‚ฌ์šฉํ•˜์—ฌ ์–ด๋–ค transformation h=f(x)๋ฅผ ์—ญํ•จ์ˆ˜ x=f^(-1)(h)๋กœ ๋งŒ๋“ค์–ด์„œ ๋ฐ์ดํ„ฐ x๋ฅผ ํ‘œํ˜„ํ•ด๋ณด๋„๋ก ํ•˜์ž
  • architecture : hidden layer๋ฅผ ๋ฐ˜๊ฐˆ ํ•˜๊ณ  ์ฒซ๋ฒˆ์งธ ๋ฐ˜์„ mlp, ๋‚˜๋จธ์ง€ ๋ฐ˜์€ mlpํ•œ ์ฒซ๋ฒˆ์งธ ๋ฐ˜๊ณผ ๋ฐ”๋กœ ํ•ฉ, ์ด๋Ÿฐ ๋ณ€ํ™˜์„ additive coupling layer๋ผ๊ณ  ํ•˜๊ณ  ์ด mlp ํ•˜๋Š” ๋ฐ˜์„ ๋ ˆ์ด์–ด๋งˆ๋‹ค ๋ฒˆ๊ฐˆ์•„๊ฐ€๋ฉด์„œ ํ•จ.
  • objective : log-likelihood
  • baseline : Deep MFA, GRBM
  • data : MNIST, Toronto Face Dataset(TFD), Street View House Numbers dataset(SVHN), CIFAR-10
  • result : ๋†’์€ likelihood. h ์ƒ˜ํ”Œ ๋ฝ‘๊ณ  inverse ํ•จ์ˆ˜์— ๋„ฃ์œผ๋ฉด ์ƒ์„ฑ์€ ๋จ.
  • contribution : flow based model๋“ค ์ค‘ ์„ ํ–‰์—ฐ๊ตฌ์ธ๋“ฏ
  • limitation or ์ดํ•ด ์•ˆ๋˜๋Š” ๋ถ€๋ถ„ :

Details

notion