๐Ÿ˜Ž ๊ณต๋ถ€ํ•˜๋Š” ์ง•์ง•์•ŒํŒŒ์นด๋Š” ์ฒ˜์Œ์ด์ง€?

Self-supervised Learning(์ž๊ธฐ์ฃผ๋„ํ•™์Šต) ์™€ Supervised Contrastive Learing ๋ณธ๋ฌธ

๐Ÿ‘ฉ‍๐Ÿ’ป ์ธ๊ณต์ง€๋Šฅ (ML & DL)/ML & DL

Self-supervised Learning(์ž๊ธฐ์ฃผ๋„ํ•™์Šต) ์™€ Supervised Contrastive Learing

์ง•์ง•์•ŒํŒŒ์นด 2022. 11. 23. 13:52
728x90
๋ฐ˜์‘ํ˜•

<๋ณธ ๋ธ”๋กœ๊ทธ๋Š” daeun-computer-uneasy ๋‹˜์˜ ๋ธ”๋กœ๊ทธ๋ฅผ ์ฐธ๊ณ ํ•ด์„œ ๊ณต๋ถ€ํ•˜๋ฉฐ ์ž‘์„ฑํ•˜์˜€์Šต๋‹ˆ๋‹ค :-)>

https://daeun-computer-uneasy.tistory.com/37

 

[CV] Self-supervised learning(์ž๊ธฐ์ฃผ๋„ํ•™์Šต)๊ณผ Contrastive learning - ์Šค์Šค๋กœ ํ•™์Šตํ•˜๋Š” ์•Œ๊ณ ๋ฆฌ์ฆ˜

์˜ค๋Š˜์€ Self-supervised learning(์ž๊ธฐ์ฃผ๋„ํ•™์Šต)๊ณผ ์ฃผ๋œ ํ•™์Šต ๋ฐฉ๋ฒ•์ธ Contrastive learning์— ๋Œ€ํ•ด ํฌ์ŠคํŒ…ํ•˜๊ฒ ์Šต๋‹ˆ๋‹ค. ๋จผ์ € Self supervised learning์ด ์™œ ํ•„์š”ํ•œ์ง€๋ถ€ํ„ฐ ์‚ดํŽด๋ณด๊ฒ ์Šต๋‹ˆ๋‹ค. Self-supervised learning์˜ ํ•„์š”

daeun-computer-uneasy.tistory.com

 

 

๐ŸŒฐ  ์ž๊ธฐ์ฃผ๋„ํ•™์Šต(Self-supervised learning)

๋”ฅ๋Ÿฌ๋‹ ํ•™์Šต์—๋Š” ์ถฉ๋ถ„ํ•œ ์–‘์งˆ์˜ ๋ฐ์ดํ„ฐ๊ฐ€ ํ•„์š”

์ด๋Ÿฌํ•œ ๋ฐ์ดํ„ฐ๋“ค์˜ ์ง€๋„ํ•™์Šต์„ ์œ„ํ•ด์„œ๋Š” ๋ผ๋ฒจ๋ง ๊ณผ์ •์ด ํ•„์ˆ˜์ 

 '์ž๊ธฐ์ฃผ๋„ํ•™์Šต(Self-supervised learning)'์€ ๋น„์ง€๋„ํ•™์Šต์˜ ํ•œ ๋ถ„์•ผ์— ์†ํ•˜๋Š” ๋ฐฉ๋ฒ•

์Šค์Šค๋กœ Supervision์„ ์ฃผ๋Š” ๋ฐฉ๋ฒ• -> ์ „ํ˜€ ๋ผ๋ฒจ๋ง ๋˜์–ด์žˆ์ง€ ์•Š์€ ๋ฐ์ดํ„ฐ๋กœ ํ•™์Šต์„ ์ง„ํ–‰

 

 

๐Ÿฅ” Pretext task

์‚ฌ์šฉ์ž๊ฐ€ ์ƒˆ๋กœ์šด ๋ฌธ์ œ๋ฅผ ์ •์˜ํ•œ๋‹ค๋Š” ๊ฒƒ

pretext task๋ฅผ ํ•™์Šตํ•จ์œผ๋กœ์จ ๋ชจ๋ธ์€ ๋ฐ์ดํ„ฐ '์ž์ฒด'์— ๋Œ€ํ•œ ์ดํ•ด๋ฅผ ๋†’์ผ ์ˆ˜ ์žˆ๊ฒŒ ๋จ

 

Pretext task๊ฐ€ ์ œ์•ˆ๋œ ์ตœ์ดˆ์˜ ๋…ผ๋ฌธ Exemplar

๋ถ‰์€์ƒ‰์œผ๋กœ ํ‘œ์‹œ๋˜์–ด ์žˆ๋Š” ๋ถ€๋ถ„์ด ์›๋ž˜์˜ ์›๋ณธ image(Seed patch)

๋‚˜๋จธ์ง€๋Š” ์ผ๋ฐ˜์ ์œผ๋กœ ์‚ฌ์šฉํ•˜๋Š” augmentation ๋ฐฉ๋ฒ•์„ ์ด์šฉํ•˜์—ฌ crop, ์ƒ‰์ƒ๋ณ€ํ™˜ ๋“ฑ์˜ ๊ณผ์ •์„ ๊ฑฐ์นœ ๊ฒƒ

 

augmentation์œผ๋กœ ์ƒ์„ฑ๋œ image๋“ค์„ ๋ชจ๋‘ ๊ฐ™์€ class๋กœ ๋ถ„๋ฅ˜ํ•˜๋„๋ก ๋ถ„๋ฅ˜๊ธฐ๋ฅผ ํ•™์Šต ์‹œํ‚ด

 

๐Ÿ“› BUT

image ํ•˜๋‚˜ํ•˜๋‚˜๊ฐ€ ํ•˜๋‚˜์˜ class๋ฅผ ๋ถ€์—ฌ๋ฐ›๊ฒŒ๋˜๋‹ˆ, ํฐ ๋ฐ์ดํ„ฐ์…‹์— ์ ์šฉํ•˜๊ธฐ์—๋Š” ๋ฌด๋ฆฌ๊ฐ€ ์žˆ์„ ๊ฒƒ

๋ฐ์ดํ„ฐ ๊ฐœ์ˆ˜๊ฐ€ class ๊ฐœ์ˆ˜๊ฐ€ ๋˜๋Š” ํ˜„์ƒ์ด ๋ฐœ์ƒํ•˜๋Š” ๊ฒƒ

 

 

๐Ÿฅ” Contrastive learning

์น˜ํƒ€๋ฅผ Input image๋กœ ๋„ฃ์—ˆ๋Š”๋ฐ ์น˜ํƒ€, ๋ž˜์˜คํŒŒ๋“œ, ์žฌ๊ทœ์–ด ๋“ฑ์— ์œ ์‚ฌํ•œ feature๊ฐ€ ์ž‘์šฉํ•˜๊ณ  ์žˆ์Œ์„ ํ™•์ธ

์ž˜ ์ถ”์ถœ๋œ ํŠน์ง•๊ฐ’์€ instance๊ฐ„์˜ ์œ ์‚ฌ๋„ ์ •๋ณด๋ฅผ ๊ฐ€์ง€๊ณ  ์žˆ์„ ๊ฒƒ์ด๋ผ๋Š” ๊ฐ€์ •์œผ๋กœ ์‹œ์ž‘

 

Contrastive Learning์€ Positive pair์™€ Negative pair๋กœ ๊ตฌ์„ฑ

Positive pair ๋ผ๋ฆฌ๋Š” ๊ฑฐ๋ฆฌ๋ฅผ ์ขํžˆ๊ณ , Negative pair ๋ผ๋ฆฌ๋Š” ๊ฑฐ๋ฆฌ๋ฅผ ๋ฉ€๋ฆฌ ๋„์›Œ๋†“๋Š” ๊ฒƒ์ด ํ•™์Šต ์›๋ฆฌ

=> Contrastive Learning์€ ๊ฐ™์€ image์— ์„œ๋กœ ๋‹ค๋ฅธ augmentation์„ ๊ฐ€ํ•œ ๋’ค, ๋‘ positive pair์˜ feature representation์€ ๊ฑฐ๋ฆฌ๊ฐ€ ๊ฐ€๊นŒ์›Œ ์ง€๋„๋ก(์œ ์‚ฌํ•ด์ง€๋„๋ก) ํ•™์Šต

=> ๋‹ค๋ฅธ image์— ์„œ๋กœ ๋‹ค๋ฅธ augmentation์„ ๊ฐ€ํ•œ ๋’ค, ๋‘ negative pair์˜ feature representation์€ ๊ฑฐ๋ฆฌ๊ฐ€ ๋ฉ€์–ด์ง€๋„๋ก ํ•™์Šต

๊ฐ•์•„์ง€์™€ ์˜์ž์˜ ์ด๋ฏธ์ง€๋ฅผ ๊ฐ๊ฐ augmentationํ•˜๊ณ , ์ด augmentation ๋œ ๊ฒƒ๋ผ๋ฆฌ๋Š” ๊ฐ๊ฐ positive pairs๋กœ ๊ฑฐ๋ฆฌ๋ฅผ ์ขํ˜€์ฃผ๋„๋ก ํ•™์Šต

๊ฐ•์•„์ง€์™€ ์˜์ž๋Š” ๊ตฌ๋ถ„ํ•ด์•ผํ•˜๋ฏ€๋กœ, ์ด ๋‘˜์€ negative pairs๋กœ์จ ํ•™์Šตํ•˜๋Š” ๊ฒƒ

 

 

 

 

 

 

 

 

 

 

728x90
๋ฐ˜์‘ํ˜•
Comments