๐Ÿ˜Ž ๊ณต๋ถ€ํ•˜๋Š” ์ง•์ง•์•ŒํŒŒ์นด๋Š” ์ฒ˜์Œ์ด์ง€?

[DEEPNOID ์›ํฌ์ธํŠธ๋ ˆ์Šจ]_7_Object Detection 2 ๋ณธ๋ฌธ

๐Ÿ‘ฉ‍๐Ÿ’ป ์ธ๊ณต์ง€๋Šฅ (ML & DL)/ML & DL

[DEEPNOID ์›ํฌ์ธํŠธ๋ ˆ์Šจ]_7_Object Detection 2

์ง•์ง•์•ŒํŒŒ์นด 2022. 1. 27. 15:18
728x90
๋ฐ˜์‘ํ˜•

220127 ์ž‘์„ฑ

<๋ณธ ๋ธ”๋กœ๊ทธ๋Š” DEEPNOID ์›ํฌ์ธํŠธ๋ ˆ์Šจ์„ ์ฐธ๊ณ ํ•ด์„œ ๊ณต๋ถ€ํ•˜๋ฉฐ ์ž‘์„ฑํ•˜์˜€์Šต๋‹ˆ๋‹ค>

https://www.deepnoid.com/

 

์ธ๊ณต์ง€๋Šฅ | Deepnoid

DEEPNOID๋Š” ์ธ๊ณต์ง€๋Šฅ์„ ํ†ตํ•œ ์ธ๋ฅ˜์˜ ๊ฑด๊ฐ•๊ณผ ์‚ถ์˜ ์งˆ ํ–ฅ์ƒ์„ ๊ธฐ์—…์ด๋…์œผ๋กœ ํ•˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค. ๋”ฅ๋…ธ์ด๋“œ๊ฐ€ ๊ฟˆ๊พธ๋Š” ์„ธ์ƒ์€, ์˜๋ฃŒ ์ธ๊ณต์ง€๋Šฅ์ด ์ง€๊ธˆ๋ณด๋‹ค ํ›จ์”ฌ ๋„“์€ ๋ฒ”์œ„์˜ ์งˆํ™˜์˜ ์—ฐ๊ตฌ, ์ง„๋‹จ, ์น˜๋ฃŒ์— ๋„์›€

www.deepnoid.com

 

 

 

 

 

1. Classification

1) Classification

: feature Extractor -> classifier

 

2) Object Detection 

 

3) Instance Segmentation

: Semantic VS Instance Segmentation

: input -> pixelwise classification -> probability map -> threshold

 

 

 

 

 

2. One stage VS Multi Stage Detection

1) One stage

Feature Extraction -> Region proposal -> Classifiction -> Box Regression -> semantic, Instance Segmentation

 

 

 

2) YOLO, SSD

Feature Extraction -> Classifiction -> Box Regression

yolo
SSD

=> Yolo๋Š” 7x7 grid ํ•˜๋‚˜

=> SSD๋Š” ์ „์ฒด ์ด๋ฏธ์ง€๋ฅผ 38x38, 19x19, 10x10, 5x5, 3x3, 1x1์˜ ๊ทธ๋ฆฌ๋“œ๋กœ ๋‚˜๋ˆ„๊ณ  output๊ณผ ์—ฐ๊ฒฐ

 

 

 

 

 

3. Comparision

- Multi Stage

: Backbone Network ๋กœ ์ถ”์ถœํ•œ Feature ๋ฅผ ๊ฐ๊ฐ์˜ ๋ชฉ์ ์— ๋งž๋Š” ์ž‘์€ ์‹ ๊ฒฝ๋ง์œผ๋กœ class scoring, box coordinate ๋”ฐ๋กœ ํš๋“

: Object ๊ฒ€์ถœ ์„ฑ๋Šฅ, class ๋ถ„๋ฅ˜ ๋›ฐ์–ด๋‚จ

 

- One Stage

: Backbone Network ๋กœ ์ถ”์ถœํ•œ Feature ๋ฅผ ๊ฐ๊ฐ์˜ ๋ชฉ์ ์— ๋งž๋Š” ์ž‘์€ ์‹ ๊ฒฝ๋ง์œผ๋กœ class scoring, box coordinate ํ•œ๋ฒˆ์— ํš๋“

: Object ๊ฒ€์ถœ ์„ฑ๋Šฅ, class ๋ถ„๋ฅ˜ ์ƒ๋Œ€์ ์œผ๋กœ ๋–จ์–ด์ง

 

 

 

 

 

4. YOLO, You Only Look Once

  • yolo v1

: Localization, Classification ์„ ํ•˜๋‚˜์˜ Regression problem ์œผ๋กœ ํ•ด์„

: MMS๋กœ ๊ฐ ROI์˜ Box ๋ฅผ ์ค‘๋ณต ์ œ๊ฑฐ ํ›„ ํ›„์ฒ˜๋ฆฌ๋ฅผ ํ†ตํ•œ score ํš๋“

: ๊ฐ Grid ๋ณ„๋กœ obejct class probablity ๋ฅผ ๊ณ„์‚ฐ

: ์ฐพ์•„๋‚ธ Box ๋“ค์˜ ์ค‘์‹ฌ์  ์ขŒํ‘œ, ๋†’๋‚ฎ์ด, class socre์„ ํ†ตํ•ด ground truth์™€ prediction ๊ฐ„์˜ ์ฐจ์ด๋ฅผ ๊ณ„์‚ฐํ•˜๋„๋ก ์†์‹ค ํ•จ์ˆ˜ ๊ตฌ์„ฑํ•˜๊ณ  ์ด๋ฅผ minimize

: ์†์‹คํ•จ์ˆ˜๋Š” sum of squared error, ์˜ค์ฐจ ์ œ๊ณฑํ•ฉ(SSE) ์ด์šฉ

: BOX์— object ์—†์œผ๋ฉด confidence ๊ฐ€ 0์ด ๋˜๋„๋ก penalty ๋ถ€์—ฌ!

 

  •  yolo v2

: Batch Normalization, ๋ชจ๋“  conv layer ๋’ค์— BN ๊ฒฐํ•ฉ, ๋ณ„๋„์˜ Regularizer ๋‹ค ์ œ๊ฑฐ

: High Resolution Classifier, feature mpa ์„ grid ๋กœ ์‚ฌ์šฉํ•˜๊ธฐ ๋•Œ๋ฌธ์— ๊ณ ํ•ด์ƒ๋„ ํ”ผ์ฒ˜๋งต ํš๋“

: ์ž…๋ ฅ ์ด๋ฏธ์ง€๋ฅผ 416*416 ๋กœ ํ•œ๋‹ค

-> object ์ค‘์‹ฌ์ ์„ gird ์ƒ์—์„œ ์ž˜ ์ฐพ์•„๋‚ด๊ธฐ ์šฐํ•ด

-> ๋Œ€๋ถ€๋ถ„ ๋ฐ์ดํ„ฐ๋Š” ์˜์ƒ์˜ ์ค‘์•™์— object ์œ„์น˜

 

: Convolution with Anchor Box

: Dimension Clustering

: Object Detection Task ์—์„œ Recall ์˜๋ฏธ

-> ์‹ค์ œ obejct์˜ ์œ„์น˜๋ฅผ ์˜ˆ์ธกํ•œ ๋น„์œจ์ด ๋†’์Œ ์˜๋ฏธ

 

: Fine-Grained Feature

-> ์ตœ์ข… feature map ํฌ๊ธฐ 13*13์€ ํฐ obejct ๊ฒ€์ถœ์—” ์ถฉ๋ถ„ํ•˜์ง€๋งŒ ์ž‘์€ object ๊ฒ€์ถœ์—” ๋ถˆ์ถฉ๋ถ„

-> 26*26*512 => 13*13*2048 => stacking => 13*13*3072

 

: VGGNet ์˜ 3*3 filter ์‚ฌ์šฉ

 

 

  •  yolo v3

: feature pyramid Network

 

  •  yolo v4

: ๋†’์€ ์ •ํ™•๋„์™€ ๋น ๋ฅธ Detectin ์†๋„ ํ™•๋ณด

 

 

 

 

5. Single-shot Detector

: ์ž‘์€ object ์ •ํ™•ํ•˜๊ฒŒ ๊ฒ€์ถœ

: ๋‹ค์–‘ํ•œ scale์˜ feature map์„ ํš๋“ํ•˜๊ธฐ ์œ„ํ•ด ๋ณด์กฐ ์‹ ๊ฒฝ๋ง ์‚ฌ์šฉ

: ์„œ๋กœ ๋‹ค๋ฅธ scale์˜ feature map ๋ณ„๋กœ ๊ฐ๊ฐ ๋‹ค๋ฅธ ํ˜•ํƒœ anchor box ์‚ฌ์šฉ

: YOLO v1 ๋‹จ์ ์€ ์ตœ์ข… feature map ๋งŒ์œผ๋กœ grid ๊ตฌ์„ฑํ•ด์„œ ์‚ฌ์šฉ

 

 

 

 

 

 

 

 

 

 

728x90
๋ฐ˜์‘ํ˜•
Comments