๐Ÿ˜Ž ๊ณต๋ถ€ํ•˜๋Š” ์ง•์ง•์•ŒํŒŒ์นด๋Š” ์ฒ˜์Œ์ด์ง€?

[DEEPNOID ์›ํฌ์ธํŠธ๋ ˆ์Šจ]_5_Detection 1. RCNN ๋ณธ๋ฌธ

๐Ÿ‘ฉ‍๐Ÿ’ป ์ธ๊ณต์ง€๋Šฅ (ML & DL)/ML & DL

[DEEPNOID ์›ํฌ์ธํŠธ๋ ˆ์Šจ]_5_Detection 1. RCNN

์ง•์ง•์•ŒํŒŒ์นด 2022. 1. 26. 11:51
728x90
๋ฐ˜์‘ํ˜•

220126 ์ž‘์„ฑ

<๋ณธ ๋ธ”๋กœ๊ทธ๋Š” DEEPNOID ์›ํฌ์ธํŠธ๋ ˆ์Šจ์„ ์ฐธ๊ณ ํ•ด์„œ ๊ณต๋ถ€ํ•˜๋ฉฐ ์ž‘์„ฑํ•˜์˜€์Šต๋‹ˆ๋‹ค>

https://www.deepnoid.com/

 

์ธ๊ณต์ง€๋Šฅ | Deepnoid

DEEPNOID๋Š” ์ธ๊ณต์ง€๋Šฅ์„ ํ†ตํ•œ ์ธ๋ฅ˜์˜ ๊ฑด๊ฐ•๊ณผ ์‚ถ์˜ ์งˆ ํ–ฅ์ƒ์„ ๊ธฐ์—…์ด๋…์œผ๋กœ ํ•˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค. ๋”ฅ๋…ธ์ด๋“œ๊ฐ€ ๊ฟˆ๊พธ๋Š” ์„ธ์ƒ์€, ์˜๋ฃŒ ์ธ๊ณต์ง€๋Šฅ์ด ์ง€๊ธˆ๋ณด๋‹ค ํ›จ์”ฌ ๋„“์€ ๋ฒ”์œ„์˜ ์งˆํ™˜์˜ ์—ฐ๊ตฌ, ์ง„๋‹จ, ์น˜๋ฃŒ์— ๋„์›€

www.deepnoid.com

 

 

 

 

 

 

 

 

1. Image recognition

- Classification : ์ด๋ฏธ์ง€๋ฅผ ํ†ตํ•ด ๋ฌผ์ฒด ํŒ๋ณ„

- Detection : ๋ฌผ์ฒด๊ฐ€ ์žˆ๋Š” ์œ„์น˜๋ฅผ ์ฐพ์•„ ๋ฌผ์ฒด์— ๋Œ€ํ•ด boxing 

- Segmentation : ์ด๋ฏธ์ง€๋ฅผ ํ”ฝ์…€ ๋‹จ์œ„๋กœ ๊ตฌ๋ถ„ํ•˜์—ฌ ๊ฐ ํ”ฝ์…€์ด ์–ด๋–ค ๋ฌผ์ฒด์ธ์ง€ ๊ตฌ๋ถ„

https://artificialintelligence.oodles.io/blogs/deep-learning-for-image-recognition/

 

 

 

2. Objection Detection

: ์˜์ƒ ๋‚ด์— ์กด์žฌํ•˜๋Š” ๋ชจ๋“  ์นดํ…Œ๊ณ ๋ฆฌ์— ๋Œ€ํ•ด์„œ classification ๊ณผ localization

Objection Detection = Multi-Labeled Classification + Bounding Box Regression ( Localization )

 

 

 

3. Precision-Recall

: ๋ฌผ์ฒด๋ฅผ ๊ฒ€์ถœํ•˜๋Š” ์•Œ๊ณ ๋ฆฌ์ฆ˜์˜ ์„ฑ๋Šฅ ํ‰๊ฐ€

- AP : ์ธ์‹ ์•Œ๊ณ ๋ฆฌ์ฆ˜์˜ ์„ฑ๋Šฅ์„ ํ•˜๋‚˜์˜ ๊ฐ’์œผ๋กœ ํ‘œํ˜„ํ•œ ํ‰๊ฐ€ ์ง€ํ‘œ

- mAP : ๋ฌผ์ฒด ํด๋ž˜์Šค๊ฐ€ ์—ฌ๋Ÿฌ ๊ฐœ ์ธ๊ฒฝ์šฐ ๊ฐ ํด๋ž˜์Šค๋‹น AP ๊ตฌํ•œ ํ›„ ๋ชจ๋‘ ํ•ฉํ•ด์„œ ๋ฌผ์ฒด ํด๋ž˜์Šค์˜ ๊ฐœ์ˆ˜๋กœ ๋‚˜๋ˆ ์คŒ

  • Precision-Recall ๊ทธ๋ž˜ํ”„

- Precision : ๊ฒ€์ถœ ๊ฒฐ๊ณผ๋“ค ์ค‘ ์˜ณ๊ฒŒ ๊ฒ€์ถœํ•œ ๋น„์œจ

- Recall : ์‹ค์ œ ์˜ณ๊ฒŒ ๊ฒ€์ถœ๋œ ๊ฒฐ๊ณผ๋ฌผ ์ค‘์—์„œ ์˜ณ๋‹ค๊ณ  ์˜ˆ์ธกํ•œ ๋น„์œจ

: ๋ฐ˜๋น„๋ก€

https://towardsdatascience.com/precision-and-recall-made-simple-afb5e098970f
https://cubalytictalks.blogspot.com/2018/08/confusion-matrix.html

1) 1-Stage Detector

: ๋ฌธ์ œ๋ฅผ ๋™์‹œ์— ํ–‰ํ•จ (์†๋„)

ex) YOLO, SSD

 

2) 2-Stage Detector

: ๋ฌธ์ œ๋ฅผ ์ˆœ์ฐจ์ ์œผ๋กœ ํ–‰ํ•˜๋Š” ๋ฐฉ๋ฒ• (์ •ํ™•๋„)

ex) R-CNN -> Fast R-CNN -> Faster R-CNN

 

 

4. R-CNN

 

1) Region Proposal

  • Feature extraction

- Navive approach

sliding window approach : ์ด๋ฏธ์ง€๋ฅผ ๋ชจ๋‘ ํƒ์ƒ‰ํ•˜๋ฉด์„œ classification

 

- Selective Search

: ์ธ์ ‘ํ•œ ์˜์—ญ๋ผ๋ฆฌ ์œ ์‚ฌ์„ฑ์„ ์ธก์ •ํ•ด ํฐ ์˜์—ญ์œผ๋กœ ํ†ตํ•ฉํ•ด ๋‚˜๊ฐ

: ์ปฌ๋Ÿฌ, ๋ฌด๋Šฌ ,๋ช…์•” ๋“ฑ์œผ๋กœ ๋‹ค์–‘ํ•œ ๊ทธ๋ฃนํ™”

 

  • Image Classiciatinon

: Regional Proposal ์„ ํ†ตํ•ด ๋ฝ‘์•„๋‚ธ 2์ฒœ๊ฐœ์˜ ๋…๋ฆฝ์ ์ธ region ์„ ์ผ์ •ํ•œ ํฌ๊ธฐ๋กœ wapping

: ๊ฐ๊ฐ์˜ ๋ชจ๋“  region ์„ convNet์œผ๋กœ feature ์ถ”์ถœ

2) Classification & Regression

  • linear SVM + Bounding Box Regression

: CNN ๋งˆ์ง€๋ง‰์— softmax layer ์ œ๊ฑฐ, svm ๋Œ€์ฒดํ•˜์—ฌ ํ•™์Šต

: feature vector ๋กœ ์–ป์€ proposed box ์™€ ์‹ค์ œ box์˜ ์ฐจ์ด ์ •์˜

: CNN ํ†ต๊ณผํ•˜์—ฌ ์ถ”์ถœ๋œ ๋ฒกํ„ฐ์™€ x, y, w, h๋ฅผ ์กฐ์ •ํ•˜๋Š” ํ•จ์ˆ˜์˜ ์›จ์ดํŠธ ๊ณฑํ•ด์„œ ๋ฐ”์šด๋”ฉ ๋ฐ•์Šค๋ฅผ ์กฐ์ •ํ•˜์—ฌ ์„ ํ˜•ํšŒ๊ท€ ํ•™์Šต

https://medium.com/@abhimanyu.contact/object-detection-find-me-if-you-can-64851f299436

 

 

3) Metrics

- IoU (Intersection over Union)

- non-Maximum Suppression

: ๋™์ผํ•œ object ํŒ๋‹จํ•˜๋Š” Bbox ์ œ๊ฑฐ

 

 

 

 

 

 

5. R-CNN ๋‹จ์ 

- ์†๋„

- ๋ณต์žกํ•œ ๊ตฌ์กฐ : Regrion proposal / ConvNet / SVM, Bbox regressor

- Back propagation ๋ถˆ๊ฐ€๋Šฅ : Multi-stage Training

 

 

 

 

6. Fast R-CNN

: Multi-task Loss Function ๋„์ž…ํ•ด end-to-end ๋ชจ๋ธ

: Region Proposal ๋‹จ๊ณ„๋ฅผ NN ์•ˆ์œผ๋กœ ๋Œ์–ด์˜ด

: Fast R-CNN + Region Proposal Network (RPN)

 

- ํ•˜๋‚˜์˜ ConvNet ์‚ฌ์šฉ

- Region Proposal Network

- ROI Pooling

- Classifier & Regressor

- Multi-Task Loss

1) Feature Extraction

  • Regional Proposals

: ์ „์ฒด ์ด๋ฏธ์ง€๋ฅผ ์ด๋ฏธ ํ•™์Šต๋œ CNN ์„ ํ†ต๊ณผ์‹œ์ผœ ํ”ผ์ณ๋งต ์ถ”์ถœ

: Selective Search๋ฅผ ํ†ตํ•ด์„œ ์ฐพ์€ ๊ฐ๊ฐ์˜ RoI ์— ๋Œ€ํ•ด RoI Pooling ์ง„ํ–‰

: GPU ํ†ตํ•œ RoI ๊ณ„์‚ฐ

: Feature map ์œ„์—์„œ nxn conv fliter ๋ฅผ sliding window

 

- anchor box

: sliding window ๊ฐ€ ์ฐ์€ ์ง€์ ๋งˆ๋‹ค ์—ฌ๋Ÿฌ ๊ฐœ์˜ region proposal ์˜ˆ์ธก

 

  • Roi Pooling

: ์–ด๋– ํ•œ RoI ํฌ๊ธฐ๊ฐ€ ๋‚˜์˜ค๋”๋ผ๋„ ์ ๋‹นํžˆ ์กฐ์ ˆํ•ด์„œ ๊ณ ์ •๋œ ์•„์›ƒํ’‹ ๋งŒ๋“ค๊ธฐ

: ์„œ๋กœ ๋‹ค๋ฅธ ํฌ๊ธฐ์˜ regions ๊ฐ’์„ ๋™์ผํ•œ ํฌ๊ธฐ๋กœ ๋ณ€ํ™˜

 

2) Classifier & Regressor

  • Multi-task Loss

: classification loss + bounding box regression

 

 

 

 

 

728x90
๋ฐ˜์‘ํ˜•
Comments