๐Ÿ˜Ž ๊ณต๋ถ€ํ•˜๋Š” ์ง•์ง•์•ŒํŒŒ์นด๋Š” ์ฒ˜์Œ์ด์ง€?

๋ชจ๋ธ ํŒŒ๋ผ๋ฏธํ„ฐ batch, epoch, learning rate๋ž€? ๋ณธ๋ฌธ

๐Ÿ‘ฉ‍๐Ÿ’ป ์ธ๊ณต์ง€๋Šฅ (ML & DL)/ML & DL

๋ชจ๋ธ ํŒŒ๋ผ๋ฏธํ„ฐ batch, epoch, learning rate๋ž€?

์ง•์ง•์•ŒํŒŒ์นด 2022. 12. 15. 16:23
728x90
๋ฐ˜์‘ํ˜•

batch_size : ๋งค ํ›ˆ๋ จ ๋‹จ๊ณ„๋งˆ๋‹ค ํ•™์Šตํ•  ๋ฐ์ดํ„ฐ์˜ ํฌ๊ธฐ๋ฅผ ๊ฒฐ์ •

epoch : ๋ฐฐ์น˜ ์‚ฌ์ด์ฆˆ๋กœ ๋ถ„ํ• ๋œ ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ ์ „์ฒด๋ฅผ 1ํšŒ ํ•™์Šตํ•˜๋Š” ํšŸ์ˆ˜๋ฅผ ๋ช‡ ๋ฒˆ ๋ฐ˜๋ณตํ• ์ง€

learning rate : ๋ชจ๋ธ์˜ weight์ด ์—…๋ฐ์ดํŠธ ๋  ๋•Œ๋งˆ๋‹ค ์˜ˆ์ƒ ์˜ค๋ฅ˜์— ๋Œ€ํ•œ ์‘๋‹ต์œผ๋กœ ๋ชจ๋ธ์„ ์กฐ์ •

 

 

๐Ÿ‡ Learning rate ํด ๋•Œ

  • ํ•œ ๋ฒˆ์˜ step์—์„œ ํ•™์Šต์ด ํฌ๊ฒŒ ์ง„ํ–‰
  • ๋ณดํญ์ด ํฌ๊ธฐ ๋•Œ๋ฌธ์— ์ข€ ๋” ๋นจ๋ฆฌ ์ˆ˜๋ ด
  • ์˜ค๋ฒ„์Š›์ด ์‹ฌํ•˜๊ฒŒ ์ผ์–ด๋‚˜ loss๊ฐ€ ์ „ํ˜€ ์ค„์ง€ ์•Š์Œ

 

 

๐Ÿ‡ Learning rate ์ž‘์„ ๋•Œ

  • step ๋ณดํญ์ด ์ž‘์•„ ์กฐ๊ธˆ์”ฉ ํ•™์Šต
  • ์ž‘์€ ๋ณดํญ ๋•Œ๋ฌธ์— local minima์— ๋น ์ง

 

 

 

 ๐Ÿฆž batch_size ํด ๋•Œ

  • ํ•œ ๋ฒˆ ํ•™์Šตํ•  ๋•Œ ๋งŽ์€ ๋ฐ์ดํ„ฐ๋กœ ํ•™์Šต
  • batch๊ฐ€ ํฌ๋ฉด ๊ณ„์‚ฐ๋˜๋Š” loss๊ฐ’์˜ ํŽธ์ฐจ๊ฐ€ ์ž‘์•„ ๊ณผ์ ํ•ฉ ์œ„ํ—˜

 

 

 ๐Ÿฆž batch_size ์ž‘์„ ๋•Œ

  • ์ž‘์€ ๋ฐฐ์น˜๋Š” ์ž‘์€ ๋ฐ์ดํ„ฐ๋กœ ํ•™์Šตํ•˜๋ฏ€๋กœ loss์˜ ๋ถ„์‚ฐ์ด ์ปค์„œ regularize ํšจ๊ณผํ•™์Šต ์‹œ๊ฐ„์ด ์˜ค๋ž˜ ๊ฑธ๋ฆผ
  • step์ด ๋งŽ์•„ local minima ์œ„ํ—˜

 

 

 

 

 

728x90
๋ฐ˜์‘ํ˜•
Comments