๋ชฉ๋ก์ „์ฒด ๊ธ€ (1005)

๐Ÿ˜Ž ๊ณต๋ถ€ํ•˜๋Š” ์ง•์ง•์•ŒํŒŒ์นด๋Š” ์ฒ˜์Œ์ด์ง€?

๋ชจ๋ธ ํŒŒ๋ผ๋ฏธํ„ฐ batch, epoch, learning rate๋ž€?

batch_size : ๋งค ํ›ˆ๋ จ ๋‹จ๊ณ„๋งˆ๋‹ค ํ•™์Šตํ•  ๋ฐ์ดํ„ฐ์˜ ํฌ๊ธฐ๋ฅผ ๊ฒฐ์ • epoch : ๋ฐฐ์น˜ ์‚ฌ์ด์ฆˆ๋กœ ๋ถ„ํ• ๋œ ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ ์ „์ฒด๋ฅผ 1ํšŒ ํ•™์Šตํ•˜๋Š” ํšŸ์ˆ˜๋ฅผ ๋ช‡ ๋ฒˆ ๋ฐ˜๋ณตํ• ์ง€ learning rate : ๋ชจ๋ธ์˜ weight์ด ์—…๋ฐ์ดํŠธ ๋  ๋•Œ๋งˆ๋‹ค ์˜ˆ์ƒ ์˜ค๋ฅ˜์— ๋Œ€ํ•œ ์‘๋‹ต์œผ๋กœ ๋ชจ๋ธ์„ ์กฐ์ • ๐Ÿ‡ Learning rate ํด ๋•Œ ํ•œ ๋ฒˆ์˜ step์—์„œ ํ•™์Šต์ด ํฌ๊ฒŒ ์ง„ํ–‰ ๋ณดํญ์ด ํฌ๊ธฐ ๋•Œ๋ฌธ์— ์ข€ ๋” ๋นจ๋ฆฌ ์ˆ˜๋ ด ์˜ค๋ฒ„์Š›์ด ์‹ฌํ•˜๊ฒŒ ์ผ์–ด๋‚˜ loss๊ฐ€ ์ „ํ˜€ ์ค„์ง€ ์•Š์Œ ๐Ÿ‡ Learning rate ์ž‘์„ ๋•Œ step ๋ณดํญ์ด ์ž‘์•„ ์กฐ๊ธˆ์”ฉ ํ•™์Šต ์ž‘์€ ๋ณดํญ ๋•Œ๋ฌธ์— local minima์— ๋น ์ง ๐Ÿฆž batch_size ํด ๋•Œ ํ•œ ๋ฒˆ ํ•™์Šตํ•  ๋•Œ ๋งŽ์€ ๋ฐ์ดํ„ฐ๋กœ ํ•™์Šต batch๊ฐ€ ํฌ๋ฉด ๊ณ„์‚ฐ๋˜๋Š” loss๊ฐ’์˜ ํŽธ์ฐจ๊ฐ€ ์ž‘์•„ ๊ณผ์ ํ•ฉ ์œ„ํ—˜ ๐Ÿฆž batch_size ์ž‘..