forked from zzakjista/kspm

Korea Stock Portfolio management with AI


Gayeon6423/KSPM_AI



1. Background & Overview


  • Topic: Developing a Policy-based Reinforce Agent for stock trading
  • Background
    • Deep learning projects that predict stock prices are common, but because they rest on prediction alone, they fall short of the actual goal of stock trading: making investment decisions and maximizing returns.
    • To match the real-world problem of trading in an uncertain environment, the agent learns return-maximizing actions from hundreds of thousands of scenarios.
  • Overview: Train a reinforcement learning agent to make return-maximizing trades, then carry out live trading on KOSPI 200 stocks.

2. Host, Participants & Outcome


  • Host: Data analysis society D&A, Conference Session
  • Eligibility and team size: teams of three
  • Outcome: conference presentation

3. Project Period


  • Project period: July 2023 to November 2023
  • Conference presentation: December 1, 2023

4. My Role


  • Collected stock price data and financial statement data
  • Preprocessed the stock price data
  • Implemented quality factor investing
  • Summarized reinforcement learning theory and ran experiments

5. Methodology & Development Process


Methodology: Advantage Actor Critic

  • The Policy Network learns its actions (Buy, Sell, Hold) through policy gradients.

  • A Critic Network is introduced to measure the value of the action predicted by the Policy Network.

  • The advantage term (Q(s_t, a_t) - V(s_t)) is used to train both the Policy and Critic Networks.

  • A2C is on-policy, so training runs on batches of N samples and samples are not reused.

  • Reward function design

    • When one buy is followed by one sell, the reward is the rate of return realized on that sale.
    • Of the Monte Carlo and Temporal Difference (TD) approaches, the one that proved more effective for the stock trading domain was adopted.

    [Image: stock trading]
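
The reward design above can be sketched roughly as follows. This is an illustration under stated assumptions, not the project's code: the function names `round_trip_reward`, `monte_carlo_returns`, and `td_targets`, the proportional `fee_rate`, and the discount factor `gamma` are all hypothetical.

```python
def round_trip_reward(buy_price, sell_price, fee_rate=0.0):
    """Rate of return of one buy followed by one sell.

    fee_rate is an assumed proportional fee charged on both legs.
    """
    cost = buy_price * (1.0 + fee_rate)        # cash paid at the buy
    proceeds = sell_price * (1.0 - fee_rate)   # cash received at the sell
    return (proceeds - cost) / cost


def monte_carlo_returns(rewards, gamma=0.99):
    """Discounted return G_t = r_t + gamma * G_{t+1}, computed backwards."""
    returns, g = [], 0.0
    for r in reversed(rewards):
        g = r + gamma * g
        returns.append(g)
    returns.reverse()
    return returns


def td_targets(rewards, next_values, gamma=0.99):
    """One-step TD target r_t + gamma * V(s_{t+1}) for each step."""
    return [r + gamma * v for r, v in zip(rewards, next_values)]
```

Monte Carlo returns need the whole episode before any update, while TD targets bootstrap from the critic's estimate of the next state, which is the trade-off the choice between the two approaches comes down to.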

Development Process

Data and Processing

  • Collected daily data for KOSPI 200 constituents over the past 10 years (2013.01.01~2022.12.31), with constituents as of 2023.08.26
    • Collected fields: open, high, low, close, volume, and trading value by investor type (institutional, individual, foreign)
  • Generated volume and price indicators and preprocessed them (cleansing, variable transformation, etc.)
  • Saved a scaler per stock so that the scaler fitted on the training period can be reused at test time
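
One way to persist a per-stock scaler is sketched below. It assumes a simple min-max scaler stored as JSON; the source does not say which scaler or file format the project used, and the helper names (`fit_minmax`, `transform`, `save_scaler`, `load_scaler`) and file naming are hypothetical.

```python
import json
import os


def fit_minmax(rows):
    """Compute per-column min and max from training-period rows only."""
    columns = list(zip(*rows))
    return {"min": [min(c) for c in columns], "max": [max(c) for c in columns]}


def transform(rows, scaler):
    """Scale rows with stored training statistics; constant columns map to 0.0."""
    scaled = []
    for row in rows:
        out = []
        for x, lo, hi in zip(row, scaler["min"], scaler["max"]):
            out.append((x - lo) / (hi - lo) if hi > lo else 0.0)
        scaled.append(out)
    return scaled


def save_scaler(scaler, directory, ticker):
    """Write one scaler file per ticker and return its path."""
    path = os.path.join(directory, f"{ticker}_scaler.json")
    with open(path, "w") as f:
        json.dump(scaler, f)
    return path


def load_scaler(directory, ticker):
    """Load the scaler that was fitted on this ticker's training period."""
    with open(os.path.join(directory, f"{ticker}_scaler.json")) as f:
        return json.load(f)
```

Because the statistics come from the training period only, test-period values can fall outside the range [0, 1]. That is expected and avoids leaking test data into preprocessing.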

Environment Setup

  • Built the pieces needed to construct the Environment and the state
    • Given an individual stock code, load that stock's full price data
    • The environment holds two datasets
      • Chart data: used by the agent to trade at actual prices (open, low, high, close)
      • Training data: the input to the policy network (various indicators)
    • Return an individual state of the specified window size, and return the next state
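
A minimal sketch of such an environment is shown below. The class name `WindowEnvironment`, its methods, and the assumption that the closing price sits at index 3 of each chart row are illustrative choices, not the repository's actual interface.

```python
class WindowEnvironment:
    """Hypothetical environment holding chart data and training features for one stock."""

    def __init__(self, chart_data, training_data, window_size):
        if len(chart_data) != len(training_data):
            raise ValueError("chart_data and training_data must be aligned by date")
        if window_size < 1 or window_size > len(training_data):
            raise ValueError("window_size must be between 1 and the data length")
        self.chart_data = chart_data
        self.training_data = training_data
        self.window_size = window_size
        self.t = window_size - 1  # index of the most recent day in the window

    def reset(self):
        """Go back to the first full window and return it."""
        self.t = self.window_size - 1
        return self.state()

    def state(self):
        """Window of the last `window_size` feature rows ending at day t."""
        return self.training_data[self.t - self.window_size + 1 : self.t + 1]

    def price(self):
        """Closing price on day t (assumes close is the 4th chart column)."""
        return self.chart_data[self.t][3]

    def step(self):
        """Advance one day and return (next_state, done)."""
        if self.t + 1 >= len(self.training_data):
            return None, True
        self.t += 1
        done = self.t == len(self.training_data) - 1
        return self.state(), done
```

Keeping the chart data separate from the scaled features means the agent settles trades at real prices while the network only ever sees normalized indicators.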

Agent

  • Trading parameters
    • Operating capital, portfolio value, average purchase price, transaction fee, etc.
  • Trading function declarations
    • Buy, Sell, Hold, etc.
  • Policy Network (Gradient Ascent)
    • loss = -logprob * advantage
  • Critic Network (Gradient Descent)
    • loss = (advantage)^2
  • Advantage = r_t + γ·V(s_{t+1}) - V(s_t):
    • Rewriting Q(s_t, a_t) with the Bellman equation as r_t + γ·V(s_{t+1}) lets the advantage term be expressed with state values only, so no separate Q network is needed and computation is cheaper.
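
The two losses can be illustrated for a single transition as below. This sketch uses the standard one-step TD estimate of the advantage, r_t + γ·V(s_{t+1}) - V(s_t), obtained by substituting the Bellman equation for Q(s_t, a_t). The function names, the discount factor `gamma`, and the `done` flag are assumptions, and plain Python is used here only to show the arithmetic, not how the project computes gradients.

```python
import math


def softmax(logits):
    """Numerically stable softmax over action logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def a2c_losses(logits, action, reward, value, next_value, gamma=0.99, done=False):
    """Policy loss, critic loss, and advantage for one transition."""
    bootstrap = 0.0 if done else gamma * next_value
    advantage = reward + bootstrap - value   # r + gamma * V(s') - V(s)
    log_prob = math.log(softmax(logits)[action])
    policy_loss = -log_prob * advantage      # minimizing this is gradient ascent on log_prob * advantage
    critic_loss = advantage ** 2             # squared TD error
    return policy_loss, critic_loss, advantage
```

In an automatic differentiation framework, the advantage inside the policy loss is normally treated as a constant, so that only the critic loss updates the value estimate.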

6. Reflections


Significance

  • Unlike the algorithms used in traditional financial engineering, this project was meaningful in that we developed an AI that learns from its own experience.
  • Research on reinforcement-learning-based algorithms was not yet very active, so we consider this a new and challenging piece of work.
  • Building the end-to-end model ourselves, without relying on a framework, was a valuable experience and one of the most meaningful parts of the project.

Limitations and Improvements

  • The growing number of hyperparameters to control made it difficult to search for the best model.
  • In falling markets the agent lost almost every time and often failed to earn a return, and when volatility was high it failed to take sensible actions.
  • We think the reward function needs to be refined to guide the model toward more precise learning.
  • Rather than relying on the model's actions alone, it seems necessary to inject financial-engineering rules that adjust those actions (e.g., a stop-loss floor, a ban on buying more than N times).
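
The rule injection suggested in the last point could look something like the sketch below. It is not part of the project: the action encoding, the function name `apply_trading_rules`, and the default thresholds (`stop_loss=-0.05`, `max_buys=3`) are all hypothetical values chosen for illustration.

```python
BUY, SELL, HOLD = 0, 1, 2  # assumed action encoding


def apply_trading_rules(action, avg_price, current_price, buy_count,
                        stop_loss=-0.05, max_buys=3):
    """Override the model's action with simple hand-written rules."""
    holding = buy_count > 0
    if holding and (current_price - avg_price) / avg_price <= stop_loss:
        return SELL   # stop-loss floor reached: force a sell
    if action == BUY and buy_count >= max_buys:
        return HOLD   # already bought max_buys times: block further buys
    return action     # otherwise keep the model's choice
```

Applying the rules as a filter after the policy network keeps the learned policy unchanged while bounding its worst-case behavior.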
