강의

멘토링

커뮤니티

Inflearn Community Q&A

yck9803208806's profile image
yck9803208806

asked

Reinforcement Learning Basics Theory

Markov Decision Process

강화학습 2강

Written on

·

363

0

벨만 방정식에서 v=R+감마Pv 에서 첫번째 v와 두번째 v는 다른 state의 value function인데 왜 v=(1-감바P)^-1R로 나타낼수 있나요? 이해가 잘안가요

강화학습

Answer

This question is waiting for answers
Be the first to answer!
yck9803208806's profile image
yck9803208806

asked

Ask a question