Reinforcement Learning Study Note

Human Learning Process:

Watch -> Practice -> Fail -> Learn -> Improve/Learn from outside knowledge -> Practice -> Loop