NO.PZ2022120201000007
问题如下:
What are the distinctions between the Monte Carlo and temporal difference methods for reinforcement learning?
选项:
解释:
The Monte Carlo method updates strategies using the total future rewards. Temporal difference learning looks only one decision ahead when updating strategies.
没看懂,麻烦讲解下,谢谢