Categories

Exploitation and long-term reward

Exploitation (in reinforcement learning)

Exploitation is the use of the best-known action based on current knowledge. In reinforcement learning, it means that an agent chooses the option that currently seems most rewarding, instead of ...
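
As a rough illustration of the definition above, a purely exploiting agent can be sketched as greedy selection over its current value estimates. This is a minimal sketch, not a method taken from the article: the function name `greedy_action` and the sample estimates are hypothetical, and ties are assumed to resolve to the lowest action index.

```python
def greedy_action(value_estimates):
    """Return the index of the action with the highest estimated value.

    Assumption: value_estimates is a non-empty list of numbers, one per
    action. Ties go to the lowest index, because max() keeps the first
    maximal element it encounters.
    """
    return max(range(len(value_estimates)), key=lambda a: value_estimates[a])


# Hypothetical current estimates of reward for three actions.
q_estimates = [0.2, 0.8, 0.5]

# Pure exploitation: always pick the action that currently looks best.
chosen = greedy_action(q_estimates)
print(chosen)
```

With these made-up estimates the agent picks action 1 every time, since nothing in the sketch ever updates the estimates or tries the other actions.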
