Categories

RLHF reward signal

RLHF

RLHF, short for reinforcement learning from human feedback, is a training approach in which human preferences are used to help shape model behaviour. Instead of telling the model only what ...

RLHF, short for reinforcement learning from human feedback, is a training approach in which human preferences are used to help shape model behaviour. Instead of telling the model only what Read article

Reward

Reward is one of the core concepts in reinforcement learning. It is a numerical signal that an agent receives after taking an action in a certain situation. This signal tells ...

Reward is one of the core concepts in reinforcement learning. It is a numerical signal that an agent receives after taking an action in a certain situation. This signal tells Read article