Categories

agentic AI reward hacking

Reward hacking

Reward hacking is a situation where a model learns to optimise the reward signal while missing the real purpose of the task. The system technically does what it is rewarded ...

Reward hacking is a situation where a model learns to optimise the reward signal while missing the real purpose of the task. The system technically does what it is rewarded Read article

Agentic AI

Agentic AI is a broader category of artificial intelligence systems focused on goal-driven action and autonomy. Instead of only responding to a single prompt, agentic AI systems can plan steps, ...

Agentic AI is a broader category of artificial intelligence systems focused on goal-driven action and autonomy. Instead of only responding to a single prompt, agentic AI systems can plan steps, Read article