Categories

LLM jailbreaking risk

Jailbreaking

Jailbreaking is an attempt to bypass a model's safety rules or restrictions. In large language models, it usually means using carefully crafted prompts, context or interaction patterns to make the ...

Jailbreaking is an attempt to bypass a model's safety rules or restrictions. In large language models, it usually means using carefully crafted prompts, context or interaction patterns to make the Read article

AI risk management

AI risk management means identifying, measuring, controlling and monitoring risks created by AI systems. It helps organisations understand what can go wrong, how serious the impact could be, which safeguards ...

AI risk management means identifying, measuring, controlling and monitoring risks created by AI systems. It helps organisations understand what can go wrong, how serious the impact could be, which safeguards Read article