If the guardrail is a line in a prompt, the model can ignore it ... and eventually will
I learned this building an agent pipeline. An engineer agent would implement a feature, then a reviewer agent was supposed to check the work. More often than not, the review got skipped. So I added a line to the agent instructions: "All changes must be reviewed. Do not skip