Preventing Paperclips: The Case Against Formalism
Explains why alignment-by-directive inevitably produces Goodhart distortions at scale if it is not accompanied by a framework that reasons dynamically about edge cases and boundaries through contextual mapping.