Alignment pretraining: AI discourse creates self-fulfilling (mis)alignment arxiv.org 7 points by anigbrowl 1 hour ago
_--__--__ 36 minutes ago The first rule of AI alignment is don't talk about AI alignment (in any medium that could end up in a training corpus).
The first rule of AI alignment is don't talk about AI alignment (in any medium that could end up in a training corpus).