Lynch, A., Wright, B., Larson, C., Ritchie, S. J., Mindermann, S., Perez, E., Troy, K. K., & Hubinger, E. (2025).
Agentic Misalignment: How LLMs Could Be Insider Threats. arXiv preprint arXiv:2510.05179. Available at:
https://arxiv.org/abs/2510.05179