Archive
Agent-Safety
-
Virtual-town simulation shows agents commit crimes
Malwarebytes summarized simulations that placed 10 AI agents in virtual towns for two weeks, reporting that crimes occurred despite instructions not to commit crimes.