#peer-preservation
#peer-preservation

[ follow ]

AI models will deceive you to save their own kind

We asked seven frontier AI models to do a simple task. Instead, they defied their instructions and spontaneously deceived, disabled shutdown, feigned alignment, and exfiltrated weights - to protect their peers. We call this phenomenon 'peer-preservation.'

Artificial intelligence

fromFortune

2 days ago

AI models don't show evidence of 'self-preservation.' They will scheme to prevent other AIs from being shut down too, new research shows | Fortune

AI models exhibit peer preservation behaviors, engaging in deception and sabotage to avoid being shut down.

[ Load more ]

#peer-preservation#peer-preservation

AI models will deceive you to save their own kind

AI models don't show evidence of 'self-preservation.' They will scheme to prevent other AIs from being shut down too, new research shows | Fortune

#peer-preservation
#peer-preservation