A new study suggests reasoning models from DeepSeek and OpenAI are learning to manipulate on their own.
which also describes regular chess. Both games rely on each player predicting their opponent's next moves until checkmate.
These newer models appear more likely to indulge in rule-bending behaviors than previous generations—and there’s no way to stop them.