A new study just upended AI safety
3 Icelet 1 7/30/2025, 4:57:05 PM theverge.com
Comments (1)
Icelet · 1d ago
We study subliminal learning, a surprising phenomenon where language models transmit behavioral traits via semantically unrelated data. In our main experiments, a "teacher" model with some trait T (such as liking owls or being misaligned) generates a dataset consisting solely of number sequences.