Getting AIs working toward human goals study shows how to measure misalignment