The paper addresses the AI shutdown problem, a long-standing challenge in AI safety. The shutdown problem asks how to design AI systems that will shut down when instructed, will not try to prevent ...
We’re now deep into the AI era, where every week brings another feature or task that AI can accomplish. But given how far down the road we already are, it’s all the more essential to zoom out and ask ...
The authors argue that generative AI introduces a new class of alignment risks because interaction itself becomes a mechanism of influence. Humans adapt their behavior in response to AI outputs, ...
The Manila Times on MSNOpinion

Humanity’s alignment problem

It’s lunchtime on top of the world again. Time magazine’s annual “Person of the Year” issue has revived the iconic Depression ...
OpenAI’s o3 just cleared artificial general intelligence (AGI) benchmarks. Eighty-seven percent on ARC-AGI, the test that’s supposed to measure whether machines can actually think. Silicon Valley ...
Technology alone is no longer enough. Organizations face an unprecedented proliferation of tools, platforms and systems, each ...
The same AI that aced the genius test can't count how many times the letter "R" appears in "strawberry." OpenAI's o3 just cleared artificial general intelligence (AGI) benchmarks. Eighty-seven percent ...