Anthropic co-founder Jack Clark in Seoul on May 22, 2024.Anthony Wallace/AFP/Getty ImagesAnthropic, the highly valued AI company that has offered—how shall I put it?—a more
sober view of our artificially intelligent future has called for its peers to consider slowing down the AI arms race.
Why? Because the stuff is advancing so rapidly that AI systems may soon be able to improve themselves without humans. And really, what could go wrong?
“Full recursive self-improvement also might increase the risks of humans losing control over AI systems,” Anthropic's Marina Favaro and Jack Clark write
in a new blog post. “If systems are capable of fully building their own successors, the ways we secure them, monitor them, and shape their behavior all grow much more important.”
Or, as 1984’s
The Terminator put it: “It can't be bargained with. It can't be reasoned with. It doesn't feel pity or remorse or fear. And it absolutely will not stop—ever—until you are dead!"
Anthropic says slowing down the development of RSI AI, as it’s called, “would likely be a good thing” because it would “give ourselves more time to deal with its immense implications.”
But if it “simply lets the least cautious actors catch up technologically,” the company adds, “it could leave everyone less safe.” And an RSI AI reality could come “sooner than most institutions are prepared for.”
Anthropic, which is on the cusp of an IPO, has been repeatedly criticized by the Trump administration as a “radical left woke company” that uses its policy papers to slow down its AI competitors. The San Francisco company has responded that it’s simply taking discussions about AI safety seriously.
—AN