Scott Alexander

Choose a list of delegates

to vote as the majority of them.

Unless you vote directly.

Author and psychiatrist

X: @slatestarcodex

ai (1) ai-governance (1) emerging-tech (1) ethics (1) × nuclear (1) tech-ethics (1)

Top

New

Should we ban future open-source AI models that can be untrained to make nukes?

Scott Alexander strongly agrees and says:

I am Scott Alexander Add a real quote Report comment

Someday AIs really will be able to make nukes or pull off $500 million hacks. At that point, companies will have to certify that their model has been trained not to do this, and that it will stay trained. But if it were open-source, then anyone could easily untrain it. So after models become capable of making nukes or super-Ebola, companies won’t be able to open-source them anymore without some as-yet-undiscovered technology to prevent end users from using these capabilities. Sounds . . . good? I don’t know if even the most committed anti-AI-safetyist wants a provably-super-dangerous model out in the wild. Still, what happens after that? No cutting-edge open-source AIs ever again? I don’t know. In whatever future year foundation models can make nukes and hack the power grid, maybe the CIA will have better AIs capable of preventing nuclear terrorism, and the power company will have better AIs capable of protecting their grid. The law seems to leave open the possibility that in this situation, the AIs wouldn’t technically be capable of doing these things, and could be open-sourced. (source)

4 7 1y ago

Terms · Privacy · Contact