2910 shaares
55 private links
55 private links
1 result
tagged
corrigibility
As artificially intelligent systems grow in intelli- gence and capability, some of their available options may allow them to resist intervention by their programmers. We call an AI system “corrigible” if it cooperates with what its creators regard as a corrective intervention, despite de- fault incentives for rational agents to resist at- tempts to shut them down or modify their preferences.