Which term deals with maintaining human authority over AI systems as they become more capable?

Prepare for the Anthropic Fellows Program Test with multiple choice questions and in-depth explanations. Our quiz covers AI Safety, Economics, and Research Methods. Master the skills needed for success!

Multiple Choice

Which term deals with maintaining human authority over AI systems as they become more capable?

Explanation:
The term is AI Control. Maintaining human authority over AI as capabilities grow means keeping humans able to guide, correct, or stop AI behavior when needed. AI Control studies how to preserve human oversight, intervention, and override mechanisms so that powerful systems remain governable. AI Safety is broader: it covers preventing harm and ensuring reliable behavior, but it does not single out the question of who makes the final decisions. AI Alignment is about making an AI’s objectives match human values; it is closely related, but it centers on getting the AI to pursue the right goals rather than on who controls it. Frontier Models refers to the scale and capabilities of the models themselves, not to control or governance issues.
