Job description
Anthropic • San Francisco, CA
About Anthropic
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
About the Team
The Frontier Red Team (FRT) is a small, focused technical research team within Anthropic's Policy organization. Our goal is to make the entire world safer in this era of advanced AI by understanding what these systems can do and building the defenses that matter.
In 2026, we're focused on researching and ensuring safety with self-improving, highly autonomous AI systems—especially ones with cyberphysical capabilities.** **See our previous related work on cyberdefense, robotics, and Project Vend. We'll also be collaborating closely with the Emerging Risks workstream to understand novel, societal-scale risks that arise when agents interface with the external world. This is early-stage, high-conviction research with the potential for outsized impact.
Note: We are exclusively hiring in SF. We support relocation, but all hires must relocate before starting.