
AI Safety Careers

Curated jobs in AI safety, governance and frontier AI.


125 active roles found for Anthropic

Research Engineer / Scientist, Alignment Science - London

The role involves conducting research on AI safety and alignment, focusing on understanding and steering the behavior of powerful AI systems.

Added May 22, 2026 · London, UK · AI Safety & Alignment · £260K - £370K / year

Research Engineer / Scientist, Alignment Science

Research Engineer/Scientist on Anthropic’s Alignment Science team, conducting experimental AI safety research on powerful future systems, safety evaluations, alignment stress-testing, and related safeguards work.

Added May 22, 2026 · Bay Area · AI Safety & Alignment · $350K - $500K / year

Research Engineer, RL Infrastructure (Knowledge Work)

The Research Engineer will focus on the reliability and integrity of AI training environments and evaluations, ensuring they are stable and high-quality.

Added May 22, 2026 · San Francisco, CA · AI Safety & Alignment · $350K - $850K / year

Research Engineer / Research Scientist, Tokens

The role focuses on building reliable and interpretable AI systems, emphasizing safety and societal impacts.

Added May 22, 2026 · New York City, NY; Seattle, WA; San Francisco, CA · AI Safety & Alignment · $350K - $500K / year

Research Engineer/Research Scientist, Pre-training

The Research Engineer/Research Scientist role at Anthropic focuses on developing large language models with an emphasis on safety, alignment, and societal impacts.

Added May 22, 2026 · Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY · Remote · AI Safety & Alignment · $350K - $850K / year

Research Engineer / Research Scientist, Pre-training

The role involves research and engineering to develop safe and trustworthy large language models, focusing on multimodal capabilities and ethical implications of AI.

Added May 22, 2026 · Zürich, CH · AI Safety & Alignment · CHF 280K - CHF 680K / year

Research Engineer, Production Model Post-Training

The Research Engineer will enhance AI model safety and alignment through post-training techniques, impacting the quality and capabilities of production models.

Added May 22, 2026 · Zürich, CH · AI Safety & Alignment

Research Engineer, Production Model Post-Training

The Research Engineer will focus on post-training processes to enhance AI model safety and alignment, implementing techniques to improve production model quality.

Added May 22, 2026 · San Francisco, CA | New York City, NY | Seattle, WA · AI Safety & Alignment · $350K - $500K / year

Research Engineer, Pretraining Scaling - London

The Research Engineer will work on training and optimizing large-scale AI models, ensuring their reliability and safety, and addressing production issues.

Added May 22, 2026 · London, UK · AI Safety & Alignment · £260K - £630K / year

Research Engineer, Pretraining Scaling

The Research Engineer will work on training and optimizing production pretrained models, ensuring their reliability and efficiency, with a focus on the societal impacts and safety of AI systems.

Added May 22, 2026 · San Francisco, CA · AI Safety & Alignment · $350K - $850K / year

Research Engineer, Pretraining

The Research Engineer, Pretraining role at Anthropic focuses on developing large language models with an emphasis on safety, alignment, and societal impacts.

Added May 22, 2026 · London, UK · AI Safety & Alignment · £260K - £630K / year

Research Engineer, Performance RL

The Research Engineer, Performance RL role focuses on advancing AI models' capabilities in safely writing code, collaborating with alignment teams to ensure safety and effectiveness.

Added May 22, 2026 · San Francisco, CA · AI Safety & Alignment · $350K - $850K / year

Research Engineer, Machine Learning (Reinforcement Learning)

The Research Engineer role focuses on advancing the safety and capabilities of large language models through reinforcement learning, collaborating with alignment teams to ensure safe AI systems.

Added May 22, 2026 · London, UK · AI Safety & Alignment · £260K - £630K / year

Research Engineer, Knowledge Team

The Research Engineer will redesign how language models interact with external data sources, focusing on safety and societal impacts.

Added May 22, 2026 · Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY · Remote · AI Safety & Alignment · $350K - $850K / year

Research Engineer, Interpretability

The Research Engineer, Interpretability role at Anthropic focuses on building infrastructure for interpretability research to enhance AI safety through mechanistic understanding of models.

Added May 22, 2026 · San Francisco, CA · Remote · AI Safety & Alignment · $315K - $560K / year

Research Engineer, Economic Research Data Platform

The Research Engineer will design and maintain infrastructure for studying AI's economic impact, collaborating with various teams to ensure data reliability and compliance.

Added May 22, 2026 · San Francisco, CA · AI Governance & Policy · $300K - $405K / year

Research Engineer, Discovery

The Research Engineer, Discovery role at Anthropic focuses on developing infrastructure and evaluation frameworks to support the training and deployment of AI systems aimed at achieving scientific AGI.

Added May 22, 2026 · San Francisco, CA · AI Safety & Alignment · $350K - $850K / year

Research Engineer, Cybersecurity Reinforcement Learning

The Research Engineer will work on advancing AI models in secure coding and vulnerability remediation, blending research and engineering in the field of cybersecurity.

Added May 22, 2026 · San Francisco, CA | New York City, NY · AI Safety & Alignment · $300K - $405K / year

Product Manager, Developer Productivity

The Product Manager for Developer Productivity at Anthropic will oversee the developer experience and governance frameworks for AI-assisted development, ensuring safe collaboration between engineers and AI agents.

Added May 22, 2026 · San Francisco, CA | New York City, NY · AI Governance & Policy · $385K - $595K / year

Policy Design Manager, Age-Appropriate Design

Policy manager for Anthropic focused on age-appropriate design, child safety, content classification, enforcement guidelines, and safety evaluations for AI products.

Added May 22, 2026 · San Francisco, CA · AI Compliance & Risk Management · $245K - $285K / year

Showing 61–80 of 125 roles