AI Safety CareersCurated jobs in AI safety, governance and frontier AI.
AboutSubmit a job
AboutPrivacy PolicyTermsSubmit a jobSaved jobs
AI Safety JobsAI Governance JobsAI Policy JobsAI Compliance JobsAI Red Teaming JobsRemote AI Safety Jobs

Filters

Experience

Salary

Curated AI safety and governance jobs

Find roles in AI safety, AI governance, AI policy, evaluations, red teaming, responsible AI and frontier AI labs.

AI Safety JobsAI Governance JobsAI Policy JobsAI Compliance JobsAI Red Teaming JobsRemote AI Safety Jobs

Get the best AI safety roles weekly

A concise digest of alignment, governance, and AI risk jobs.

By subscribing, you agree to receive the AI Safety Careers newsletter. You can unsubscribe at any time. See our Privacy Policy.

504 active roles found

Anthropic
Research Engineer, RL Infrastructure (Knowledge Work)
Apply

The Research Engineer will focus on the reliability and integrity of AI training environments and evaluations, ensuring they are stable and high-quality.

Added 4 days agoSan Francisco, CAAI Safety & Alignment$350K-$850K / year
Anthropic
Research Engineer, Reward Models Platform
Apply

The Research Engineer will develop tools and infrastructure for evaluating and optimizing reward signals in AI systems, focusing on enhancing model behavior safety and efficiency.

Added 4 days ago
Anthropic
Research Engineer / Research Scientist, Tokens
Apply

The role focuses on building reliable and interpretable AI systems, emphasizing safety and societal impacts.

Added 4 days ago
Anthropic
Research Engineer/Research Scientist, Pre-training
Apply

The Research Engineer/Research Scientist role at Anthropic focuses on developing large language models with an emphasis on safety, alignment, and societal impacts.

Added 4 days ago
Anthropic
Research Engineer / Research Scientist, Pre-training
Apply

The role involves research and engineering to develop safe and trustworthy large language models, focusing on multimodal capabilities and ethical implications of AI.

Added 4 days ago
Anthropic
Research Engineer, Production Model Post-Training
Apply

The Research Engineer will enhance AI model safety and alignment through post-training techniques, impacting the quality and capabilities of production models.

Added 4 days ago
Anthropic
Research Engineer, Production Model Post-Training
Apply

The Research Engineer will focus on post-training processes to enhance AI model safety and alignment, implementing techniques to improve production model quality.

Added 4 days ago
Anthropic
Research Engineer, Pretraining Scaling - London
Apply

The Research Engineer will work on training and optimizing large-scale AI models, ensuring their reliability and safety, and addressing production issues.

Added 4 days ago
Anthropic
Research Engineer, Pretraining Scaling
Apply

The Research Engineer will work on training and optimizing production pretrained models, ensuring their reliability and efficiency, with a focus on the societal impacts and safety of AI systems.

Added 4 days ago
Anthropic
Research Engineer, Pretraining
Apply

The Research Engineer, Pretraining role at Anthropic focuses on developing large language models with an emphasis on safety, alignment, and societal impacts.

Added 4 days ago
Anthropic
Research Engineer, Performance RL
Apply

The Research Engineer, Performance RL role focuses on advancing AI models' capabilities in safely writing code, collaborating with alignment teams to ensure safety and effectiveness.

Added 4 days ago
Anthropic
Research Engineer, Machine Learning (Reinforcement Learning)
Apply

The Research Engineer role focuses on advancing the safety and capabilities of large language models through reinforcement learning, collaborating with alignment teams to ensure safe AI systems.

Added 4 days ago
Anthropic
Research Engineer, Knowledge Team
Apply

The Research Engineer will redesign how language models interact with external data sources, focusing on safety and societal impacts.

Added 4 days ago
Anthropic
Research Engineer, Interpretability
Apply

The Research Engineer, Interpretability role at Anthropic focuses on building infrastructure for interpretability research to enhance AI safety through mechanistic understanding of models.

Added 4 days ago
Anthropic
Research Engineer, Environment Scaling
Apply

The Research Engineer will improve AI models by developing training environments and QA frameworks, focusing on safety and reliability in AI systems.

Added 4 days ago
Anthropic
Research Engineer, Economic Research Data Platform
Apply

The Research Engineer will design and maintain infrastructure for studying AI's economic impact, collaborating with various teams to ensure data reliability and compliance.

Added 4 days ago
Anthropic
Research Engineer, Discovery
Apply

The Research Engineer, Discovery role at Anthropic focuses on developing infrastructure and evaluation frameworks to support the training and deployment of AI systems aimed at achieving scientific AGI.

Added 4 days ago
Anthropic
Research Engineer, Cybersecurity Reinforcement Learning
Apply

The Research Engineer will work on advancing AI models in secure coding and vulnerability remediation, blending research and engineering in the field of cybersecurity.

Added 4 days ago
Anthropic
Research Engineer, AI Observability
Apply

The Research Engineer will design AI monitoring systems to analyze large datasets, focusing on misuse prevention and model audits, collaborating with safety teams.

Added 4 days ago
Anthropic
Prompt Engineer, Agent Prompts & Evals
Apply

The role focuses on prompt engineering and evaluation development to ensure AI model quality and safety, collaborating with product teams to enhance user experiences.

Added 4 days ago
PreviousPage 21 of 26Next
Remote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY
Remote
AI Safety & Alignment
$350K-$500K / year
New York City, NY; Seattle, WA; San Francisco, CA
AI Safety & Alignment
$350K-$500K / year

Weekly roles

A concise digest of alignment, governance, and AI risk jobs.

By subscribing, you agree to receive the AI Safety Careers newsletter. You can unsubscribe at any time. See our Privacy Policy.

Remote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY
Remote
AI Safety & Alignment
$350K-$850K / year
Zürich, CH
AI Safety & Alignment
CHF 280K-CHF 680K / year
Zürich, CH
AI Safety & Alignment
San Francisco, CA | New York City, NY | Seattle, WA
AI Safety & Alignment
$350K-$500K / year
London, UK
AI Safety & Alignment
£260K-£630K / year
San Francisco, CA
AI Safety & Alignment
$350K-$850K / year
London, UK
AI Safety & Alignment
£260K-£630K / year
San Francisco, CA
AI Safety & Alignment
$350K-$850K / year
London, UK
AI Safety & Alignment
£260K-£630K / year
Remote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY
Remote
AI Safety & Alignment
$350K-$850K / year
San Francisco, CA
Remote
AI Safety & Alignment
$315K-$560K / year
Remote-Friendly (Travel Required) | San Francisco, CA
Remote
AI Safety & Alignment
$350K-$850K / year
San Francisco, CA
AI Governance & Policy
$300K-$405K / year
San Francisco, CA
AI Safety & Alignment
$350K-$850K / year
San Francisco, CA | New York City, NY
AI Safety & Alignment
$300K-$405K / year
San Francisco, CA
AI Safety & Alignment
$320K-$405K / year
San Francisco, CA | New York City, NY
AI Safety & Alignment
$320K-$405K / year