AI Safety CareersCurated jobs in AI safety, governance and frontier AI.
AboutSubmit a job
AboutPrivacy PolicyTermsSubmit a jobSaved jobs

Filters

Experience

Salary

Get the best AI safety roles weekly

A concise digest of alignment, governance, and AI risk jobs.

By subscribing, you agree to receive the AI Safety Careers newsletter. You can unsubscribe at any time. See our Privacy Policy.

165 active roles found for Anthropic

Anthropic
Research Engineer, Science of Scaling
Apply

The Research Engineer will work on developing safe and trustworthy AI systems, focusing on the science of scaling large language models.

Added 3 days agoLondon, UKAI Safety & Alignment£260K-£630K / year
Anthropic
Research Engineer, Safeguards Labs
Anthropic
Research Engineer, RL Infrastructure (Knowledge Work)
Anthropic
Research Engineer, Reward Models Platform
Anthropic
Research Engineer / Research Scientist, Tokens
Anthropic
Research Engineer/Research Scientist, Pre-training
Anthropic
Research Engineer / Research Scientist, Pre-training
Anthropic
Research Engineer, Production Model Post-Training
Anthropic
Research Engineer, Production Model Post-Training
Anthropic
Research Engineer, Pretraining Scaling - London
Anthropic
Research Engineer, Pretraining Scaling
Anthropic
Research Engineer, Pretraining
Anthropic
Research Engineer, Performance RL
Anthropic
Research Engineer, Machine Learning (Reinforcement Learning)
Anthropic
Research Engineer, Knowledge Team
Anthropic
Research Engineer, Interpretability
Anthropic
Research Engineer, Environment Scaling
Anthropic
Research Engineer, Economic Research Data Platform
Anthropic
Research Engineer, Discovery
Anthropic
Research Engineer, Cybersecurity Reinforcement Learning
PreviousPage 4 of 9Next
Apply

The Research Engineer in Safeguards Labs will lead projects focused on AI safety, including detecting misuse and strengthening model safeguards.

Added 3 days agoSan Francisco, CA | New York City, NYAI Safety & Alignment$350K-$850K / year
Apply

The Research Engineer will focus on the reliability and integrity of AI training environments and evaluations, ensuring they are stable and high-quality.

Added 3 days agoSan Francisco, CAAI Safety & Alignment$350K-$850K / year

Weekly roles

A concise digest of alignment, governance, and AI risk jobs.

By subscribing, you agree to receive the AI Safety Careers newsletter. You can unsubscribe at any time. See our Privacy Policy.

Apply

The Research Engineer will develop tools and infrastructure for evaluating and optimizing reward signals in AI systems, focusing on enhancing model behavior safety and efficiency.

Added 3 days agoRemote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NYRemoteAI Safety & Alignment$350K-$500K / year
Apply

The role focuses on building reliable and interpretable AI systems, emphasizing safety and societal impacts.

Added 3 days agoNew York City, NY; Seattle, WA; San Francisco, CAAI Safety & Alignment$350K-$500K / year
Apply

The Research Engineer/Research Scientist role at Anthropic focuses on developing large language models with an emphasis on safety, alignment, and societal impacts.

Added 3 days agoRemote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NYRemoteAI Safety & Alignment$350K-$850K / year
Apply

The role involves research and engineering to develop safe and trustworthy large language models, focusing on multimodal capabilities and ethical implications of AI.

Added 3 days agoZürich, CHAI Safety & AlignmentCHF 280K-CHF 680K / year
Apply

The Research Engineer will enhance AI model safety and alignment through post-training techniques, impacting the quality and capabilities of production models.

Added 3 days agoZürich, CHAI Safety & Alignment
Apply

The Research Engineer will focus on post-training processes to enhance AI model safety and alignment, implementing techniques to improve production model quality.

Added 3 days agoSan Francisco, CA | New York City, NY | Seattle, WAAI Safety & Alignment$350K-$500K / year
Apply

The Research Engineer will work on training and optimizing large-scale AI models, ensuring their reliability and safety, and addressing production issues.

Added 3 days agoLondon, UKAI Safety & Alignment£260K-£630K / year
Apply

The Research Engineer will work on training and optimizing production pretrained models, ensuring their reliability and efficiency, with a focus on the societal impacts and safety of AI systems.

Added 3 days agoSan Francisco, CAAI Safety & Alignment$350K-$850K / year
Apply

The Research Engineer, Pretraining role at Anthropic focuses on developing large language models with an emphasis on safety, alignment, and societal impacts.

Added 3 days agoLondon, UKAI Safety & Alignment£260K-£630K / year
Apply

The Research Engineer, Performance RL role focuses on advancing AI models' capabilities in safely writing code, collaborating with alignment teams to ensure safety and effectiveness.

Added 3 days agoSan Francisco, CAAI Safety & Alignment$350K-$850K / year
Apply

The Research Engineer role focuses on advancing the safety and capabilities of large language models through reinforcement learning, collaborating with alignment teams to ensure safe AI systems.

Added 3 days agoLondon, UKAI Safety & Alignment£260K-£630K / year
Apply

The Research Engineer will redesign how language models interact with external data sources, focusing on safety and societal impacts.

Added 3 days agoRemote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NYRemoteAI Safety & Alignment$350K-$850K / year
Apply

The Research Engineer, Interpretability role at Anthropic focuses on building infrastructure for interpretability research to enhance AI safety through mechanistic understanding of models.

Added 3 days agoSan Francisco, CARemoteAI Safety & Alignment$315K-$560K / year
Apply

The Research Engineer will improve AI models by developing training environments and QA frameworks, focusing on safety and reliability in AI systems.

Added 3 days agoRemote-Friendly (Travel Required) | San Francisco, CARemoteAI Safety & Alignment$350K-$850K / year
Apply

The Research Engineer will design and maintain infrastructure for studying AI's economic impact, collaborating with various teams to ensure data reliability and compliance.

Added 3 days agoSan Francisco, CAAI Governance & Policy$300K-$405K / year
Apply

The Research Engineer, Discovery role at Anthropic focuses on developing infrastructure and evaluation frameworks to support the training and deployment of AI systems aimed at achieving scientific AGI.

Added 3 days agoSan Francisco, CAAI Safety & Alignment$350K-$850K / year
Apply

The Research Engineer will work on advancing AI models in secure coding and vulnerability remediation, blending research and engineering in the field of cybersecurity.

Added 3 days agoSan Francisco, CA | New York City, NYAI Safety & Alignment$300K-$405K / year