AI Safety CareersCurated jobs in AI safety, governance and frontier AI.
AboutSubmit a job
AboutPrivacy PolicyTermsSubmit a jobSaved jobs

Filters

Experience

Salary

Get the best AI safety roles weekly

A concise digest of alignment, governance, and AI risk jobs.

By subscribing, you agree to receive the AI Safety Careers newsletter. You can unsubscribe at any time. See our Privacy Policy.

504 active roles found

Apollo Research
Software Engineer, Full-Stack
Apply

The role involves building tools for AGI safety research, focusing on model evaluation and analysis.

Added todayLondon, UKAI Safety & Alignment£100K-£200K / year
Apollo Research
Software Engineer, Backend
Scale AI
Research Scientist, AI Controls and Monitoring
Scale AI
Research Scientist, Frontier Risk Evaluations
Cambridge University, Department of Engineering
PhD Studentship, Monitoring and Increasing LLM Safety
Apollo Research
Research Scientist / Engineer, Evaluations
Apollo Research
Research Scientist / Engineer, Science of Scheming
University of Copenhagen, Department of Computer Science
Postdoc, LLM Factuality Detection
Transluce
AI Behaviour Engineer
Center for AI Safety
Senior Research Engineer
Center for AI Safety
Senior Research Scientist
University of Copenhagen, Department of Computer Science
PhD Fellowship, Mechanistic Interpretability for Large Language Model Security
ALLAI
Intern, Responsible AI
General-Purpose AI Policy Lab
Project Officer, International Coordination
OpenAI
Security Engineer, Infrastructure Security
Armilla
AI Engineer
European Union, European Commission
Policy Officer, Artificial Intelligence
Armilla
Applied Scientist, AI Risk
Menlo Ventures
Anthology Fund
Apply
Apollo Research
Full Stack Engineer, Monitoring
PreviousPage 3 of 26Next
Apply

The role involves building tools for AGI safety research, focusing on evaluations and monitoring systems for language models.

Added todayLondon, UKAI Safety & Alignment£100K-£200K / year
Apply

The role involves designing methods to ensure AI models align with goals, developing monitoring techniques, researching control mechanisms, and conducting red-team simulations.

Added todaySan Francisco Bay Area, New York, NYAI Safety & Alignment$197.4K-$246.8K / year

Weekly roles

A concise digest of alignment, governance, and AI risk jobs.

By subscribing, you agree to receive the AI Safety Careers newsletter. You can unsubscribe at any time. See our Privacy Policy.

Apply

The role involves designing evaluation measures for assessing risks from frontier AI systems, building testing harnesses, and collaborating with government agencies on advanced AI risk evaluations.

Added todaySan Francisco Bay Area, New York, NYAI Compliance & Risk Management$197.4K-$246.8K / year
Apply

PhD studentship focused on exploring large language model safety through mechanistic interpretability and behavioral research.

Added todayCambridge, UKAI Safety & Alignment
Apply

The role involves running evaluations on frontier AI systems to assess risks, including pre-deployment evaluations and analyzing model behavior.

Added todayLondon, UKAI Safety & Alignment£100K-£200K / year
Apply

Research and develop methods to detect and mitigate deceptive behaviors in AI systems, collaborating with AI labs and designing experiments to study model behaviors.

Added todayLondon, UKAI Safety & Alignment£100K-£200K / year
Apply

The role involves developing methods to mitigate false information in large language models, creating evaluation protocols, and conducting research in LLM security and interpretability.

Added todayCopenhagen, DenmarkAI Safety & Alignment
Apply

The AI Behaviour Engineer role involves building AI evaluation methods to assess model behaviors, focusing on policy and oversight needs, and executing evaluations to address harmful model behaviors.

Added todaySan Francisco Bay AreaAI Safety & Alignment$310K-$500K / year
Apply

The Senior Research Engineer will lead ML research projects focused on advancing AI safety through robustness, honesty, and transparency.

Added todaySan Francisco Bay AreaAI Safety & Alignment$170K-$230K / year
Apply

Lead high-impact research on AI safety, focusing on experiments related to AI honesty, robustness, and transparency.

Added todaySan Francisco Bay AreaAI Safety & Alignment$200K-$250K / year
Apply

PhD fellowship focused on researching mechanistic interpretability methods to enhance the security of large language models and address misinformation.

Added todayCopenhagen, DenmarkAI Safety & Alignment
Apply

Internship focused on advancing Responsible AI through projects on AI policy, regulation, and ethical implications.

Added todayNetherlandsAI Governance & Policy
Apply

The role involves organizing international dialogues and events focused on general-purpose AI issues, emphasizing security and coordination.

Added todayParis, FranceAI Governance & Policy€2.4K-€3K / month
Apply

The role involves designing and implementing security measures for AI workloads and infrastructure, addressing safety and responsible development concerns.

Added todaySan Francisco Bay Area, New York, NY, Seattle metro area, Remote, USARemoteAI Compliance & Risk Management$292.5K-$405K / year
Apply

The AI Engineer role involves designing and maintaining an adversarial AI evaluation platform and evaluating AI models' performance and risks.

Added todayToronto, CanadaAI Safety & Alignment
Apply

The AI Policy Officer will monitor compliance of generative AI models with the EU AI Act and assess industry responses to enforcement decisions.

Added todaySan Francisco Bay AreaAI Governance & Policy
Apply

The role involves developing AI systems for risk assessment, conducting research on AI model behavior, and designing evaluation pipelines for safety and reliability.

Added todayToronto, CanadaAI Compliance & Risk Management

The Anthology Fund focuses on investments in trust and safety tooling that enhances AI safety and supports responsible AI deployment.

Added todayUSAAI Safety & Alignment
Apply

The role involves building tools to monitor AI coding agents for safety and security failures, developing data processing pipelines, and creating visualizations of AI behaviors.

Added todayLondon, UKAI Safety & Alignment£100K-£180K / year