648 active roles found
The role involves developing scalable interpretability assistants to detect unexpected behaviors in AI models, creating evaluations to identify undesirable behaviors, and advancing AI oversight capabilities.
The role involves conducting AI alignment research, designing experimental protocols, and communicating research findings.
The role involves building research infrastructure and automation tooling specifically for AI alignment research, contributing to model evaluations and safety evaluations.
The Director, AI Policy will lead the AI policy portfolio, managing a team and budget to develop strategies focused on AI safety, fairness, and responsible adoption.
The Director of Federal Affairs will lead engagement with Congress and the executive branch on AI policy, focusing on legislative strategy and relationships with policymakers.
The Product Manager for Enterprise will focus on developing security and compliance features for AI systems, ensuring they meet enterprise requirements and regulatory standards.
The Policy Communications Manager will drive external communications for Anthropic's security, governance, and responsible scaling programs, focusing on how the company develops secure AI systems and communicates its governance structures.
The Engineering Manager will lead the Review Tooling team responsible for building systems to investigate potential harms and enforce safety measures across AI products.
The Engineering Manager (Web Safety) role focuses on architecting and building system protections to prevent misuse and abuse of AI technologies, ensuring responsible deployment and compliance with safety standards.
The Engineering Manager for Cloud Safety will lead a team focused on ensuring the safe deployment and operation of AI systems, particularly in the context of cloud services, emphasizing safety evaluations and responsible AI development.
The Applied AI Security Architect will serve as a security expert for enterprise customers, focusing on compliance with European regulations and addressing security concerns related to AI deployment.
The role involves conducting AI alignment research focusing on model safety and interpretability, designing experiments, and evaluating deep learning models.
The role involves building intelligent systems for AI governance, focusing on automated content generation and interpretation to support AI risk management and compliance with regulations.
The Research Program Manager for the Alignment team will oversee alignment and safety projects, ensuring AI systems follow human intent and remain safe as capabilities scale.
The Software Engineer, Platform role involves designing and developing foundational platforms and systems that support model evaluations and safety, with a focus on reinforcement learning from human feedback.
The role involves building evaluation infrastructure for AI safety systems, focusing on detecting misuse and ensuring the reliability of automated abuse detection.
The role involves conducting research on privacy and security threats to AI systems, focusing on threat modeling and robustness evaluation.
The role involves investigating misuse of OpenAI's products related to child safety, developing detection strategies, and collaborating with various teams to ensure compliance and risk management.
The role involves building a platform for governance, risk, and compliance (GRC) at Anthropic, focusing on automating compliance processes and integrating various systems for effective risk management.
The role involves managing and engineering security risks related to AI systems, focusing on risk assessment, quantification, and building AI-native risk tooling.