AI Safety & Alignment
Member of Technical Staff, Research
San Francisco Bay Area
$250,000 - $450,000
In this role, you'll conduct research on AI capabilities, risks and mitigations through benchmarking and alignment assessment.
Model Evaluation and Threat Research · Added today
Applications are handled by the employer on an external website. AI Safety Careers does not process applications directly.
Develop and maintain benchmarks and metrics to measure frontier model capabilities on threat-relevant tasks.
Build research infrastructure and evaluation methods to assess model behaviour under monitoring protocols.
Create maintainable, scalable systems and lead projects from ideation to delivery.
Contribute rigorous research by drawing on knowledge of the literature and solving open-ended problems.
Model Evaluation and Threat Research (formerly Alignment Research Center, Evaluations) is a project focused on evaluating the capabilities and alignment of advanced ML models.