AI Safety CareersCurated jobs in AI safety, governance and frontier AI.

Search roles, companies, or keywordsLocation

About Privacy Policy Terms Submit a job Saved jobs

Abuse Investigator (AI Self-Improvement Risk)

OpenAI · Added 3 days ago

Applications are handled by the employer on an external website. AI Safety Careers does not process applications directly.

Back to roles

AI Safety & Alignment

Abuse Investigator (AI Self-Improvement Risk)

Added 3 days agoOpenAISan Francisco$288K-$320K / year

San Francisco Bay Area

$288,000 - $320,000

In this role, you'll investigate model behavior to identify agentic or autonomous patterns that introduce safety risks.

Detect and analyze multi-step planning, capability chaining, tool use, persistence, and workaround behaviors.

Develop signals and tracking strategies to proactively identify emerging agentic risk patterns across the platform.

Identify gaps in safeguards, evaluations, or monitoring systems and propose improvements.

Communicate investigation findings clearly to technical, policy, and leadership stakeholders.

OpenAI is a frontier AI research and product company, with teams working on alignment, policy, and security. You can read concerns about doing harm by working at a frontier AI company in our career review on the topic, including concerns about OpenAI in particular. Our Take On This Role: We have concerns about OpenAI's track record on safety and responsible development and do not recommend almost any roles at OpenAI. Nonetheless, it is possible that OpenAI will create AGI in the next decade, in which case safety and security work at the company could be extremely important. If you receive a job offer from OpenAI, consider contacting us for career advice.

This listing may be aggregated from a public source or submitted by a third party. If you represent this employer and would like to update or remove this listing, contact support@aisafetycareers.com.

View all jobs from OpenAI

Get the best AI safety roles weekly

A concise digest of alignment, governance, and AI risk jobs.

By subscribing, you agree to receive the AI Safety Careers newsletter. You can unsubscribe at any time. See our Privacy Policy.

Abuse Investigator (AI Self-Improvement Risk) at OpenAI | AI Safety Careers