PhD studentship focused on exploring large language model safety through mechanistic interpretability and behavioral research.
Added todayCambridge, UKAI Safety & Alignment
1 active role found for Cambridge University, Department of Engineering
PhD studentship focused on exploring large language model safety through mechanistic interpretability and behavioral research.