The role involves developing methods to mitigate false information in large language models, creating evaluation protocols, and conducting research in LLM security and interpretability.
Added todayCopenhagen, DenmarkAI Safety & Alignment
2 active roles found for University of Copenhagen, Department of Computer Science
The role involves developing methods to mitigate false information in large language models, creating evaluation protocols, and conducting research in LLM security and interpretability.
PhD fellowship focused on researching mechanistic interpretability methods to enhance the security of large language models and address misinformation.