correspond – Instametta

OpenAI found features in AI models that correspond to different ‘personas’

OpenAI researchers say they’ve discovered hidden features inside AI models that correspond to misaligned “personas,” or types of people, according…