OpenAI found features in AI models that correspond to different ‘personas’

wccwcc
Jun 19, 2025 - 04:00
By examining an AI model's internal representations (the numbers that dictate how the model responds, which often seem completely incoherent to humans), OpenAI researchers were able to find patterns that lit up when a model misbehaved.
