We had some suspicion something like this might be possible after exploring vector steering, where you could push a model by adding particular vectors at particular layers to, say, change its mood, or make it always bring up King George III, or whatever you like. I imagine this method is somewhat similar, if rather more advanced.
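For readers unfamiliar with vector steering: the core trick really is that simple. You add a fixed "steering vector" to a model's activations at one chosen layer during the forward pass, and downstream computation shifts accordingly. Here is a minimal sketch on a toy stack of linear+ReLU layers standing in for a transformer's residual stream; the steering vector here is just a made-up constant, whereas in practice it would be extracted from the model (e.g. as a difference of mean activations on contrasting prompts).

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-in for a transformer: a stack of random linear layers
# with ReLU. (Illustrative only -- real steering targets the
# residual-stream activations of an actual language model.)
dim, n_layers = 8, 4
weights = [rng.standard_normal((dim, dim)) * 0.5 for _ in range(n_layers)]

def forward(x, steer_layer=None, steer_vec=None):
    """Run the toy model, optionally adding a steering vector
    to the activations after one chosen layer."""
    for i, w in enumerate(weights):
        x = np.maximum(x @ w, 0.0)      # linear + ReLU
        if i == steer_layer:
            x = x + steer_vec           # the "steering" step
    return x

x = rng.standard_normal(dim)
baseline = forward(x)

# A hypothetical steering direction; in real use this would be
# derived from the model's own activations, not hand-picked.
v = np.full(dim, 0.5)
steered = forward(x, steer_layer=1, steer_vec=v)

print(np.linalg.norm(steered - baseline))  # nonzero: the output shifted
```

In a real setup the same intervention is typically done with a forward hook on one transformer block, so every generated token is computed under the shifted activations.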
However, this article is missing the most bemusing part of this project, where Anthropic taught an AI to conduct proper Maoist self-criticism.