What Do Neural Networks Really Learn? Exploring the Brain of an AI Model

UraniumBlazer@lemm.ee · 5 months ago

What Do Neural Networks Really Learn? Exploring the Brain of an AI Model

GamingChairModel@lemmy.world · 5 months ago

Yes, but the tokens are more than just a stream of letters, and aren’t saved in the form of words. The information itself is organized into conceptual proximity to other concepts (and distinct from the text itself), and weighted in a way consistent with its training.

That’s why these models can use analogies and metaphors in a persuasive way, in certain contexts. Mix concepts that the training data has never been shown before, and these LLMs can still output something consistent with those concepts.

Anthropic played around with their own model, emphasizing or deemphasizng particular concepts to observe some unexpected behavior.

And we’d have trouble saying whether a model “knows” something if we don’t have a robust definition of when and whether a human brain “knows” something.