It's a bit sad and confusing that LLMs ("Large Language Models") have little to do with language; It's just historical. They are highly general purpose technology for statistical modeling of token streams. A better name would be Autoregressive Transformers or something.
— Andrej Karpathy (@karpathy) September 14, 2024
Note that some time ago I pointed out that transformers operate on strings of colored beads in exactly the same way as they do on strings of word tokens.
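To make the point concrete, here is a minimal sketch (assuming PyTorch; the model class `TinyAutoregressiveTransformer` and the toy ID tables are hypothetical, not anyone's actual code). The transformer's interface is integer token IDs in, next-token logits out, so a stream of bead colors and a stream of words are indistinguishable to it once each is mapped into the same ID space.

```python
import torch
import torch.nn as nn

VOCAB_SIZE = 16  # hypothetical shared ID space, large enough for both toy vocabularies
D_MODEL = 32

class TinyAutoregressiveTransformer(nn.Module):
    """A causal transformer over integer IDs; it has no notion of 'words'."""
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(VOCAB_SIZE, D_MODEL)
        layer = nn.TransformerEncoderLayer(D_MODEL, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(D_MODEL, VOCAB_SIZE)  # next-token logits

    def forward(self, ids):  # ids: (batch, seq) integer tensor
        causal_mask = nn.Transformer.generate_square_subsequent_mask(ids.size(1))
        h = self.encoder(self.embed(ids), mask=causal_mask)
        return self.head(h)

model = TinyAutoregressiveTransformer().eval()

# Two unrelated vocabularies mapped into the same integer ID space.
word_ids = {"the": 0, "cat": 1, "sat": 2}
bead_ids = {"red": 3, "blue": 4, "green": 5}

words = torch.tensor([[word_ids[w] for w in ["the", "cat", "sat"]]])
beads = torch.tensor([[bead_ids[b] for b in ["red", "blue", "green", "red"]]])

# Same model, same code path -- the semantics of the tokens never enter.
print(model(words).shape)  # torch.Size([1, 3, 16])
print(model(beads).shape)  # torch.Size([1, 4, 16])
```

Nothing in the model refers to language: the embedding table, the attention layers, and the output head see only indices, which is the sense in which "Large Language Model" is a historical accident of what the token streams happened to encode.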