What Are Tokens? How AI Breaks Text Into Pieces
When people first use an AI system, they often imagine that it reads and understands text the way humans do — word by word, sentence by sentence. In reality, AI models see text very differently. At the center of that difference is something called a token . What is a token? A token is a small piece of text that an AI model works with internally. It might be a whole word, part of a word, a number, or even a punctuation mark. AI models do not read letters or words directly. They process sequences of tokens. For example, a simple sentence might be broken into pieces like this: “Artificial” “ intelligence” “ is” “ useful” The exact breakdown depends on the model, but the idea is the same: text is converted into manageable chunks. Why AI models use tokens Tokens make it possible for AI models to handle language mathematically. Each token is represented as a number, which allows the model to calculate probabilities and relationships between pieces of text. I...