LLM Knowledge Base

In the context of Large Language Models, a token is the smallest unit of data that a model can understand and process. It can be as short as a single character or as long as a whole word, depending on the language and the specific tokenizer. Tokens break input data into manageable pieces, enabling the model to analyze, understand, and generate text.
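As a rough illustration of splitting text into pieces, the toy function below separates words and punctuation. Note this is only a sketch: real LLM tokenizers use learned subword vocabularies (e.g. byte-pair encoding), not simple regex rules.

```python
import re

def toy_tokenize(text: str) -> list[str]:
    # Toy illustration only: splits on word boundaries and punctuation.
    # Real tokenizers (BPE, WordPiece, etc.) use learned subword merges.
    return re.findall(r"\w+|[^\w\s]", text)

print(toy_tokenize("Tokenization matters!"))
# ['Tokenization', 'matters', '!']
```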

For most state-of-the-art models, one token corresponds on average to roughly 4 characters of English text.
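This heuristic gives a quick way to estimate token counts from raw text. The helper below is a sketch of that estimate; the `chars_per_token` ratio of 4 is the approximation stated above and varies by model and language.

```python
def estimate_tokens(text: str, chars_per_token: float = 4.0) -> int:
    """Rough token-count estimate using the ~4 characters/token heuristic.

    This is an approximation only; use the model's own tokenizer for
    exact counts (e.g. for billing or context-window budgeting).
    """
    return max(1, round(len(text) / chars_per_token))

print(estimate_tokens("a" * 400))
# 100
```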