AI

LLM Token Counter

Estimate token counts for GPT, Claude, and Llama prompts.

All tools

Text

Estimates

183Charactersincl. whitespace
28Wordswhitespace split
46GPT-3.5 / GPT-4~chars / 4
49Claude~chars / 3.8
50Llama 3~chars / 3.7
44Gemini~chars / 4.2

Estimates use character-ratio heuristics. Real tokenization varies by language and content (code, JSON, and non-Latin text use more tokens).

Frequently asked questions

What is a token in an LLM?
A token is a chunk of text the model processes as a single unit, usually 3-4 characters or roughly 0.75 of an English word. Punctuation, whitespace, and rare words often become their own tokens.
Why do token counts differ between GPT, Claude, and Llama?
Each model family uses a different tokenizer (BPE variants like cl100k, o200k, or SentencePiece) trained on different corpora. The same sentence can vary by 10-30 percent across providers.
Is this counter exact for billing purposes?
No. Browser-side counters are heuristic estimates and won't match the provider's tokenizer byte-for-byte, especially for code, emoji, or non-Latin scripts. Use it for budgeting, not invoicing.