MCP Token Ledger

Every MCP server you connect injects its tools/list response into the model context on every turn. This page measures that cost so you can budget your context window before the model starts drifting. Heavy servers are often the reason a stack feels sluggish: a single 7,000-token MCP eats more context than most conversations.

Entries measured

Heaviest: 7,400 tok (github-official)
Combined cost: 55,951 tok if you installed all of these
Tokenizer: est. 3.5 characters per token (harness + editorial seeds)

Server	Category	Tools	Tokens	Cost	Top tool	Source
GitHub (Official)	DEV_TOOLS	27	7,400	Heavy	`create_or_update_file`	Estimate
Playwright (Microsoft)	BROWSER	32	6,800	Heavy	`browser_navigate`	Estimate
Stripe	INTEGRATION	24	5,100	Medium	`create_payment_intent`	Estimate
Jira (Atlassian)	DEV_TOOLS	24	4,800	Medium	`create_issue`	Estimate
Linear	PRODUCTIVITY	22	4,200	Medium	`create_issue`	Estimate
Slack	COMMUNICATIONS	18	3,800	Medium	`slack_post_message`	Estimate
Filesystem	FILESYSTEM	14	3,493	Medium	`read_text_file` (319 tok)	Measured
Notion	PRODUCTIVITY	16	3,400	Medium	`query_database`	Estimate
Google Drive	FILESYSTEM	14	3,100	Medium	`search_files`	Estimate
Memory	MEMORY	9	2,802	Light	`search_nodes` (391 tok)	Measured
Gmail	COMMUNICATIONS	11	2,700	Light	`send_email`	Estimate
Sentry	DEV_TOOLS	12	2,400	Light	`list_issues`	Estimate
Supabase	DATABASE	10	2,200	Light	`sql`	Estimate
sequentialthinking	-	1	1,287	Light	`sequentialthinking` (1,286 tok)	Measured
Postgres	DATABASE	6	1,100	Light	`execute_sql`	Estimate
Puppeteer	BROWSER	7	699	Trivial	`puppeteer_screenshot` (181 tok)	Measured
Brave Search	SEARCH	2	415	Trivial	`brave_local_search` (211 tok)	Measured
everart	-	1	255	Trivial	`generate_image` (255 tok)	Measured
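The headline figures can be re-derived from the Tokens column of the table. The snippet below is a minimal sketch of that arithmetic; the `LEDGER` dictionary and the `summarize` helper are names made up for illustration, not part of any published tooling.

```python
# Tokens column copied from the table above, keyed by server name.
# LEDGER and summarize are hypothetical names used only for this sketch.
LEDGER = {
    "GitHub (Official)": 7400,
    "Playwright (Microsoft)": 6800,
    "Stripe": 5100,
    "Jira (Atlassian)": 4800,
    "Linear": 4200,
    "Slack": 3800,
    "Filesystem": 3493,
    "Notion": 3400,
    "Google Drive": 3100,
    "Memory": 2802,
    "Gmail": 2700,
    "Sentry": 2400,
    "Supabase": 2200,
    "sequentialthinking": 1287,
    "Postgres": 1100,
    "Puppeteer": 699,
    "Brave Search": 415,
    "everart": 255,
}


def summarize(ledger: dict) -> dict:
    """Return the row count, the heaviest server, and the combined token cost."""
    heaviest = max(ledger, key=ledger.get)
    return {
        "entries": len(ledger),
        "heaviest": heaviest,
        "heaviest_tokens": ledger[heaviest],
        "combined": sum(ledger.values()),
    }


if __name__ == "__main__":
    print(summarize(LEDGER))
```

Summing the column this way gives the same 55,951-token combined cost quoted at the top of the page, with GitHub (Official) as the heaviest entry at 7,400 tokens.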

Methodology

Harness measurements spawn each MCP server over stdio, send the initialize and tools/list JSON-RPC messages, and count tokens in the serialised tools/list response. Tokens are estimated at roughly 3.5 characters per token. That ratio is applied consistently across servers, lands within 15% of exact tokenizer output, and is good enough for ranking.
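The measurement loop described above could look roughly like the following. This is a sketch under stated assumptions, not the harness used for this ledger: the function names, the `clientInfo` values, the `"2024-11-05"` protocol version string, and the choice to count the whole `result` object are all illustrative. It also assumes the server speaks newline-delimited JSON-RPC on stdio and expects a `notifications/initialized` message after the initialize handshake.

```python
import json
import math
import subprocess
from typing import IO, List, Optional

CHARS_PER_TOKEN = 3.5  # the ledger's stated approximation, not an exact tokenizer


def estimate_tokens(text: str) -> int:
    """Estimate a token count from character length at ~3.5 chars per token."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def rpc(method: str, params: Optional[dict] = None, msg_id: Optional[int] = None) -> str:
    """Serialise one JSON-RPC 2.0 message as a single line (no id means notification)."""
    msg: dict = {"jsonrpc": "2.0", "method": method}
    if msg_id is not None:
        msg["id"] = msg_id
    if params is not None:
        msg["params"] = params
    return json.dumps(msg) + "\n"


def read_response(stream: IO[str], msg_id: int) -> dict:
    """Read lines until the response carrying msg_id arrives, skipping anything else."""
    for line in stream:
        try:
            msg = json.loads(line)
        except json.JSONDecodeError:
            continue  # blank lines or log output that is not JSON
        if isinstance(msg, dict) and msg.get("id") == msg_id and "method" not in msg:
            return msg
    raise RuntimeError("server closed stdout before responding")


def measure_server(command: List[str]) -> int:
    """Spawn a server over stdio and estimate the token cost of its tools/list result."""
    proc = subprocess.Popen(
        command, stdin=subprocess.PIPE, stdout=subprocess.PIPE, text=True
    )
    try:
        # Assumed handshake parameters; adjust to what your server expects.
        init_params = {
            "protocolVersion": "2024-11-05",
            "capabilities": {},
            "clientInfo": {"name": "token-ledger-sketch", "version": "0.0.0"},
        }
        proc.stdin.write(rpc("initialize", init_params, 1))
        proc.stdin.flush()
        read_response(proc.stdout, 1)
        proc.stdin.write(rpc("notifications/initialized"))
        proc.stdin.write(rpc("tools/list", {}, 2))
        proc.stdin.flush()
        response = read_response(proc.stdout, 2)
        # Assumption: the cost is the serialised `result` object of tools/list.
        return estimate_tokens(json.dumps(response["result"]))
    finally:
        proc.terminate()
```

Calling `measure_server` with whatever command list launches a local server returns a single estimated token count. Serialisation details such as whitespace and key order change the character count, so figures from a sketch like this will not necessarily match the table exactly.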

Editorial estimates are used for servers we can't spawn locally (commercial SaaS, auth-required, Python-only). They get replaced the moment we have real harness data.

Why this matters: context is a zero-sum resource. A 6,000-token MCP on a 200K-context model costs 3% of your window every turn, every conversation, forever. Stacking the three heaviest servers in the table (19,300 tokens combined) burns nearly 10% of that window before the model has read a single user message.
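The percentages follow from a single division. The snippet below is a minimal sketch of it; `context_share` is a hypothetical helper name, and the 200,000-token window is the example size used in the paragraph above.

```python
def context_share(tokens: int, window: int = 200_000) -> float:
    """Return the percentage of the context window consumed by `tokens`."""
    if window <= 0:
        raise ValueError("window must be positive")
    return 100 * tokens / window


if __name__ == "__main__":
    # A single 6,000-token server on a 200K window.
    print(f"{context_share(6_000):.2f}%")
    # Three servers at roughly 7,000 tokens each.
    print(f"{context_share(3 * 7_000):.2f}%")
    # The three heaviest rows in the table: 7,400 + 6,800 + 5,100.
    print(f"{context_share(7_400 + 6_800 + 5_100):.2f}%")
```

Passing a smaller `window` shows how quickly the same tool definitions dominate on models with less context. The full 55,951-token stack, for instance, would not fit in a 32,000-token window at all.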