MarkItDown
Microsoft's MarkItDown as an MCP. Convert PDF, Office, audio, video, images, web to clean Markdown for LLM ingestion.
Delv Safety Grade: A
Score 84/100 · assessed 2026-04-28
Microsoft's MarkItDown MCP server converts diverse file formats (PDF, Office documents, audio, video, images, web pages) into Markdown for LLM consumption. As an official Microsoft project, it benefits from strong organisational backing and active maintenance. The tool requires filesystem read access to process local files and network outbound for web page conversion. Audio and video transcription likely uses external services, though documentation doesn't specify which. The conversion scope is broad, touching multiple file types and potentially external APIs for OCR and transcription. Supply chain is solid via PyPI distribution, though the MCP wrapper package is newer than the core MarkItDown library. No security incidents recorded. The main risk is the breadth of file format handling, which increases attack surface for malformed inputs.
Lethal Trifecta (prompt-injection exposure)
TWO OF THREESame shape as markdownify with broader format support.
Green flags
- Official Microsoft project with strong organisational backing
- Core MarkItDown library widely used and tested
- Open source with clear documentation and active issues
- Standard PyPI distribution with versioning
- Read-focused operation, no destructive file operations
Red flags
- Audio/video transcription mechanism not clearly documented
- Broad file format support increases malformed input attack surface
- External API dependencies for OCR/transcription not fully specified
- MCP wrapper package newer, less battle-tested than core library
Permissions requested
Install
pip install markitdown-mcp
Review
Install this if you're regularly converting documents, PDFs, or media for LLM ingestion. It's reliable, officially maintained, and handles edge cases better than cobbled-together scripts. Skip it if you're only working with plain text or need real-time processing.
Good at
- Handles a genuinely wide range of formats without needing separate tools for each.
- Officially maintained by Microsoft, so it's not abandonware waiting to happen.
- Conversion quality is consistently better than most open-source alternatives, especially for complex PDFs.
- Clean Markdown output with proper heading hierarchy and table formatting.
- No API keys or external services required, everything runs locally.
Watch out
- Video conversion only extracts audio for transcription, no visual analysis.
- Large files take a long time to process, which can cause timeouts in Claude.
- No native batch processing, you're converting one file at a time unless you script it.
- Excel sheets become unwieldy Markdown tables if they're more than a few dozen rows.
- Hosts beyond Claude Desktop require manual config editing, no GUI setup.
Use cases
- Turning a PDF into Markdown for the agent to read
- Bulk-converting docx into a knowledge base
- Pulling text out of audio for transcription pipelines
- Cleaning up a downloaded webpage
Getting started
Works with
Similar MCPs
- ElevenLabsOfficial ElevenLabs MCP server. Text-to-speech, voice cloning, transcription, and audio sound design from inside Claude Desktop, Cursor, Windsurf, and other MCP clients.
- TimeReturns current time in any timezone, computes scheduling math, parses date strings. Boring but absurdly useful for any time-aware agent.
- Sequential ThinkingAnthropic's reasoning-loop MCP — gives Claude a structured 'think step by step' tool. Real impact on multi-step problem solving.
- Everything (reference)Anthropic's reference MCP that exercises every protocol feature. Required reading if you're building your own MCP.