Markdownify
zcaceres Markdownify MCP. Converts PDF, image, audio, DOCX/XLSX/PPTX, YouTube transcripts, and webpages into Markdown.
Delv Safety Grade: C
Score 58/100 · assessed 2026-04-28
Markdownify is a community MCP server by solo developer zcaceres that converts multiple file formats (PDF, images, audio, Office documents, YouTube videos, webpages) into Markdown. The repository is open source with reasonable documentation, but the install method is unclear and there's no standard package distribution. The server requires broad permissions including filesystem access for reading input files, network access for fetching YouTube transcripts and webpages, and likely shell execution for format conversion tools. The solo maintainer has reasonable GitHub activity, but the project lacks the robustness of vendor-backed tools. No security incidents are known, but the opaque installation process and broad permission surface create supply chain risk. Functional for its stated purpose but requires careful vetting before deployment.
Lethal Trifecta (prompt-injection exposure)
TWO OF THREEFetches a URL, returns markdown. The fetch is outbound; the body is untrusted. Common 2/3 in research stacks.
Green flags
- Open source with clear documentation on GitHub
- Specific, well-defined use case (format conversion to Markdown)
- Active repository with recent commits
- No known security incidents or malicious behaviour
- Useful for common workflows (YouTube transcripts, PDF conversion)
Red flags
- No standard package distribution, unclear install method
- Solo maintainer with limited bus factor
- Broad filesystem and network access required for conversions
- Likely depends on external conversion tools (OCR, ffmpeg, etc.)
- No versioning or dependency pinning visible
Permissions requested
Review
Install this if you regularly convert multiple formats into Markdown for research or note-taking. It's not the best at any one task, but the convenience of handling PDFs, Office files, YouTube, and images in a single MCP server outweighs the lack of specialisation. Skip it if you only need one format or require production-grade accuracy.
Good at
- Handles six different input formats (PDF, image, audio, Office docs, YouTube, web) in one server, so you avoid installing multiple dependencies.
- YouTube transcript extraction works directly from a URL, which is faster than copy-pasting from third-party sites.
- Office document conversion (PPTX, DOCX, XLSX) is reliable for text-heavy files, making it useful for client deliverables or meeting notes.
- No environment variables or API keys required, so setup is straightforward.
- Works in both Claude Desktop and Cursor, covering the two most common MCP hosts.
Watch out
- OCR quality on images is typical consumer-grade: fine for clean screenshots, poor for handwritten notes or low-resolution scans.
- PDF conversion struggles with complex layouts, embedded tables, or multi-column documents.
- Documentation is sparse, so you'll need to experiment to understand the limits of each conversion type.
- Community-maintained with no clear update cadence, so don't expect rapid bug fixes or feature additions.
Use cases
- Pulling YouTube transcripts via the agent
- Converting client-supplied PPTX into Markdown notes
- Web-to-Markdown for research pipelines
- OCR-style image-to-text in a chat
Getting started
Works with
Similar MCPs
- ElevenLabsOfficial ElevenLabs MCP server. Text-to-speech, voice cloning, transcription, and audio sound design from inside Claude Desktop, Cursor, Windsurf, and other MCP clients.
- TimeReturns current time in any timezone, computes scheduling math, parses date strings. Boring but absurdly useful for any time-aware agent.
- MarkItDownMicrosoft's MarkItDown as an MCP. Convert PDF, Office, audio, video, images, web to clean Markdown for LLM ingestion.
- Sequential ThinkingAnthropic's reasoning-loop MCP — gives Claude a structured 'think step by step' tool. Real impact on multi-step problem solving.