markdownify-mcp

MCP.Pizza Chef: zcaceres

Markdownify is a Model Context Protocol (MCP) server that converts a wide range of file types and web content into Markdown. It handles PDFs, images, audio files (with transcription), DOCX, XLSX, PPTX, YouTube video transcripts, Bing search results, and general web pages. Because every conversion is exposed through the same MCP tool interface, developers and AI agents can pull Markdown-formatted content into their workflows without a separate integration for each format.

Use This MCP Server To

  • Convert PDFs to Markdown for easy editing
  • Transcribe and convert audio files to Markdown
  • Extract and convert web pages to Markdown
  • Generate Markdown from YouTube video transcripts
  • Transform DOCX, XLSX, PPTX files into Markdown
  • Retrieve and serve existing Markdown documents

README

Markdownify MCP Server

Markdownify is a Model Context Protocol (MCP) server that converts various file types and web content to Markdown format. It provides a set of tools to transform PDFs, images, audio files, web pages, and more into easily readable and shareable Markdown text.


Features

  • Convert multiple file types to Markdown:
    • PDF
    • Images
    • Audio (with transcription)
    • DOCX
    • XLSX
    • PPTX
  • Convert web content to Markdown:
    • YouTube video transcripts
    • Bing search results
    • General web pages
  • Retrieve existing Markdown files

Getting Started

  1. Clone this repository
  2. Install dependencies:
    pnpm install
    

Note: this will also install uv and related Python dependencies.

  3. Build the project:
    pnpm run build

  4. Start the server:
    pnpm start
    

Development

  • Use pnpm run dev to start the TypeScript compiler in watch mode
  • Modify src/server.ts to customize server behavior
  • Add or modify tools in src/tools.ts
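
As a rough illustration of what adding a tool might involve, the sketch below describes a tool in the shape MCP uses for tool listings: a name, a description, and a JSON Schema inputSchema. The csv-to-markdown tool, its filepath parameter, the ToolDefinition type, and the missingArguments helper are all hypothetical; the actual layout of src/tools.ts may differ.

```typescript
// Hypothetical shape for a tool entry; the real src/tools.ts may be organized differently.
interface ToolDefinition {
  name: string;
  description: string;
  inputSchema: {
    type: "object";
    properties: Record<string, { type: string; description: string }>;
    required: string[];
  };
}

// Illustrative only: "csv-to-markdown" is not one of the server's existing tools.
const csvToMarkdownTool: ToolDefinition = {
  name: "csv-to-markdown",
  description: "Convert a CSV file to Markdown",
  inputSchema: {
    type: "object",
    properties: {
      filepath: { type: "string", description: "Absolute path of the CSV file to convert" },
    },
    required: ["filepath"],
  },
};

// Returns the names of required arguments that are absent from a call's arguments.
function missingArguments(tool: ToolDefinition, args: Record<string, unknown>): string[] {
  return tool.inputSchema.required.filter((key) => !(key in args));
}
```

A handler for such a tool could call missingArguments before doing any conversion work and reject the request if the returned list is not empty.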

Usage with Desktop App

To integrate this server with a desktop app, add the following to your app's server configuration. The env block is optional: by default, the server uses the default install location of uv, so set UV_PATH only if uv is installed somewhere else.

{
  "mcpServers": {
    "markdownify": {
      "command": "node",
      "args": [
        "{ABSOLUTE PATH TO FILE HERE}/dist/index.js"
      ],
      "env": {
        "UV_PATH": "/path/to/uv"
      }
    }
  }
}
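
Since the entry-point path has to be absolute and strict JSON does not allow comments, it can be convenient to generate this entry rather than type it by hand. The following is a minimal sketch, assuming the repository has been built so that dist/index.js exists; the buildServerConfig function, the ServerEntry type, and the example checkout path are illustrative and not part of the project.

```typescript
import * as path from "node:path";

interface ServerEntry {
  command: string;
  args: string[];
  env?: Record<string, string>;
}

// Builds the "mcpServers" configuration for a local checkout.
// repoDir: absolute path of the cloned repository (assumed to contain dist/index.js after building).
// uvPath: optional; when omitted, UV_PATH is not set and the server falls back to uv's default location.
function buildServerConfig(
  repoDir: string,
  uvPath?: string,
): { mcpServers: Record<string, ServerEntry> } {
  if (!path.isAbsolute(repoDir)) {
    throw new Error(`repoDir must be an absolute path, got: ${repoDir}`);
  }
  const entry: ServerEntry = {
    command: "node",
    args: [path.join(repoDir, "dist", "index.js")],
  };
  if (uvPath !== undefined) {
    entry.env = { UV_PATH: uvPath };
  }
  return { mcpServers: { markdownify: entry } };
}

// Example with a hypothetical checkout location; paste the printed JSON into the app's configuration.
console.log(JSON.stringify(buildServerConfig("/home/me/markdownify-mcp"), null, 2));
```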

Available Tools

  • youtube-to-markdown: Convert YouTube videos to Markdown

  • pdf-to-markdown: Convert PDF files to Markdown

  • bing-search-to-markdown: Convert Bing search results to Markdown

  • webpage-to-markdown: Convert web pages to Markdown

  • image-to-markdown: Convert images to Markdown with metadata

  • audio-to-markdown: Convert audio files to Markdown with transcription

  • docx-to-markdown: Convert DOCX files to Markdown

  • xlsx-to-markdown: Convert XLSX files to Markdown

  • pptx-to-markdown: Convert PPTX files to Markdown

  • get-markdown-file: Retrieve an existing Markdown file. The file name must end with .md or .markdown.

    OPTIONAL: set MD_SHARE_DIR env var to restrict the directory from which files can be retrieved, e.g. MD_SHARE_DIR=[SOME_PATH] pnpm run start
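
The two constraints on get-markdown-file (an allowed extension and, optionally, a restricted directory) could be checked along the following lines. This is only a sketch of one possible approach, not the server's actual code: isAllowedMarkdownPath is a hypothetical helper, the example file path is made up, and the check does not resolve symbolic links.

```typescript
import * as path from "node:path";

const MARKDOWN_EXTENSIONS = [".md", ".markdown"];

// Returns true when filePath has a Markdown extension and, if shareDir is given,
// resolves to a location inside shareDir.
function isAllowedMarkdownPath(filePath: string, shareDir?: string): boolean {
  if (!MARKDOWN_EXTENSIONS.includes(path.extname(filePath))) {
    return false;
  }
  if (shareDir === undefined || shareDir === "") {
    return true; // no directory restriction configured
  }
  const root = path.resolve(shareDir);
  const target = path.resolve(filePath);
  // path.relative starts with a ".." segment (or is absolute, on another drive)
  // when target lies outside root.
  const rel = path.relative(root, target);
  return rel !== "" && rel.split(path.sep)[0] !== ".." && !path.isAbsolute(rel);
}

// MD_SHARE_DIR is read from the environment, as described above.
const shareDir = process.env.MD_SHARE_DIR;
console.log(isAllowedMarkdownPath("/srv/notes/todo.md", shareDir));
```

Resolving both paths before comparing them means that a request such as /srv/notes/../secret.md is judged by where it actually points, not by its literal prefix.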

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

markdownify-mcp FAQ

How do I install the Markdownify MCP server?
Clone the repository, install dependencies with 'pnpm install', build with 'pnpm run build', then start the server using 'pnpm start'.
What file types can Markdownify convert?
It supports PDFs, images, audio files (with transcription), DOCX, XLSX, PPTX, and web content including YouTube transcripts and Bing search results.
Does Markdownify support audio transcription?
Yes, it transcribes audio files and converts the transcriptions into Markdown format.
Can I convert web pages to Markdown using this server?
Yes, Markdownify can convert general web pages, YouTube video transcripts, and Bing search results into Markdown.
Is Markdownify compatible with multiple LLM providers?
Yes. The server is model-agnostic: any MCP-compatible client can call its tools, whether the underlying model comes from OpenAI, Anthropic (Claude), Google (Gemini), or another provider.
What dependencies are required to run Markdownify?
It requires Node.js and pnpm, plus uv and the related Python dependencies, which 'pnpm install' also installs.
How does Markdownify handle existing Markdown files?
It can retrieve and serve existing Markdown files directly through the MCP interface.
Can I extend Markdownify to support additional file types?
Yes. The server is open source, so you can add or modify tools in src/tools.ts to handle more formats.