Fire in da houseTop Tip:Paying $100+ per month for Perplexity, MidJourney, Runway, ChatGPT and other tools is crazy - get all your AI tools in one site starting at $15 per month with Galaxy AI Fire in da houseCheck it out free

mcp-server-datahub

MCP.Pizza Chef: acryldata

The mcp-server-datahub is an official MCP server implementation for DataHub, allowing AI agents to query metadata, lineage, and context across the DataHub ecosystem. It supports both DataHub Core and Cloud, enabling search across entity types, metadata retrieval, lineage traversal, and SQL query listing for datasets. This server facilitates real-time, structured access to data ecosystem insights for enhanced AI-driven workflows.

Use This MCP server To

Query DataHub metadata for any data entity Traverse upstream and downstream lineage graphs Search across all DataHub entity types with filters List SQL queries associated with datasets Integrate DataHub context into AI agent workflows Enable real-time data ecosystem insights for models Support metadata-driven decision making in data apps

README

mcp-server-datahub

A Model Context Protocol server implementation for DataHub. This enables AI agents to query DataHub for metadata and context about your data ecosystem.

Supports both DataHub Core and DataHub Cloud.

Features

  • Searching across all entity types and using arbitrary filters
  • Fetching metadata for any entity
  • Traversing the lineage graph, both upstream and downstream
  • Listing SQL queries associated with a dataset

Demo

Check out the demo video, done in collaboration with the team at Block.

Usage

  1. Install uv

    # On macOS and Linux.
    curl -LsSf https://astral.sh/uv/install.sh | sh
  2. Locate your authentication details

    For authentication, you'll need the following:

    Alternative: Using ~/.datahubenv for authentication

    You can also use a ~/.datahubenv file to configure your authentication. The easiest way to create this file is to run datahub init and follow the prompts.

    uvx --from acryl-datahub datahub init
  3. Configure your MCP client. See below - this will vary depending on your agent.

Claude Desktop

Run which uvx to find the full path to the uvx command.

In your claude_desktop_config.json file, add the following:

{
  "mcpServers": {
    "datahub": {
      "command": "<full-path-to-uvx>",  // e.g. /Users/hsheth/.local/bin/uvx
      "args": ["mcp-server-datahub"],
      "env": {
        "DATAHUB_GMS_URL": "<your-datahub-url>",
        "DATAHUB_GMS_TOKEN": "<your-datahub-token>"
      }
    }
  }
}

Cursor

In .cursor/mcp.json, add the following:

{
  "mcpServers": {
    "datahub": {
      "command": "uvx",
      "args": ["mcp-server-datahub"],
      "env": {
        "DATAHUB_GMS_URL": "<your-datahub-url>",
        "DATAHUB_GMS_TOKEN": "<your-datahub-token>"
      }
    }
  }
}

Other MCP Clients

command: uvx
args:
  - mcp-server-datahub
env:
  DATAHUB_GMS_URL: <your-datahub-url>
  DATAHUB_GMS_TOKEN: <your-datahub-token>

Troubleshooting

spawn uvx ENOENT

The full stack trace might look like this:

2025-04-08T19:58:16.593Z [datahub] [error] spawn uvx ENOENT {"stack":"Error: spawn uvx ENOENT\n    at ChildProcess._handle.onexit (node:internal/child_process:285:19)\n    at onErrorNT (node:internal/child_process:483:16)\n    at process.processTicksAndRejections (node:internal/process/task_queues:82:21)"}

Solution: Replace the uvx bit of the command with the output of which uvx.

Developing

See DEVELOPING.md.

mcp-server-datahub FAQ

How do I authenticate the mcp-server-datahub?
Use your DataHub instance URL and a personal access token for authentication.
Does mcp-server-datahub support both DataHub Core and Cloud?
Yes, it supports both DataHub Core and DataHub Cloud environments.
What types of metadata can I query with this server?
You can query metadata for any entity type within DataHub, including datasets, charts, dashboards, and more.
Can I traverse lineage graphs using this MCP server?
Yes, it supports traversing both upstream and downstream lineage graphs.
How do I install the mcp-server-datahub?
Install the 'uv' tool first, then configure the server with your DataHub URL and token.
Is there a demo available to see mcp-server-datahub in action?
Yes, a demo video is available showcasing its capabilities in collaboration with Block.
What kind of filters can I use when searching entities?
You can use arbitrary filters to refine searches across all entity types.
Can this server list SQL queries related to datasets?
Yes, it can list SQL queries associated with any dataset in DataHub.