Use This MCP Server To

Run SQL queries on Databricks SQL warehouses via LLMs
List all active and scheduled Databricks jobs
Retrieve detailed status and metadata of specific Databricks jobs
Automate monitoring of Databricks job executions
Integrate Databricks job data into AI-driven workflows
Enable LLMs to interact with Databricks for data analysis
Fetch job information for reporting or alerting systems

README

Databricks MCP Server

A Model Context Protocol (MCP) server that connects to the Databricks API, allowing LLMs to run SQL queries, list jobs, and get job status and details.

Features

Run SQL queries on Databricks SQL warehouses
List all Databricks jobs
Get status of specific Databricks jobs
Get detailed information about Databricks jobs

Prerequisites

Python 3.7+
Databricks workspace with:
- Personal access token
- SQL warehouse endpoint
- Permissions to run queries and access jobs

Setup

Clone this repository

Create and activate a virtual environment (recommended):

python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

Install dependencies:
```
pip install -r requirements.txt
```

Create a .env file in the root directory with the following variables:

DATABRICKS_HOST=your-databricks-instance.cloud.databricks.com
DATABRICKS_TOKEN=your-personal-access-token
DATABRICKS_HTTP_PATH=/sql/1.0/warehouses/your-warehouse-id
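As a minimal sketch of how these variables might be read and checked before starting the server: the repository's own loading code is not shown here, so the `parse_env` and `missing_vars` helpers below are hypothetical and use only the standard library.

```python
from typing import Dict, List

REQUIRED_VARS = ("DATABRICKS_HOST", "DATABRICKS_TOKEN", "DATABRICKS_HTTP_PATH")


def parse_env(text: str) -> Dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blanks and # comments."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_vars(values: Dict[str, str]) -> List[str]:
    """Return the required variable names that are absent or empty."""
    return [name for name in REQUIRED_VARS if not values.get(name)]
```

Calling `missing_vars(parse_env(open(".env").read()))` returns an empty list when all three variables are set, which makes a missing entry easy to spot before running `test_connection.py`.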

Test your connection (optional but recommended):
```
python test_connection.py
```

Obtaining Databricks Credentials

Host: Your Databricks workspace hostname, without the https:// prefix (e.g., your-instance.cloud.databricks.com)
Token: Create a personal access token in Databricks:
- Go to User Settings (click your username in the top right)
- Select "Developer" tab
- Click "Manage" under "Access tokens"
- Generate a new token, and save it immediately
HTTP Path: For your SQL warehouse:
- Go to SQL Warehouses in Databricks
- Select your warehouse
- Find the connection details and copy the HTTP Path

Running the Server

Start the MCP server:

python main.py

You can test the server with the MCP Inspector by running:

npx @modelcontextprotocol/inspector python3 main.py

Available MCP Tools

The following MCP tools are available:

run_sql_query(sql: str) - Execute SQL queries on your Databricks SQL warehouse
list_jobs() - List all Databricks jobs in your workspace
get_job_status(job_id: int) - Get the status of a specific Databricks job by ID
get_job_details(job_id: int) - Get detailed information about a specific Databricks job
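For illustration, an MCP client invokes tools like these with a JSON-RPC 2.0 `tools/call` request. The sketch below builds such a message by hand; the `tool_call` helper is hypothetical, and in practice the client library constructs these messages for you.

```python
import json
from typing import Any, Dict


def tool_call(request_id: int, name: str, arguments: Dict[str, Any]) -> str:
    """Serialize an MCP `tools/call` request as a JSON-RPC 2.0 message."""
    message = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }
    return json.dumps(message)


# Example requests for two of the tools listed above.
print(tool_call(1, "run_sql_query", {"sql": "SHOW TABLES"}))
print(tool_call(2, "get_job_status", {"job_id": 123}))
```

The argument names (`sql`, `job_id`) follow the tool signatures listed above.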

Example Usage with LLMs

When used with an LLM client that supports MCP, this server enables natural language interaction with your Databricks environment. Example prompts:

"Show me all tables in the database"
"Run a query to count records in the customer table"
"List all my Databricks jobs"
"Check the status of job #123"
"Show me details about job #456"
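To send prompts like these, the server first has to be registered with an MCP client. Several clients (Claude Desktop is one) read a JSON file containing an `mcpServers` entry. The sketch below generates such an entry, assuming the server communicates over stdio as the inspector command suggests; the server name `databricks` and both paths are placeholders, not values taken from this repository.

```python
import json


def client_config(python_path: str, script_path: str) -> dict:
    """Build an `mcpServers` entry that launches main.py as a subprocess."""
    return {
        "mcpServers": {
            "databricks": {  # hypothetical server name; any label works
                "command": python_path,
                "args": [script_path],
            }
        }
    }


# Placeholder paths: point these at your virtual environment and your clone.
config = client_config("/path/to/.venv/bin/python", "/path/to/main.py")
print(json.dumps(config, indent=2))
```

If the client launches the server from a different working directory, the .env file may not be found. In that case the three variables can usually be supplied through an `env` object next to `args`, where the client supports one.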

Troubleshooting

Connection Issues

Ensure your Databricks host is correct and doesn't include https:// prefix
Check that your SQL warehouse is running and accessible
Verify your personal access token has the necessary permissions
Run the included test script: python test_connection.py
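If the test script is not conclusive, the host and token can be checked separately from the SQL warehouse. The following is a rough sketch, not the contents of `test_connection.py`: `normalize_host` and `jobs_list_request` are hypothetical helpers, and the request targets the Databricks Jobs API 2.1 list endpoint, which may differ from the calls this server makes internally.

```python
import urllib.request
from urllib.parse import urlparse


def normalize_host(host: str) -> str:
    """Strip any scheme, path, and trailing slash, leaving the bare hostname."""
    host = host.strip()
    if "://" not in host:
        host = "https://" + host  # give urlparse a scheme so netloc is populated
    return urlparse(host).netloc


def jobs_list_request(host: str, token: str) -> urllib.request.Request:
    """Build (but do not send) an authenticated Jobs API list request."""
    url = f"https://{normalize_host(host)}/api/2.1/jobs/list"
    return urllib.request.Request(url, headers={"Authorization": f"Bearer {token}"})


req = jobs_list_request("https://your-instance.cloud.databricks.com/", "your-token")
print(req.full_url)
```

Sending the request with `urllib.request.urlopen(req, timeout=10)` separates the two failure modes: a connection or DNS error generally points at the host value, while an HTTP 401 or 403 response generally points at the token or its permissions.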

Security Considerations

Your Databricks personal access token provides direct access to your workspace
Secure your .env file and never commit it to version control
Consider using a Databricks token limited to the permissions this server needs
Run this server in a secure environment
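Two of these points can be checked mechanically. The sketch below is a simplified illustration with hypothetical helper names: it only recognizes a literal `.env` entry in .gitignore (real gitignore patterns such as `*.env` are not evaluated), and the permission check applies to POSIX file modes.

```python
import stat


def is_env_ignored(gitignore_text: str) -> bool:
    """Return True if the .gitignore text lists `.env` as a literal entry."""
    entries = {line.strip() for line in gitignore_text.splitlines()}
    return bool(entries & {".env", "/.env"})


def mode_is_private(mode: int) -> bool:
    """Return True if group and others have no permissions in this file mode."""
    return not (mode & (stat.S_IRWXG | stat.S_IRWXO))


print(is_env_ignored(".venv/\n.env\n"))
print(mode_is_private(0o100600))
```

In practice the mode would come from `os.stat(".env").st_mode`, and `chmod 600 .env` restricts the file to its owner.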

mcp-databricks-server FAQ

How do I authenticate the mcp-databricks-server with my Databricks workspace?

You authenticate by providing a personal access token and the Databricks host name in the .env configuration file.

What Python version is required to run the mcp-databricks-server?

Python 3.7 or higher is required to run this MCP server.

Can the mcp-databricks-server execute arbitrary SQL queries?

Yes, it can run SQL queries on configured Databricks SQL warehouses through the API.

What permissions are needed in Databricks to use this server?

You need permissions to run SQL queries and access job information in your Databricks workspace.

How do I set up the environment variables for this server?

Create a .env file with DATABRICKS_HOST, DATABRICKS_TOKEN, and DATABRICKS_HTTP_PATH variables as per your Databricks workspace details.

Is it possible to get real-time job status updates using this server?

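Each call to get_job_status or get_job_details fetches the job's current state from Databricks. The listed tools return a result per request, so ongoing monitoring means calling them repeatedly.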
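A polling loop is one way to approximate live updates on top of per-request tools. The sketch below is an assumption-laden illustration: `fetch_state` is a hypothetical callable wrapping whatever invokes the status tool, and the returned dictionary is assumed to expose `life_cycle_state` as in the Databricks Jobs API 2.1 run state, which may not match this server's actual output format.

```python
import time
from typing import Callable, Dict, Optional

# Life-cycle states after which a Databricks run no longer changes.
TERMINAL_STATES = {"TERMINATED", "SKIPPED", "INTERNAL_ERROR"}


def wait_for_run(
    fetch_state: Callable[[], Dict[str, str]],
    interval_s: float = 30.0,
    max_polls: int = 20,
    sleep: Callable[[float], None] = time.sleep,
) -> Optional[Dict[str, str]]:
    """Poll until the run reaches a terminal state; return None if polls run out."""
    for attempt in range(max_polls):
        state = fetch_state()
        if state.get("life_cycle_state") in TERMINAL_STATES:
            return state
        if attempt < max_polls - 1:
            sleep(interval_s)  # wait before asking again
    return None
```

The `sleep` parameter is injectable so the loop can be exercised without real delays.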

Which LLM providers can work with this MCP server?

This MCP server is provider-agnostic: it can be used from any client that supports MCP, whether the underlying model comes from OpenAI, Anthropic (Claude), or Google (Gemini).

How do I install dependencies for the mcp-databricks-server?

After cloning the repo, activate a Python virtual environment and run 'pip install -r requirements.txt'.