Overview
This demo showcases how to set up and use the dlt MCP Server for data pipeline validation and inspection. The MCP server enables interactive querying and management of dlt pipelines, including data inspection, row counts, and load validation.
Prerequisites
- UV installed on your local machine.
- dlt workspace installed and configured.
- A pyproject.toml file with the necessary dependencies.
Setup
1. Configure MCP Server
VS Code
- Open VS Code and access Settings (Command+Shift+P).
- Navigate to Tools > MCP.
- Click Add Custom MCP to create/open the mcp.json file.
- Add the configuration for the dlt MCP Server to mcp.json:
{
  "name": "dlt-mcp-server",
  "command": "dlt pipeline ...",
  "args": ["--with", "duckdb"]
}
- Ensure you include the duckdb dependency if using a DuckDB destination.
- Save the file. The MCP server will automatically update within a few seconds.
Cursor
- Open Cursor and access Settings (Command+,).
- Navigate to Extensions > MCP.
- Click Add Custom MCP to create/open the mcp.json file.
- Add the configuration for the dlt MCP Server to mcp.json:
{
  "name": "dlt-mcp-server",
  "command": "dlt pipeline ...",
  "args": ["--with", "duckdb"]
}
- Save the file. The MCP server will automatically update within a few seconds.
Kiro
- Open Kiro and access Preferences (Command+,).
- Navigate to Plugins > MCP.
- Click Add Custom MCP to create/open the mcp.json file.
- Add the configuration for the dlt MCP Server to mcp.json:
{
  "name": "dlt-mcp-server",
  "command": "dlt pipeline ...",
  "args": ["--with", "duckdb"]
}
- Save the file. The MCP server will automatically update within a few seconds.
Claude Desktop
- Open Claude Desktop and access Settings (Command+,).
- Navigate to Integrations > MCP.
- Click Add Custom MCP to create/open the mcp.json file.
- Add the configuration for the dlt MCP Server to mcp.json:
{
  "name": "dlt-mcp-server",
  "command": "dlt pipeline ...",
  "args": ["--with", "duckdb"]
}
- Save the file. The MCP server will automatically update within a few seconds.
Other IDEs (e.g., PyCharm, IntelliJ, Sublime Text)
- Locate the MCP configuration section in your IDE's settings.
- Create or open the mcp.json file.
- Add the configuration for the dlt MCP Server to mcp.json:
{
  "name": "dlt-mcp-server",
  "command": "dlt pipeline ...",
  "args": ["--with", "duckdb"]
}
- Save the file. The MCP server will automatically update within a few seconds.
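Since every IDE section uses the same entry, the dependency list can also be adjusted programmatically. The following is a minimal sketch, not part of dlt or any IDE: the helper name ensure_dependency and the package name example-package are hypothetical, and it assumes the launcher accepts one "--with", "<package>" pair per extra dependency, following the pattern in the configuration shown in this section.

```python
import json


def ensure_dependency(entry: dict, package: str) -> dict:
    """Return a copy of an MCP server entry whose args include --with <package>."""
    args = list(entry.get("args", []))
    # Look for an existing ("--with", package) pair among adjacent arguments.
    already_present = ("--with", package) in zip(args, args[1:])
    if not already_present:
        args += ["--with", package]
    return {**entry, "args": args}


entry = {
    "name": "dlt-mcp-server",
    "command": "dlt pipeline ...",  # elided in the original post; left as-is
    "args": ["--with", "duckdb"],
}

unchanged = ensure_dependency(entry, "duckdb")          # duckdb is already listed
extended = ensure_dependency(entry, "example-package")  # hypothetical extra package
print(json.dumps(extended, indent=2))
```

The helper returns a new dictionary instead of mutating its input, so the original entry can still be compared against the result before anything is written back to mcp.json.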
Usage
1. Test MCP Server
- Open a chat in your IDE and ask:
What pipelines are available?
- The MCP server should list the available pipelines (e.g., GitHub pipeline).
2. Inspect Pipeline Data
- Ask:
What tables are in this pipeline?
- The server will list tables (e.g., commits, contributors).
- Ask:
When was the data last loaded?
- The server will provide the timestamp of the last data load.
3. Validate Data
- Ask:
How many rows are in the commits table?
- If the MCP server lacks dependencies (e.g., duckdb), it will throw an error. Update the mcp.json configuration to include the missing dependency and retry.
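For intuition, a row-count question like the one above comes down to a COUNT(*) query against the destination. The sketch below illustrates that idea only and does not show how the MCP server is implemented: it uses Python's built-in sqlite3 as a stand-in for DuckDB so it runs without extra packages, and the table contents and the row_count helper are made up for the example.

```python
import sqlite3


def row_count(conn: sqlite3.Connection, table: str) -> int:
    """Count the rows in a table of the connected database."""
    # Table names cannot be bound as query parameters, so validate before interpolating.
    if not table.isidentifier():
        raise ValueError(f"unexpected table name: {table!r}")
    return conn.execute(f'SELECT COUNT(*) FROM "{table}"').fetchone()[0]


# In-memory stand-in for the pipeline's destination, with a tiny commits table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE commits (sha TEXT, author TEXT)")
conn.executemany(
    "INSERT INTO commits VALUES (?, ?)",
    [("a1", "ann"), ("b2", "bob"), ("c3", "ann")],
)

print(row_count(conn, "commits"))  # 3
```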
4. Agentic Help
- Ask:
How many rows will be extracted in the next run in commits?
- The MCP server will analyze the pipeline and confirm whether incremental loading is applied. If it is not, the next run will re-extract all existing rows plus any new data added since the last run.
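The difference between the two cases can be illustrated with a few lines of plain Python. This is a conceptual sketch only: it does not call dlt, and the function name rows_for_next_run, the updated_at cursor field, and the sample rows are all assumptions made for the example.

```python
from datetime import datetime


def rows_for_next_run(rows, last_cursor=None, cursor_field="updated_at"):
    """Rows a run would extract: all rows without a cursor, else only newer ones."""
    if last_cursor is None:
        # No incremental loading: every row in the source is extracted again.
        return list(rows)
    # Incremental loading: only rows past the last seen cursor value are extracted.
    return [row for row in rows if row[cursor_field] > last_cursor]


source_rows = [
    {"sha": "a1", "updated_at": datetime(2024, 1, 1)},
    {"sha": "b2", "updated_at": datetime(2024, 1, 5)},
    {"sha": "c3", "updated_at": datetime(2024, 1, 9)},
]
last_run = datetime(2024, 1, 5)

full = rows_for_next_run(source_rows)
incremental = rows_for_next_run(source_rows, last_cursor=last_run)
print(len(full), len(incremental))  # 3 1
```

Without a cursor the run sees all three rows again; with the cursor set to the last run, only the one row updated afterwards is extracted.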
Troubleshooting
- Dependency Errors: Ensure all required dependencies (e.g., duckdb) are included in the mcp.json configuration.
- Configuration Updates: After modifying mcp.json, wait a few seconds for the MCP server to apply changes.
- IDE-Specific Issues: Refer to your IDE's documentation for MCP-related troubleshooting.
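When chasing a dependency error, it can help to check which modules are importable in a given Python environment. A minimal sketch using only the standard library follows; the helper name missing_modules is hypothetical. Note that it inspects the environment the script runs in, which may differ from the environment the MCP server is launched in.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that cannot be imported here."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# json ships with Python; duckdb is listed only if it is not installed here.
print(missing_modules(["json", "duckdb"]))
```

Any name this reports as missing for the server's environment is a candidate to add to the args list in mcp.json.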
Conclusion
The dlt MCP Server simplifies pipeline management by enabling interactive data inspection and validation. Customize the mcp.json configuration to support your specific pipeline destinations and dependencies.