DINO-X
Official DINO-X Model Context Protocol (MCP) server that empowers LLMs with real-world visual perception through image object detection, localization, and captioning APIs.
What is DINO-X?
DINO-X is a Model Context Protocol (MCP) server that allows AI assistants like Claude, Cursor, and VS Code to official dino-x model context protocol (mcp) server that empowers llms with real-world visual perception through image object detection, localization, and captioning apis.
Official DINO-X Model Context Protocol (MCP) server that empowers LLMs with real-world visual perception through image object detection, localization, and captioning APIs.
This server falls under the Data Science & ML category on MCPgee, the world's largest MCP server directory with 33,000+ servers.
Features
- Official DINO-X Model Context Protocol (MCP) server that emp
Use Cases
Maintainer
Works with
Installation
Manual Installation
npx dino-x-mcpConfiguration
Configuration Details
claude_desktop_config.json
Performance
Response Metrics
Resource Usage
How to Set Up and Use DINO-X
DINO-X MCP is the official Model Context Protocol server for the DINO-X computer vision platform by IDEA Research. It gives language models real-world visual perception by exposing object detection, text-prompted localization, human pose estimation, and image captioning through a clean MCP tool interface. Developers use it to build AI pipelines that can understand, count, and reason about objects in images without writing custom vision API wrappers.
Prerequisites
- Node.js 18 or later installed on your system
- A DINO-X API key from the DINO-X platform (deepdataspace.com)
- An MCP-compatible client such as Claude Desktop or Claude Code
- Images accessible as local files (STDIO mode) or public HTTPS URLs (HTTP mode)
Obtain a DINO-X API key
Sign up at the DINO-X platform (deepdataspace.com), navigate to your account settings, and generate an API key. Keep this key handy — it is required to authenticate every request.
Run the server with npx
The easiest way to start the server is with npx. The package is published under the scoped name @deepdataspace/dinox-mcp. Pass your API key as a CLI flag or environment variable.
npx -y @deepdataspace/dinox-mcp --dinox-api-key=YOUR_API_KEYConfigure your MCP client
Add the server to your Claude Desktop or other MCP client configuration file. Set DINOX_API_KEY in the env block. Optionally set IMAGE_STORAGE_DIRECTORY to a writable path where annotated result images will be saved.
{
"mcpServers": {
"dino-x": {
"command": "npx",
"args": ["-y", "@deepdataspace/dinox-mcp"],
"env": {
"DINOX_API_KEY": "your-dinox-api-key",
"IMAGE_STORAGE_DIRECTORY": "/tmp/dinox-results"
}
}
}
}Restart your MCP client
Restart Claude Desktop (or reload your MCP client) so it picks up the new server configuration. You should see the DINO-X tools listed in the available tools panel.
Detect objects in an image
Use the detect-all-objects tool to run full-scene object detection on an image. In STDIO mode provide a file:// URL; in HTTP mode provide an https:// URL. The tool returns categories, bounding boxes, and captions for every detected object.
Use text-prompted detection
Use detect-objects-by-text to find specific objects by providing English noun phrases. This lets you ask the model to locate particular items like 'forklift' or 'person wearing helmet' rather than detecting everything.
DINO-X Examples
Client configuration
Claude Desktop config using npx with the scoped package name and required environment variable.
{
"mcpServers": {
"dino-x": {
"command": "npx",
"args": ["-y", "@deepdataspace/dinox-mcp"],
"env": {
"DINOX_API_KEY": "your-dinox-api-key",
"IMAGE_STORAGE_DIRECTORY": "/tmp/dinox-results"
}
}
}
}Prompts to try
Example natural-language prompts that exercise the DINO-X vision tools.
- "Detect all objects in this image: file:///home/user/warehouse.jpg and count how many forklifts are present"
- "Find all people wearing hard hats in file:///home/user/site.png using text-prompted detection"
- "Analyze the human poses in file:///home/user/exercise.jpg and describe what exercise the person is performing"
- "Run full-scene detection on https://example.com/product.jpg and list every visible item with its bounding box"Troubleshooting DINO-X
Error: DINOX_API_KEY is not set or invalid
Make sure the DINOX_API_KEY environment variable is present in your MCP client config's env block and that the key is active. You can verify the key by testing it directly with the DINO-X REST API before using the MCP server.
Images fail to load in STDIO mode with HTTPS URLs
STDIO transport only accepts file:// URLs. Switch to HTTP transport mode (--http flag) when you need to process remote HTTPS image URLs. In HTTP mode file:// paths are not supported.
Annotated output images are not being saved
The IMAGE_STORAGE_DIRECTORY must exist and be writable by the process running the server. Create the directory first and ensure the path is absolute. This feature only works in STDIO mode.
Frequently Asked Questions about DINO-X
What is DINO-X?
DINO-X is a Model Context Protocol (MCP) server that official dino-x model context protocol (mcp) server that empowers llms with real-world visual perception through image object detection, localization, and captioning apis. It connects AI assistants to external tools and data sources through a standardized interface.
How do I install DINO-X?
Follow the installation instructions on the DINO-X GitHub repository. Clone the repo, install dependencies, and add the server config to your AI client.
Which AI clients work with DINO-X?
DINO-X works with all major MCP-compatible AI clients including Claude Desktop, Claude Code, Cursor, VS Code (GitHub Copilot), Windsurf, and Cline.
Is DINO-X free to use?
Yes, DINO-X is open source and available under the Apache-2.0 license. You can use it freely in both personal and commercial projects.
DINO-X Alternatives — Similar Data Science & ML Servers
Looking for alternatives to DINO-X? Here are other popular data science & ml servers you can use with Claude, Cursor, and VS Code.
Ultrarag
★ 5.6kA Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines
RocketRide
★ 3.1k📇 🏠 - MCP server that exposes RocketRide AI pipelines as t
Aix Db
★ 2.1kAix-DB 基于 LangChain/LangGraph 框架,结合 MCP Skills 多智能体协作架构,实现自然语言到数据洞察的端到端转换。
NeMo Data Designer
★ 1.9k🎨 NeMo Data Designer: Generate high-quality synthetic data from scratch or from seed data.
PaperBanana
★ 1.7kOpen source implementation and extension of Google Research’s PaperBanana for automated academic figures, diagrams, and research visuals, expanded to new domains like slide generation.
MiniMax
★ 1.5kBridges MiniMax AI capabilities to the Model Context Protocol, enabling AI agents to perform image understanding, text-to-image generation, and speech synthesis. It provides a standardized interface for accessing MiniMax's core tools via JSON-RPC.
Browse More Data Science & ML MCP Servers
Explore all data science & ml servers available in the MCPgee directory. Each server includes setup guides for Claude, Cursor, and VS Code.
Set Up DINO-X in Your Editor
Choose your AI client for step-by-step setup instructions.
Quick Config Preview
Add this to your claude_desktop_config.json or .cursor/mcp.json
Ready to use DINO-X?
Browse our complete directory of 33,000+ MCP servers, read setup guides for your editor, and start building with the Model Context Protocol.