DINO-X

v1.0.0 · Data Science & ML · stable

Official DINO-X Model Context Protocol (MCP) server that empowers LLMs with real-world visual perception through image object detection, localization, and captioning APIs.

Tags: image-recognition, mcp, mcp-server, object-detection, pose-estimation
114 Stars · 0 Downloads · 0 Weekly · 0/5

What is DINO-X?

DINO-X is a Model Context Protocol (MCP) server that lets AI assistants and MCP clients such as Claude, Cursor, and VS Code detect, localize, and caption objects in images.

This server falls under the Data Science & ML category on MCPgee, the world's largest MCP server directory with 33,000+ servers.

Features

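  • Full-scene object detection that returns categories, bounding boxes, and captions
  • Text-prompted detection from English noun phrases
  • Human pose estimation
  • Image captioning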

Use Cases

Enable real-world visual perception for LLMs through object detection.
Perform image captioning and object localization with DINO-X.
Analyze visual data and understand image content with AI-powered computer vision.

Maintainer: IDEA-Research
License: Apache-2.0
Language: TypeScript
Version: v1.0.0
Updated: May 7, 2026
Status: healthy
Maintenance: active

Works with

Claude, OpenAI, Windows, macOS, Linux

Installation

Manual Installation

npx -y @deepdataspace/dinox-mcp

Configuration

Config file: claude_desktop_config.json

Performance

Response Metrics

Response Time: < 200ms
Throughput: Medium

Resource Usage

Memory Usage: Low
CPU Usage: Low

How to Set Up and Use DINO-X

DINO-X MCP is the official Model Context Protocol server for the DINO-X computer vision platform by IDEA Research. It gives language models real-world visual perception by exposing object detection, text-prompted localization, human pose estimation, and image captioning through a clean MCP tool interface. Developers use it to build AI pipelines that can understand, count, and reason about objects in images without writing custom vision API wrappers.

Prerequisites

  • Node.js 18 or later installed on your system
  • A DINO-X API key from the DINO-X platform (deepdataspace.com)
  • An MCP-compatible client such as Claude Desktop or Claude Code
  • Images accessible as local files (STDIO mode) or public HTTPS URLs (HTTP mode)

1. Obtain a DINO-X API key

Sign up at the DINO-X platform (deepdataspace.com), navigate to your account settings, and generate an API key. Keep this key handy; it is required to authenticate every request.

2. Run the server with npx

The easiest way to start the server is with npx. The package is published under the scoped name @deepdataspace/dinox-mcp. Pass your API key as a CLI flag or environment variable.

npx -y @deepdataspace/dinox-mcp --dinox-api-key=YOUR_API_KEY

3. Configure your MCP client

Add the server to your Claude Desktop or other MCP client configuration file. Set DINOX_API_KEY in the env block. Optionally set IMAGE_STORAGE_DIRECTORY to a writable path where annotated result images will be saved.

{
  "mcpServers": {
    "dino-x": {
      "command": "npx",
      "args": ["-y", "@deepdataspace/dinox-mcp"],
      "env": {
        "DINOX_API_KEY": "your-dinox-api-key",
        "IMAGE_STORAGE_DIRECTORY": "/tmp/dinox-results"
      }
    }
  }
}
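
If you generate client configuration from a script rather than editing the file by hand, the same entry can be built in code. The following is a minimal sketch: the helper name buildDinoXConfig and its parameters are illustrative assumptions, not part of the DINO-X package. Only the package name, the "dino-x" server key, and the two environment variable names come from the configuration above.

```typescript
// Illustrative helper (not part of @deepdataspace/dinox-mcp): builds the
// MCP client entry shown above so it can be serialized to JSON.

interface McpServerEntry {
  command: string;
  args: string[];
  env: Record<string, string>;
}

interface McpClientConfig {
  mcpServers: Record<string, McpServerEntry>;
}

function buildDinoXConfig(apiKey: string, imageStorageDir?: string): McpClientConfig {
  if (apiKey.trim() === "") {
    throw new Error("DINOX_API_KEY must not be empty");
  }
  const env: Record<string, string> = { DINOX_API_KEY: apiKey };
  // IMAGE_STORAGE_DIRECTORY is optional, so include it only when provided.
  if (imageStorageDir !== undefined) {
    env["IMAGE_STORAGE_DIRECTORY"] = imageStorageDir;
  }
  return {
    mcpServers: {
      "dino-x": {
        command: "npx",
        args: ["-y", "@deepdataspace/dinox-mcp"],
        env,
      },
    },
  };
}

// Print JSON that can be pasted into the client configuration file.
console.log(JSON.stringify(buildDinoXConfig("your-dinox-api-key", "/tmp/dinox-results"), null, 2));
```

Leaving out the second argument produces an entry with only DINOX_API_KEY in the env block, which matches the optional status of IMAGE_STORAGE_DIRECTORY.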

4. Restart your MCP client

Restart Claude Desktop (or reload your MCP client) so it picks up the new server configuration. You should see the DINO-X tools listed in the available tools panel.

5. Detect objects in an image

Use the detect-all-objects tool to run full-scene object detection on an image. In STDIO mode provide a file:// URL; in HTTP mode provide an https:// URL. The tool returns categories, bounding boxes, and captions for every detected object.
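
The URL rule for the two transports can be checked before an image reference ever reaches the server. The sketch below is a hypothetical client-side helper (toImageUrl and the Transport type are assumptions for illustration, not DINO-X APIs). It converts a local path to a file:// URL with Node's pathToFileURL and rejects URLs that do not fit the chosen mode.

```typescript
import { pathToFileURL } from "node:url";

// Hypothetical helper: normalizes an image reference for the chosen transport.
type Transport = "stdio" | "http";

function toImageUrl(input: string, transport: Transport): string {
  // Extract a URL scheme such as "https" or "file" when one is present.
  const scheme = /^([a-z][a-z0-9+.-]*):\/\//i.exec(input)?.[1]?.toLowerCase();

  if (transport === "http") {
    if (scheme !== "https") {
      throw new Error("HTTP mode expects an https:// image URL");
    }
    return input;
  }

  // STDIO mode: file:// URLs pass through, other schemes are rejected.
  if (scheme === "file") {
    return input;
  }
  if (scheme !== undefined) {
    throw new Error(`STDIO mode expects a local file, got a ${scheme}:// URL`);
  }
  // A bare path is resolved against the current directory and converted.
  return pathToFileURL(input).href;
}

console.log(toImageUrl("/home/user/warehouse.jpg", "stdio"));
console.log(toImageUrl("https://example.com/product.jpg", "http"));
```

Validating up front gives a clear local error instead of a failed tool call when a reference does not match the transport in use.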

6. Use text-prompted detection

Use detect-objects-by-text to find specific objects by providing English noun phrases. This lets you ask the model to locate particular items like 'forklift' or 'person wearing helmet' rather than detecting everything.

DINO-X Examples

Client configuration

Claude Desktop config using npx with the scoped package name and required environment variable.

{
  "mcpServers": {
    "dino-x": {
      "command": "npx",
      "args": ["-y", "@deepdataspace/dinox-mcp"],
      "env": {
        "DINOX_API_KEY": "your-dinox-api-key",
        "IMAGE_STORAGE_DIRECTORY": "/tmp/dinox-results"
      }
    }
  }
}

Prompts to try

Example natural-language prompts that exercise the DINO-X vision tools.

- "Detect all objects in this image: file:///home/user/warehouse.jpg and count how many forklifts are present"
- "Find all people wearing hard hats in file:///home/user/site.png using text-prompted detection"
- "Analyze the human poses in file:///home/user/exercise.jpg and describe what exercise the person is performing"
- "Run full-scene detection on https://example.com/product.jpg and list every visible item with its bounding box"
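
The first prompt asks for a count, which the assistant normally derives from the detection results itself. If you post-process results in your own code, counting by category is a small reduction. The sketch below assumes a result shape with a category, a bounding box, and an optional caption per object. The Detection interface and its field names are guesses for illustration, not the server's documented output schema, and the sample data is made up.

```typescript
// Assumed result shape; field names are hypothetical.
interface Detection {
  category: string;
  bbox: [number, number, number, number];
  caption?: string;
}

// Count detections per category, ignoring case and surrounding whitespace.
function countByCategory(detections: Detection[]): Map<string, number> {
  const counts = new Map<string, number>();
  for (const d of detections) {
    const key = d.category.trim().toLowerCase();
    counts.set(key, (counts.get(key) ?? 0) + 1);
  }
  return counts;
}

// Made-up sample data standing in for a real tool response.
const sample: Detection[] = [
  { category: "Forklift", bbox: [10, 20, 110, 220] },
  { category: "person", bbox: [200, 40, 260, 210], caption: "worker in a vest" },
  { category: "forklift ", bbox: [300, 25, 420, 230] },
];

console.log(countByCategory(sample).get("forklift")); // prints 2
```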

Troubleshooting DINO-X

Error: DINOX_API_KEY is not set or invalid

Make sure the DINOX_API_KEY environment variable is present in your MCP client config's env block and that the key is active. You can verify the key by testing it directly with the DINO-X REST API before using the MCP server.
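
A quick local check can rule out the most common cause, a missing or placeholder key in the config file. The sketch below is a hypothetical helper (checkDinoXKey is not part of the DINO-X package). It assumes the server is registered under the name "dino-x" as in the configuration step, and it only inspects the config text. It cannot tell whether the key is active on the DINO-X platform.

```typescript
interface ServerEntry {
  command: string;
  args?: string[];
  env?: Record<string, string>;
}

interface ClientConfig {
  mcpServers?: Record<string, ServerEntry>;
}

// Returns a description of the problem, or null when the key looks present.
function checkDinoXKey(configJson: string, serverName = "dino-x"): string | null {
  const config = JSON.parse(configJson) as ClientConfig;
  const entry = config.mcpServers?.[serverName];
  if (!entry) {
    return `no "${serverName}" entry under mcpServers`;
  }
  const key = entry.env?.["DINOX_API_KEY"];
  if (key === undefined || key.trim() === "") {
    return "DINOX_API_KEY is missing from the env block";
  }
  if (key === "your-dinox-api-key") {
    return "DINOX_API_KEY still holds the placeholder value";
  }
  return null;
}

const exampleConfig = JSON.stringify({
  mcpServers: {
    "dino-x": { command: "npx", args: ["-y", "@deepdataspace/dinox-mcp"], env: {} },
  },
});
console.log(checkDinoXKey(exampleConfig)); // prints "DINOX_API_KEY is missing from the env block"
```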

Images fail to load in STDIO mode with HTTPS URLs

STDIO transport only accepts file:// URLs. Switch to HTTP transport mode (--http flag) when you need to process remote HTTPS image URLs. In HTTP mode file:// paths are not supported.

Annotated output images are not being saved

The IMAGE_STORAGE_DIRECTORY must exist and be writable by the process running the server. Create the directory first and ensure the path is absolute. This feature only works in STDIO mode.
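
The directory requirements can be verified with a few lines of Node before starting the server. This is a minimal sketch using only Node's standard fs, path, and os modules. The function name ensureStorageDirectory and the example location under the system temp directory are assumptions for illustration.

```typescript
import { accessSync, constants, mkdirSync } from "node:fs";
import { tmpdir } from "node:os";
import { isAbsolute, join } from "node:path";

// Creates the directory if needed and throws if it is not usable.
function ensureStorageDirectory(dir: string): void {
  if (!isAbsolute(dir)) {
    throw new Error(`IMAGE_STORAGE_DIRECTORY must be an absolute path: ${dir}`);
  }
  // recursive: true creates missing parents and is a no-op if the directory exists.
  mkdirSync(dir, { recursive: true });
  // Throws when the current process cannot write to the directory.
  accessSync(dir, constants.W_OK);
}

// Example location; substitute the path you set in IMAGE_STORAGE_DIRECTORY.
const storageDir = join(tmpdir(), "dinox-results");
ensureStorageDirectory(storageDir);
console.log(`Storage directory ready: ${storageDir}`);
```

Run the check as the same user that launches the MCP client, since write permission is evaluated for the current process.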

Frequently Asked Questions about DINO-X

What is DINO-X?

DINO-X is the official DINO-X Model Context Protocol (MCP) server. It gives LLMs real-world visual perception through image object detection, localization, and captioning APIs, and it connects AI assistants to these tools through a standardized interface.

How do I install DINO-X?

Run the server with npx -y @deepdataspace/dinox-mcp and add the server config to your AI client, as described in the setup steps above. The DINO-X GitHub repository has further installation instructions.

Which AI clients work with DINO-X?

DINO-X works with all major MCP-compatible AI clients including Claude Desktop, Claude Code, Cursor, VS Code (GitHub Copilot), Windsurf, and Cline.

Is DINO-X free to use?

The DINO-X MCP server is open source under the Apache-2.0 license, so you can use it in both personal and commercial projects. Requests are authenticated with a DINO-X API key from the DINO-X platform (deepdataspace.com).
