Gemini OCR MCP Server

v1.0.0 | APIs | stable

Tags: gemini-ocr, mcp, ai-integration
5 Stars | 0 Downloads | 0 Weekly | 0/5

About

A FastMCP-based OCR server powered by Google Gemini. Handles both image file paths and base64‑encoded images to return extracted text. Easy to integrate into MCP workflows — just set your Gemini API key and model, run the server, and call 'ocr_image_
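
The description above says the server accepts either an image file path or a base64-encoded image. As a rough illustration of the second input form, the sketch below shows one way a client could turn a local image into a base64 string using only the Python standard library. The helper names `encode_image_to_base64` and `decode_base64_image` are hypothetical and are not part of this server; the exact tool name and argument names the server expects are not shown in this listing, so they are left out here.

```python
import base64
from pathlib import Path


def encode_image_to_base64(image_path: str) -> str:
    """Read an image file and return its bytes as a base64 ASCII string.

    Hypothetical client-side helper, not part of the server itself.
    """
    raw = Path(image_path).read_bytes()
    return base64.b64encode(raw).decode("ascii")


def decode_base64_image(encoded: str) -> bytes:
    """Decode a base64 string back to raw bytes.

    validate=True makes the decoder reject characters outside the
    base64 alphabet instead of silently skipping them.
    """
    return base64.b64decode(encoded, validate=True)
```

The resulting string can then be passed wherever the server expects base64 image data, while the plain path can be used when the server process can read the file directly.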

Maintainer: WindoC

License: MIT
Language: Python
Version: v1.0.0
Updated: Apr 28, 2026
Status: healthy
Maintenance: active

Works with

Claude, OpenAI, Windows, macOS, Linux

Installation

Manual Installation

npx gemini-ocr

Examples

Basic Configuration

Configure Gemini OCR MCP Server

npx gemini-ocr

Configuration

Configuration Details

Config File: claude_desktop_config.json
Web UI: Not available

Security

Authentication

Required: No

Features

Data Encryption
Audit Logging
RBAC Support

Performance

Response Metrics

Response Time: < 200ms
Throughput: Medium

Resource Usage

Memory Usage: Low
CPU Usage: Low

Quick Config Preview

{
  "mcpServers": {
    "gemini-ocr": {
      "command": "npx",
      "args": ["-y", "gemini-ocr"]
    }
  }
}

Add this to your claude_desktop_config.json or .cursor/mcp.json
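
If the config file already lists other servers, the entry has to be merged in rather than pasted over the whole file. The following is a minimal sketch of that merge, assuming the file uses the top-level `mcpServers` key shown in the preview above. The function name `add_mcp_server` is hypothetical. The optional `env` argument is included because the description mentions setting a Gemini API key and model, but this listing does not give the variable names, so any names passed in are the caller's assumption.

```python
import json
from pathlib import Path


def add_mcp_server(config_path, name, command, args, env=None):
    """Merge one server entry into an MCP client config file.

    Hypothetical helper: existing servers are kept, and an entry with
    the same name is overwritten. Returns the updated config dict.
    """
    path = Path(config_path)
    text = path.read_text(encoding="utf-8") if path.exists() else ""
    # Treat a missing or empty file as an empty config.
    config = json.loads(text) if text.strip() else {}

    servers = config.setdefault("mcpServers", {})
    entry = {"command": command, "args": list(args)}
    if env:
        # Variable names are supplied by the caller; none are assumed here.
        entry["env"] = dict(env)
    servers[name] = entry

    path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config
```

Calling `add_mcp_server(path, "gemini-ocr", "npx", ["-y", "gemini-ocr"])` with the path to your own config file would write the same entry as the preview while leaving any other configured servers in place.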
