How do I install Vllora MCP Server?

Follow the setup instructions on the Vllora GitHub repository, then add the server configuration to your AI client.

What category is Vllora MCP Server?

Vllora is categorized under Monitoring & Observability. Browse more servers in these categories on MCPgee.

Vllora

Name: Vllora MCP Server
Author: vllora

v1.0.0•Monitoring & Observability•stable

Debug your AI agents

agentsai-agentsai-gatewayanthropicazure

805

Stars

Downloads

Weekly

0/5

View on GitHub

What is Vllora?

Vllora is a Model Context Protocol (MCP) server that allows AI assistants like Claude, Cursor, and VS Code to debug your ai agents

Debug your AI agents

This server falls under the Monitoring & Observability category on MCPgee, the world's largest MCP server directory with 33,000+ servers.

Features

Debug your AI agents

Use Cases

AI agent debugging and monitoring

Multi-provider gateway support

vllora

Maintainer

LicenseNOASSERTION

Languagerust

Versionv1.0.0

UpdatedMay 22, 2026

Statushealthy

Maintenanceactive

Works with

ClaudeOpenAIwindowsmacoslinux

View Source Browse All Servers

Installation

Manual Installation

npx vllora

Configuration

Configuration Details

Config File

claude_desktop_config.json

Performance

Response Metrics

Response Time< 200ms

ThroughputMedium

Resource Usage

Memory UsageLow

CPU UsageLow

How to Set Up and Use Vllora

Vllora is a lightweight real-time debugging and observability tool for AI agents that traces every LLM call, tool interaction, and agent workflow through an OpenAI-compatible API gateway running at localhost. It exposes a web UI on port 9091 and an API on port 9090, automatically collecting traces from frameworks like LangChain, Google ADK, and direct OpenAI SDK calls without requiring code changes. Developers use it to identify latency bottlenecks, inspect tool call sequences, and replay agent interactions during development.

Prerequisites

macOS with Homebrew installed (for the recommended install method)
API keys for the LLM providers you want to proxy (e.g. OpenAI, Anthropic)
An AI agent framework (LangChain, Google ADK, OpenAI SDK, etc.) to trace
An MCP client if using the MCP server integration feature

Install vllora via Homebrew

Add the vllora tap and install the binary.

brew tap vllora/vllora
brew install vllora

Start the vllora server

Launch the server — it starts the API on port 9090 and the debugging UI on port 9091.

vllora

Configure your LLM provider credentials

Open the web UI at http://localhost:9091 and add your API keys for each provider (OpenAI, Anthropic, etc.) through the settings interface.

Point your agent at the vllora endpoint

Change your agent's base URL from the provider's endpoint to vllora's local API. All calls will be proxied and traced.

# Example: redirect OpenAI calls through vllora
export OPENAI_BASE_URL=http://localhost:9090/v1
export VLLORA_OPENAI_API_KEY=your_openai_api_key

Inspect traces in the UI

Open http://localhost:9091 to see live traces, tool call sequences, latency breakdowns, and full request/response payloads for every agent interaction.

Vllora Examples

Client configuration

Claude Desktop config to run vllora as an MCP server for agent debugging.

{
  "mcpServers": {
    "vllora": {
      "command": "npx",
      "args": ["vllora"],
      "env": {
        "VLLORA_OPENAI_API_KEY": "your_openai_api_key"
      }
    }
  }
}

Prompts to try

Example ways to use vllora once connected via MCP.

- "Show me the last 10 traces captured by vllora"
- "Which tool calls had the highest latency in the last session?"
- "Replay the failed agent interaction from trace ID abc123"
- "List all LLM providers currently configured in vllora"

Troubleshooting Vllora

Port 9090 or 9091 is already in use

Find and stop the conflicting process with lsof -i :9090, or configure vllora to use different ports via its settings.

Agent requests are not appearing in the vllora UI

Verify that your agent's base URL is actually set to http://localhost:9090/v1 and not the original provider URL. Restart the agent process after changing the environment variable.

brew install vllora fails on Apple Silicon

Make sure Homebrew itself is up to date with brew update && brew upgrade, then retry. If the formula is not yet available for arm64, download the binary directly from the vllora GitHub releases page.

Frequently Asked Questions about Vllora

What is Vllora?

Vllora is a Model Context Protocol (MCP) server that debug your ai agents It connects AI assistants to external tools and data sources through a standardized interface.

How do I install Vllora?

Follow the installation instructions on the Vllora GitHub repository. Clone the repo, install dependencies, and add the server config to your AI client.

Which AI clients work with Vllora?

Vllora works with all major MCP-compatible AI clients including Claude Desktop, Claude Code, Cursor, VS Code (GitHub Copilot), Windsurf, and Cline.

Is Vllora free to use?

Yes, Vllora is open source and available under the NOASSERTION license. You can use it freely in both personal and commercial projects.

Learn More About MCP Servers

Getting Started with MCP

Set up your first MCP server in minutes

MCP Setup Guide

Configure MCP in Claude, Cursor & VS Code

All MCP Tutorials

18+ hands-on guides for developers

MCP FAQ

40+ answers about Model Context Protocol

Vllora Alternatives — Similar Monitoring & Observability Servers

Looking for alternatives to Vllora? Here are other popular monitoring & observability servers you can use with Claude, Cursor, and VS Code.

Netdata

★ 78.9k

Real-time infrastructure monitoring with metrics, logs, alerts, and ML-based anomaly detection.

Kubeshark

★ 11.9k

eBPF-powered network observability for Kubernetes. Indexes L4/L7 traffic with full K8s context, decrypts TLS without keys. Queryable by AI agents via MCP and humans via dashboard.

Mission Control

★ 4.9k

Self-hosted AI agent orchestration platform: dispatch tasks, run multi-agent workflows, monitor spend, and govern operations from one mission control dashboard.

Grafana

★ 3.0k

This MCP server enables natural-language querying of Grafana logs by automatically detecting log sources and service labels. It provides read-only access to log data with intelligent caching for efficient repeat queries.

Sentrux

★ 2.4k

Real-time architectural sensor that helps AI agents close the feedback loop, enabling recursive self-improvement of code quality. Pure Rust.

OpenInference

★ 986

OpenTelemetry Instrumentation for AI Observability

Browse More Monitoring & Observability MCP Servers

Explore all monitoring & observability servers available in the MCPgee directory. Each server includes setup guides for Claude, Cursor, and VS Code.

Monitoring & Observability Browse All Servers

Set Up Vllora in Your Editor

Choose your AI client for step-by-step setup instructions.

🖥️

Claude Desktop

macOS & Windows app

⌨️

Claude Code

CLI & terminal

📝

Cursor

AI-first code editor

💻

VS Code

GitHub Copilot MCP

🏄

Windsurf

Codeium AI editor

🔌

Cline

VS Code extension

Quick Config Preview

{
  "mcpServers": {
    "vllora": {
      "command": "npx",
      "args": ["-y", "vllora"]
    }
  }
}

Add this to your claude_desktop_config.json or .cursor/mcp.json

Read the full setup guide →

Ready to use Vllora?

Browse our complete directory of 33,000+ MCP servers, read setup guides for your editor, and start building with the Model Context Protocol.

33,000+ ServersFree & Open SourceStep-by-Step Guides

Explore All Servers Read Our Guides