Vllora
Debug your AI agents
What is Vllora?
Vllora is a Model Context Protocol (MCP) server that allows AI assistants like Claude, Cursor, and VS Code to debug your ai agents
Debug your AI agents
This server falls under the Monitoring & Observability category on MCPgee, the world's largest MCP server directory with 33,000+ servers.
Features
- Debug your AI agents
Use Cases
Maintainer
Works with
Installation
Manual Installation
npx vlloraConfiguration
Configuration Details
claude_desktop_config.json
Performance
Response Metrics
Resource Usage
How to Set Up and Use Vllora
Vllora is a lightweight real-time debugging and observability tool for AI agents that traces every LLM call, tool interaction, and agent workflow through an OpenAI-compatible API gateway running at localhost. It exposes a web UI on port 9091 and an API on port 9090, automatically collecting traces from frameworks like LangChain, Google ADK, and direct OpenAI SDK calls without requiring code changes. Developers use it to identify latency bottlenecks, inspect tool call sequences, and replay agent interactions during development.
Prerequisites
- macOS with Homebrew installed (for the recommended install method)
- API keys for the LLM providers you want to proxy (e.g. OpenAI, Anthropic)
- An AI agent framework (LangChain, Google ADK, OpenAI SDK, etc.) to trace
- An MCP client if using the MCP server integration feature
Install vllora via Homebrew
Add the vllora tap and install the binary.
brew tap vllora/vllora
brew install vlloraStart the vllora server
Launch the server — it starts the API on port 9090 and the debugging UI on port 9091.
vlloraConfigure your LLM provider credentials
Open the web UI at http://localhost:9091 and add your API keys for each provider (OpenAI, Anthropic, etc.) through the settings interface.
Point your agent at the vllora endpoint
Change your agent's base URL from the provider's endpoint to vllora's local API. All calls will be proxied and traced.
# Example: redirect OpenAI calls through vllora
export OPENAI_BASE_URL=http://localhost:9090/v1
export VLLORA_OPENAI_API_KEY=your_openai_api_keyInspect traces in the UI
Open http://localhost:9091 to see live traces, tool call sequences, latency breakdowns, and full request/response payloads for every agent interaction.
Vllora Examples
Client configuration
Claude Desktop config to run vllora as an MCP server for agent debugging.
{
"mcpServers": {
"vllora": {
"command": "npx",
"args": ["vllora"],
"env": {
"VLLORA_OPENAI_API_KEY": "your_openai_api_key"
}
}
}
}Prompts to try
Example ways to use vllora once connected via MCP.
- "Show me the last 10 traces captured by vllora"
- "Which tool calls had the highest latency in the last session?"
- "Replay the failed agent interaction from trace ID abc123"
- "List all LLM providers currently configured in vllora"Troubleshooting Vllora
Port 9090 or 9091 is already in use
Find and stop the conflicting process with lsof -i :9090, or configure vllora to use different ports via its settings.
Agent requests are not appearing in the vllora UI
Verify that your agent's base URL is actually set to http://localhost:9090/v1 and not the original provider URL. Restart the agent process after changing the environment variable.
brew install vllora fails on Apple Silicon
Make sure Homebrew itself is up to date with brew update && brew upgrade, then retry. If the formula is not yet available for arm64, download the binary directly from the vllora GitHub releases page.
Frequently Asked Questions about Vllora
What is Vllora?
Vllora is a Model Context Protocol (MCP) server that debug your ai agents It connects AI assistants to external tools and data sources through a standardized interface.
How do I install Vllora?
Follow the installation instructions on the Vllora GitHub repository. Clone the repo, install dependencies, and add the server config to your AI client.
Which AI clients work with Vllora?
Vllora works with all major MCP-compatible AI clients including Claude Desktop, Claude Code, Cursor, VS Code (GitHub Copilot), Windsurf, and Cline.
Is Vllora free to use?
Yes, Vllora is open source and available under the NOASSERTION license. You can use it freely in both personal and commercial projects.
Vllora Alternatives — Similar Monitoring & Observability Servers
Looking for alternatives to Vllora? Here are other popular monitoring & observability servers you can use with Claude, Cursor, and VS Code.
Netdata
★ 78.9kReal-time infrastructure monitoring with metrics, logs, alerts, and ML-based anomaly detection.
Kubeshark
★ 11.9keBPF-powered network observability for Kubernetes. Indexes L4/L7 traffic with full K8s context, decrypts TLS without keys. Queryable by AI agents via MCP and humans via dashboard.
Mission Control
★ 4.9kSelf-hosted AI agent orchestration platform: dispatch tasks, run multi-agent workflows, monitor spend, and govern operations from one mission control dashboard.
Grafana
★ 3.0kThis MCP server enables natural-language querying of Grafana logs by automatically detecting log sources and service labels. It provides read-only access to log data with intelligent caching for efficient repeat queries.
Sentrux
★ 2.4kReal-time architectural sensor that helps AI agents close the feedback loop, enabling recursive self-improvement of code quality. Pure Rust.
OpenInference
★ 986OpenTelemetry Instrumentation for AI Observability
Browse More Monitoring & Observability MCP Servers
Explore all monitoring & observability servers available in the MCPgee directory. Each server includes setup guides for Claude, Cursor, and VS Code.
Set Up Vllora in Your Editor
Choose your AI client for step-by-step setup instructions.
Quick Config Preview
Add this to your claude_desktop_config.json or .cursor/mcp.json
Ready to use Vllora?
Browse our complete directory of 33,000+ MCP servers, read setup guides for your editor, and start building with the Model Context Protocol.