PaperBanana

v0.1.1Data Science & MLstable

Open source implementation and extension of Google Research’s PaperBanana for automated academic figures, diagrams, and research visuals, expanded to new domains like slide generation.

academic-diagramsacademic-researchagentic-aiarxivdiagram-generation
Share:
1,700
Stars
0
Downloads
0
Weekly
0/5

What is PaperBanana?

PaperBanana is a Model Context Protocol (MCP) server that allows AI assistants like Claude, Cursor, and VS Code to open source implementation and extension of google research’s paperbanana for automated academic figures, diagrams, and research visuals, expanded to new domains like slide generation.

Open source implementation and extension of Google Research’s PaperBanana for automated academic figures, diagrams, and research visuals, expanded to new domains like slide generation.

This server falls under the Data Science & ML category on MCPgee, the world's largest MCP server directory with 33,000+ servers.

Features

  • Open source implementation and extension of Google Research’

Use Cases

Automated academic figures and diagrams
Research visualization and slide generation
llmsresearch

Maintainer

LicenseMIT
Languagepython
Versionv0.1.1
UpdatedMay 22, 2026
Statushealthy
Maintenanceactive

Works with

ClaudeOpenAIwindowsmacoslinux

Installation

PIP

pip install paperbanana

Manual Installation

pip install paperbanana

Configuration

Configuration Details

Config File

claude_desktop_config.json

Performance

Response Metrics

Response Time< 200ms
ThroughputMedium

Resource Usage

Memory UsageLow
CPU UsageLow

How to Set Up and Use PaperBanana

PaperBanana is an open-source multi-agent system for generating publication-quality academic figures, methodology diagrams, and research visualizations from text descriptions using large language and vision models. It extends Google Research's original PaperBanana concept with iterative VLM critic feedback loops, batch processing, venue-specific style packs (NeurIPS, ICML, ACL, IEEE), and an MCP server interface that lets AI assistants in Claude Code or Cursor generate and refine figures directly in chat. Researchers use it to automate the figure creation pipeline — from raw method descriptions to polished, ready-to-submit diagrams — without manually operating design tools.

Prerequisites

  • Python 3.10+ with pip installed
  • An OpenAI API key (OPENAI_API_KEY) or a Google AI Studio key (GOOGLE_API_KEY) for Gemini — at least one is required
  • uvx (from the uv package manager) for the recommended MCP installation, or pip for CLI use
  • An MCP-compatible client such as Claude Desktop or Cursor
1

Install PaperBanana via pip

Install the base package. For MCP usage, install with the mcp extra. For full provider support including Google Gemini, add the google extra.

pip install "paperbanana[mcp,google]"
2

Set your LLM provider API key

Export the API key for your chosen provider. OpenAI is the default; Google Gemini (gemini-2.5-flash) is a free alternative.

# OpenAI (default)
export OPENAI_API_KEY="sk-..."

# OR Google Gemini (free tier available)
export GOOGLE_API_KEY="your-google-ai-studio-key"
3

Add the MCP server to your client configuration

Configure your MCP client to launch the paperbanana-mcp server using uvx. This approach avoids manual virtual environment management.

{
  "mcpServers": {
    "paperbanana": {
      "command": "uvx",
      "args": ["--from", "paperbanana[mcp]", "paperbanana-mcp"],
      "env": {
        "OPENAI_API_KEY": "sk-..."
      }
    }
  }
}
4

Restart your MCP client

Restart Claude Desktop or reload the editor extension so it discovers the new paperbanana server. The MCP server exposes tools like generate_diagram, generate_plot, evaluate_diagram, batch_diagrams, and orchestrate_figures.

5

Test with a diagram generation request

Ask your AI assistant to generate a methodology diagram from a text description. The multi-agent pipeline (Retriever, Planner, Stylist, Visualizer, Critic) will iterate until the output meets quality criteria.

PaperBanana Examples

Client configuration

MCP config block for Claude Desktop using uvx to run the PaperBanana MCP server with an OpenAI key.

{
  "mcpServers": {
    "paperbanana": {
      "command": "uvx",
      "args": ["--from", "paperbanana[mcp]", "paperbanana-mcp"],
      "env": {
        "OPENAI_API_KEY": "sk-your-openai-key-here"
      }
    }
  }
}

Prompts to try

These prompts show how to generate methodology diagrams, plots, and evaluate figures using the MCP tools.

- "Generate a methodology diagram for a Retrieval-Augmented Generation pipeline with the caption 'Our RAG Framework Overview' in NeurIPS style"
- "Create a bar chart comparing model accuracy from my results.csv file with intent: accuracy comparison across baselines"
- "Polish this existing diagram figure.png for submission to ICML — make the arrows thicker and increase font size"
- "Evaluate this generated diagram against the reference human-drawn figure and give me a quality score"
- "Batch generate all figures described in my manifest.yaml file"

Troubleshooting PaperBanana

uvx command not found when launching the MCP server

Install uv first: curl -LsSf https://astral.sh/uv/install.sh | sh. Then verify with uvx --version. Alternatively, install paperbanana with pip and use the command 'paperbanana-mcp' directly.

API key authentication errors from OpenAI or Google

Verify that OPENAI_API_KEY or GOOGLE_API_KEY is set in the env block of your MCP config (not just as a shell export, since MCP clients launch servers in isolated environments). Check that your key has active credits.

Diagram generation times out or returns a blank image

The multi-agent pipeline (Planner + Critic loop) can take 60-120 seconds for complex figures. Increase your MCP client's tool timeout if possible. For faster results, use --auto flag equivalents which skip manual critique cycles, or switch to the Google Gemini provider which has lower latency on flash models.

Frequently Asked Questions about PaperBanana

What is PaperBanana?

PaperBanana is a Model Context Protocol (MCP) server that open source implementation and extension of google research’s paperbanana for automated academic figures, diagrams, and research visuals, expanded to new domains like slide generation. It connects AI assistants to external tools and data sources through a standardized interface.

How do I install PaperBanana?

Install via pip with: pip install paperbanana. Then configure your AI client to connect to this MCP server.

Which AI clients work with PaperBanana?

PaperBanana works with all major MCP-compatible AI clients including Claude Desktop, Claude Code, Cursor, VS Code (GitHub Copilot), Windsurf, and Cline.

Is PaperBanana free to use?

Yes, PaperBanana is open source and available under the MIT license. You can use it freely in both personal and commercial projects.

Browse More Data Science & ML MCP Servers

Explore all data science & ml servers available in the MCPgee directory. Each server includes setup guides for Claude, Cursor, and VS Code.

Quick Config Preview

{ "mcpServers": { "paperbanana": { "command": "pip", "args": ["install", "paperbanana"] } } }

Add this to your claude_desktop_config.json or .cursor/mcp.json

Read the full setup guide →

Ready to use PaperBanana?

Browse our complete directory of 33,000+ MCP servers, read setup guides for your editor, and start building with the Model Context Protocol.

33,000+ ServersFree & Open SourceStep-by-Step Guides