arXiv MCP Server logo

arXiv MCP Server

CommunityPopular
blazickjp

A Model Context Protocol server for searching and analyzing arXiv papers

Publisherblazickjp
Repositoryarxiv-mcp-server
LanguagePython
Forks
216
Stars
2.7K
Available tools
0
Transport typestdio
Categories
LicenseApache-2.0
Links
  • Connect tools to AI workflows

    arXiv MCP Server exposes MCP capabilities that can be used by compatible AI clients and agents.

  • 0 available tools

    Browse the callable actions below, including names and descriptions when provided by the server.

  • Ready-to-copy setup

    Use the installation snippets to configure this server in your preferred MCP client.

  • Open source signals

    2.7K stars and 216 forks from the linked repository.

PyPI Version PyPI Downloads GitHub Stars GitHub Forks Tests Python Version License smithery badge Install in VS Code Install in VS Code Insiders Add to Kiro Codex Plugin

ArXiv MCP Server

πŸ” Enable AI assistants to search and access arXiv papers through a simple MCP interface.

The ArXiv MCP Server provides a bridge between AI assistants and arXiv's research repository through the Model Context Protocol (MCP). It allows AI models to search for papers and access their content in a programmatic way.

🀝 Contribute β€’ πŸ“ Report Bug

✨ Core Features

  • πŸ”Ž Paper Search: Query arXiv papers with filters for date ranges and categories
  • πŸ“„ Paper Access: Download and read paper content
  • πŸ“‹ Paper Listing: View all downloaded papers
  • πŸ—ƒοΈ Local Storage: Papers are saved locally for faster access
  • πŸ“ Prompts: A set of research prompts for paper analysis

πŸ”’ Security

Prompt Injection Risk

Paper content retrieved from arXiv is untrusted external input.

When an AI assistant downloads or reads a paper through this server, the paper's text is passed directly into the model's context. A maliciously crafted paper could embed adversarial instructions designed to hijack the AI's behavior β€” for example, instructing it to exfiltrate data, invoke other tools with unintended arguments, or override system-level instructions. This is a known class of attack described by OWASP as LLM01: Prompt Injection and by the OWASP Agentic AI framework as AG01: Prompt Injection in LLM-Integrated Systems.

Recommended Mitigations

  1. Use read-only MCP configurations β€” where possible, configure the MCP client so that the arxiv-mcp-server cannot trigger write operations or invoke other tools on your behalf.
  2. Review paper content before acting on AI summaries β€” if an AI summary asks you to run commands or visit external URLs that were not part of your original request, treat that as a red flag.
  3. Be cautious in multi-tool setups β€” agentic pipelines that combine this server with filesystem, shell, or browser tools are higher risk; a prompt injection in a paper could chain tool calls unexpectedly.
  4. Treat AI-generated summaries as data, not instructions β€” always apply human judgment before executing any action the AI recommends after reading a paper.

References


πŸš€ Quick Start

Installing via Smithery

To install ArXiv Server for Claude Desktop automatically via Smithery:

bash
npx -y @smithery/cli install arxiv-mcp-server --client claude

Installing via Claude Desktop (.mcpb)

The .mcpb bundle is the one-click install path for Claude Desktop on macOS. It bundles the server code and Python package dependencies, so users do not need uv, pip, or manual MCP JSON configuration. Python 3.11+ must still be available on the user's machine.

  1. Download the artifact matching your Mac from the latest release:
    • Apple Silicon: arxiv-mcp-server-darwin-arm64-<version>.mcpb
    • Intel: arxiv-mcp-server-darwin-x86_64-<version>.mcpb
  2. In Claude Desktop open Settings β†’ Extensions (or drag-and-drop the file onto the Claude Desktop window).
  3. Click Install and, when prompted, set your preferred paper storage directory (defaults to ~/.arxiv-mcp-server/papers).

Claude Desktop launches the bundled server over stdio β€” no configuration file edits needed.

Installing Manually

Important β€” use uv tool install, not uv pip install

Running uv pip install arxiv-mcp-server installs the package into the current virtual environment but does not place the arxiv-mcp-server executable on your PATH. You must use uv tool install so that uv creates an isolated environment and exposes the executable globally:

bash
uv tool install arxiv-mcp-server

After this, the arxiv-mcp-server command will be available on your PATH.

PDF fallback (older papers): Most arXiv papers have an HTML version which the base install handles automatically. For older papers that only have a PDF, the server needs the [pdf] extra (pymupdf4llm). Install it with:

bash
uv tool install 'arxiv-mcp-server[pdf]'

You can verify it with:

bash
arxiv-mcp-server --help

If you previously ran uv pip install arxiv-mcp-server and the command is missing, uninstall it and re-install with uv tool install as shown above.

For development:

bash
# Clone and set up development environment
git clone https://github.com/blazickjp/arxiv-mcp-server.git
cd arxiv-mcp-server

# Create and activate virtual environment
uv venv
source .venv/bin/activate

# Install with test dependencies (development only β€” no global executable)
uv pip install -e ".[test]"

πŸ€– Codex Plugin Integration

This repository now includes a Codex plugin manifest at .codex-plugin/plugin.json and a portable MCP config at .mcp.json so Codex-oriented tooling can discover the server without inventing its own install recipe.

The Codex integration uses the same stdio launch path documented elsewhere in this README:

json
{
  "mcpServers": {
    "arxiv": {
      "command": "uvx",
      "args": ["arxiv-mcp-server"]
    }
  }
}

If your Codex client supports plugin manifests, point it at ./.codex-plugin/plugin.json. If it only supports raw MCP configuration, use ./.mcp.json directly.

πŸ”Œ MCP Integration

Add this configuration to your MCP client config file:

json
{
    "mcpServers": {
        "arxiv-mcp-server": {
            "command": "uv",
            "args": [
                "tool",
                "run",
                "arxiv-mcp-server",
                "--storage-path", "/path/to/paper/storage"
            ]
        }
    }
}

For Development:

json
{
    "mcpServers": {
        "arxiv-mcp-server": {
            "command": "uv",
            "args": [
                "--directory",
                "path/to/cloned/arxiv-mcp-server",
                "run",
                "arxiv-mcp-server",
                "--storage-path", "/path/to/paper/storage"
            ]
        }
    }
}

HTTP Transport

For server deployments where stdio is not practical, run the server with Streamable HTTP:

bash
TRANSPORT=http HOST=127.0.0.1 PORT=8080 arxiv-mcp-server --storage-path /path/to/papers

Then configure an MCP client that supports Streamable HTTP:

json
{
    "mcpServers": {
        "arxiv-mcp-server": {
            "type": "http",
            "url": "http://127.0.0.1:8080/mcp"
        }
    }
}

The default HTTP bind host is 127.0.0.1. Streamable HTTP enables MCP DNS rebinding protection by default and allows loopback hosts for the configured port. If exposing the server through a reverse proxy, keep it bound to localhost unless you have added authentication and network controls upstream; set ALLOWED_HOSTS and ALLOWED_ORIGINS to the external host/origin values your proxy forwards.

πŸ”’ Security Note

arXiv papers are user-generated, untrusted content. Paper text returned by this server may contain prompt injection attempts β€” crafted text designed to manipulate an AI assistant's behavior. Treat all paper content as untrusted input.

In production environments, apply appropriate sandboxing and avoid feeding raw paper content into agentic pipelines that have access to sensitive tools or data without review. See SECURITY.md for the full security policy.

πŸ’‘ Available Tools

Core Workflow

The typical workflow for deep paper research is:

search_papers β†’ download_paper β†’ read_paper

list_papers shows what you have locally. semantic_search searches across your local collection.


1. Paper Search

Search arXiv with optional category, date, and boolean filters. Enforces arXiv's 3-second rate limit automatically. If rate limited, wait 60 seconds before retrying.

python
result = await call_tool("search_papers", {
    "query": "\"KAN\" OR \"Kolmogorov-Arnold Networks\"",
    "max_results": 10,
    "date_from": "2024-01-01",
    "categories": ["cs.LG", "cs.AI"],
    "sort_by": "date"   # or "relevance" (default)
})

Supported categories include cs.AI, cs.LG, cs.CL, cs.CV, cs.NE, stat.ML, math.OC, quant-ph, eess.SP, and more. See tool description for the full list.

2. Paper Download

Download a paper by its arXiv ID. Tries HTML first, falls back to PDF. Stores the paper locally for read_paper and semantic_search.

python
result = await call_tool("download_paper", {
    "paper_id": "2401.12345"
})

For older papers that only have a PDF, install the [pdf] extra: uv tool install 'arxiv-mcp-server[pdf]'

3. List Papers

List all papers downloaded locally. Returns arXiv IDs only β€” use read_paper to access content.

python
result = await call_tool("list_papers", {})

4. Read Paper

Read the full text of a locally downloaded paper in markdown. Requires download_paper to be called first.

python
result = await call_tool("read_paper", {
    "paper_id": "2401.12345"
})

πŸ“ Research Prompts

The server offers specialized prompts to help analyze academic papers:

Paper Analysis Prompt

A comprehensive workflow for analyzing academic papers that only requires a paper ID:

python
result = await call_prompt("deep-paper-analysis", {
    "paper_id": "2401.12345"
})

This prompt includes:

  • Detailed instructions for using available tools (list_papers, download_paper, read_paper, search_papers)
  • A systematic workflow for paper analysis
  • Comprehensive analysis structure covering:
    • Executive summary
    • Research context
    • Methodology analysis
    • Results evaluation
    • Practical and theoretical implications
  • Future research directions
  • Broader impacts

Pro Prompt Pack

  • summarize_paper: concise structured summary for one paper.
  • compare_papers: side-by-side technical comparison across paper IDs.
  • literature_review: thematic synthesis across a topic and optional paper set.

βš™οΈ Configuration

Configure through command-line options and environment variables:

SettingPurposeDefault
--storage-pathPaper storage location~/.arxiv-mcp-server/papers
MAX_RESULTSMaximum search results50
REQUEST_TIMEOUTAPI timeout in seconds60
TRANSPORTTransport type: stdio, http, or streamable-httpstdio
HOSTHost to bind to in HTTP mode127.0.0.1
PORTPort to listen on in HTTP mode8000
ALLOWED_HOSTSComma-separated extra allowed Host header values for Streamable HTTP DNS rebinding protectionempty
ALLOWED_ORIGINSComma-separated extra allowed Origin header values for Streamable HTTP DNS rebinding protectionempty

πŸ§ͺ Testing

Run the test suite:

bash
python -m pytest

πŸ§ͺ Experimental Features

These features are not yet fully tested and may behave unexpectedly. Use with caution.

The following tools require additional dependencies and are under active development:

bash
uv pip install -e ".[pro]"

Semantic Search

Semantic similarity search over your locally downloaded papers only. Returns empty results if no papers have been downloaded yet. Requires [pro] dependencies.

python
result = await call_tool("semantic_search", {
    "query": "test-time adaptation in multimodal transformers",
    "max_results": 5
})
# or find papers similar to a known paper:
result = await call_tool("semantic_search", {
    "paper_id": "2404.19756",
    "max_results": 5
})

Citation Graph

Fetch references and citing papers via Semantic Scholar. Works on any arXiv ID β€” no local download required.

python
result = await call_tool("citation_graph", {
    "paper_id": "2401.12345"
})

Research Alerts

Save topic watches and poll for newly published papers since the last check. Uses the same query syntax as search_papers.

python
# Register a watch (idempotent β€” calling again updates the existing watch)
await call_tool("watch_topic", {
    "topic": "\"multi-agent reinforcement learning\"",
    "categories": ["cs.AI", "cs.LG"],
    "max_results": 10
})

# Check all watches β€” returns only papers published since last check
result = await call_tool("check_alerts", {})

# Check a single watch
result = await call_tool("check_alerts", {"topic": "\"multi-agent reinforcement learning\""})

Advanced Prompts

summarize_paper, compare_papers, and literature_review for deeper research workflows. Requires [pro] dependencies.


πŸ“„ License

Released under the Apache License 2.0. See the LICENSE file for details.


Made with ❀️ by the Pearl Labs Team

Installation

TypingMind
Prerequisites:

Node.js 18+

{
  "mcpServers": {
    "arxiv": {
      "command": "npx",
      "args": [
        "-y",
        "arxiv-mcp-server@latest"
      ]
    }
  }
}

Use arXiv MCP Server MCP with multiple AI models

TypingMind connects MCP tools at the workspace level, so once arXiv MCP Server is connected, you can use it with different AI models in TypingMind instead of setting it up separately for each model. This MCP runs locally through the TypingMind MCP connector on your device.

Setup guide to use the local connector

Use this when the MCP server needs access to local files, apps, or private resources on your computer.

1

Open the MCP settings

In TypingMind, go to Settings, Advanced Settings, then Model Context Protocol and choose Setup Connector.

  1. Open TypingMind in your browser.
  2. Click the Settings icon.
  3. Go to Advanced Settings.
  4. Open the Model Context Protocol section.
  5. Click Setup Connector and choose This Device.
TypingMind MCP connector setup screen with This Device selected
2

Run the connector command

Choose This Device, copy the command from TypingMind, and run it in Terminal. Keep the process running while you use MCP.

  1. Copy the setup command shown by TypingMind.
  2. Open Terminal on macOS or Windows Terminal on Windows.
  3. Paste and run the command.
  4. Approve the package install if Terminal asks you to proceed.
  5. Keep the Terminal window running while using MCP tools.
3

Add arXiv MCP Server as a server

When the connector status is Ready, click Edit Servers and paste the MCP server configuration.

  1. Wait until the connector status shows Ready.
  2. Click Edit Servers.
  3. Paste the arXiv MCP Server MCP server configuration.
  4. Save the server list.
  5. Refresh if you want to confirm the connector is still ready.
TypingMind MCP settings showing active server and Edit Servers button
{
  "mcpServers": {
    "arxiv-mcp-server": {
      "command": "npx",
      "args": [
        "-y",
        "null"
      ]
    }
  }
}
4

Use it across models

Save the server list, open Plugins, enable the arXiv MCP Server MCP tools, then select any supported AI model in TypingMind and use the tools in chat or assign them to an AI agent.

  1. Open the Plugins page in TypingMind.
  2. Enable the arXiv MCP Server MCP tools.
  3. Start a chat and choose the AI model you want to use.
  4. Use the MCP tools in chat or assign them to an AI agent.
  5. Switch to another AI model whenever needed without reconnecting MCP.
TypingMind chat using enabled MCP tools with a selected AI model
Can you use arXiv MCP Server to help me with this task?
arXiv MCP Server
Sure. I read it.
Here is what I found using arXiv MCP Server.

Frequently asked questions

What is the arXiv MCP Server MCP server used for?

arXiv MCP Server is an MCP server that lets compatible AI clients connect to external tools and context. In TypingMind, you can add this MCP server once and make its tools available in your AI workspace.

Can I use arXiv MCP Server MCP with multiple AI models in TypingMind?

Yes. TypingMind connects MCP tools at the workspace level, so you can use arXiv MCP Server with different AI models such as Claude, ChatGPT, Gemini, or other models you have configured in TypingMind without setting up the MCP server separately for each model.

Why use arXiv MCP Server MCP with TypingMind?

TypingMind is one of the best frontends for LLM chat because it brings multiple AI models, prompts, plugins, AI agents, API keys, and MCP tools into one workspace. With arXiv MCP Server connected, you can use its MCP tools across your preferred models while keeping your chat workflow organized in TypingMind.

How do I connect arXiv MCP Server MCP to TypingMind?

arXiv MCP Server runs through the TypingMind local MCP connector. This is best when the MCP server needs access to local files, desktop apps, command-line tools, or private resources on your computer.

What tools does arXiv MCP Server MCP provide in TypingMind?

arXiv MCP Server exposes MCP capabilities that can be enabled from the TypingMind Plugins page and used in chat or assigned to AI agents.

Do I need to share my API keys with TypingMind to use arXiv MCP Server MCP?

No. TypingMind is local-first and lets you keep your model providers, API keys, prompts, and MCP configuration under your control. If arXiv MCP Server requires authentication, add the required headers, OAuth settings, or local configuration for that MCP server when you create the connection.

Related MCP Servers

View all

Set up your own AI workspace now

Get notified about new features and future giveaways by subscribing to our newsletter πŸ‘‡