Groq

CommunityPopular

groq

Official Groq MCP server for ultra-fast LLM inference. Connect Claude or Cursor to Groq's LPU-powered API for running Llama, Mixtral, and other open-source models at extreme speed.

Publisher	groq
Repository	`groq-mcp`
Language	TypeScript
Forks	0
Stars	6.5K
Available tools	15
Transport type	stdio
Categories	AI Ml
License	MIT
Links	GitHub

Connect tools to AI workflows
Groq exposes MCP capabilities that can be used by compatible AI clients and agents.
15 available tools
Browse the callable actions below, including names and descriptions when provided by the server.
Ready-to-copy setup
Use the installation snippets to configure this server in your preferred MCP client.
Open source signals
6.5K stars and 0 forks from the linked repository.

Installation

TypingMind

Prerequisites:

Node.js 18+

{
  "mcpServers": {
    "groq": {
      "command": "npx",
      "args": [
        "-y",
        "groq-mcp"
      ],
      "env": {
        "GROQ_API_KEY": "<YOUR_API_KEY>"
      }
    }
  }
}

Available Tools

list_models
Retrieve available LLM models from Groq's API including Llama, Mixtral, and other supported open-source models
create_completion
Generate text completion using specified Groq model with ultra-fast LPU inference
create_chat_completion
Generate conversational responses using chat-formatted input with Groq's high-speed inference
get_model_details
Fetch detailed information about a specific model including parameters, context length, and capabilities
create_streaming_completion
Generate streaming text completion for real-time response delivery
create_streaming_chat
Generate streaming chat responses with token-by-token delivery
get_usage_stats
Retrieve API usage statistics including tokens consumed, requests made, and rate limit status
validate_api_key
Verify Groq API key validity and check authentication status
estimate_tokens
Calculate estimated token count for input text before making API calls
get_rate_limits
Fetch current rate limit information and remaining quota for the API key
list_supported_languages
Get list of natural languages supported by available models
get_model_performance
Retrieve performance metrics including inference speed and throughput for specific models
create_batch_completion
Process multiple completion requests in a single batch operation
get_system_status
Check Groq service status and LPU infrastructure availability
configure_inference_params
Set inference parameters like temperature, max tokens, and top-p for model requests

Use Groq MCP with multiple AI models

TypingMind connects MCP tools at the workspace level, so once Groq is connected, you can use it with different AI models in TypingMind instead of setting it up separately for each model. This MCP runs locally through the TypingMind MCP connector on your device.

Setup guide to use the local connector

Use this when the MCP server needs access to local files, apps, or private resources on your computer.

Open the MCP settings

In TypingMind, go to Settings, Advanced Settings, then Model Context Protocol and choose Setup Connector.

Open TypingMind in your browser.
Click the Settings icon.
Go to Advanced Settings.
Open the Model Context Protocol section.
Click Setup Connector and choose This Device.

TypingMind MCP connector setup screen with This Device selected

Run the connector command

Choose This Device, copy the command from TypingMind, and run it in Terminal. Keep the process running while you use MCP.

Copy the setup command shown by TypingMind.
Open Terminal on macOS or Windows Terminal on Windows.
Paste and run the command.
Approve the package install if Terminal asks you to proceed.
Keep the Terminal window running while using MCP tools.

Add Groq as a server

When the connector status is Ready, click Edit Servers and paste the MCP server configuration.

Wait until the connector status shows Ready.
Click Edit Servers.
Paste the Groq MCP server configuration.
Save the server list.
Refresh if you want to confirm the connector is still ready.

TypingMind MCP settings showing active server and Edit Servers button

Use it across models

Save the server list, open Plugins, enable the Groq MCP tools, then select any supported AI model in TypingMind and use the tools in chat or assign them to an AI agent.

Open the Plugins page in TypingMind.
Enable the Groq MCP tools.
Start a chat and choose the AI model you want to use.
Use the MCP tools in chat or assign them to an AI agent.
Switch to another AI model whenever needed without reconnecting MCP.

TypingMind chat using enabled MCP tools with a selected AI model

Frequently asked questions

What is the Groq MCP server used for?

Groq is an MCP server that lets compatible AI clients connect to external tools and context. In TypingMind, you can add this MCP server once and make its tools available in your AI workspace.

Can I use Groq MCP with multiple AI models in TypingMind?

Yes. TypingMind connects MCP tools at the workspace level, so you can use Groq with different AI models such as Claude, ChatGPT, Gemini, or other models you have configured in TypingMind without setting up the MCP server separately for each model.

Why use Groq MCP with TypingMind?

TypingMind is one of the best frontends for LLM chat because it brings multiple AI models, prompts, plugins, AI agents, API keys, and MCP tools into one workspace. With Groq connected, you can use its MCP tools across your preferred models while keeping your chat workflow organized in TypingMind.

How do I connect Groq MCP to TypingMind?

Groq runs through the TypingMind local MCP connector. This is best when the MCP server needs access to local files, desktop apps, command-line tools, or private resources on your computer.

What tools does Groq MCP provide in TypingMind?

Groq exposes 15 MCP tools that can be enabled from the TypingMind Plugins page and used in chat or assigned to AI agents.

Do I need to share my API keys with TypingMind to use Groq MCP?

No. TypingMind is local-first and lets you keep your model providers, API keys, prompts, and MCP configuration under your control. If Groq requires authentication, add the required headers, OAuth settings, or local configuration for that MCP server when you create the connection.

Related MCP Servers

View all

Knowledge Graph Memory

modelcontextprotocol

OrganizationPopular

Model Context Protocol Servers

Context7

Context7 Platform -- Up-to-date code documentation for LLMs and AI code editors

Blender

Experience seamless AI-powered 3D modeling by connecting Blender with Claude AI via the Model Context Protocol. BlenderMCP enables two-way communication, allowing you to create, modify, and inspect 3D scenes directly through AI prompts. Control objects, materials, lighting, and execute Python code in Blender effortlessly. Access assets from Poly Haven and generate AI-driven models using Hyper3D Rodin. This integration enhances creative workflows by combining Blender’s robust tools with Claude’s intelligent guidance, making 3D content creation faster, interactive, and more intuitive. Perfect for artists and developers seeking AI-assisted 3D design within Blender’s environment.

Google GenAI Toolbox

MCP Toolbox for Databases is an open source MCP server for databases.

Installation

Available Tools

Use Groq MCP with multiple AI models

Setup guide to use the local connector

Open the MCP settings

Run the connector command

Add Groq as a server

Use it across models

Frequently asked questions

Related MCP Servers

Set up your own AI workspace now