How to use Perplexity API Key for AI chat

PerplexityAbout Perplexity
Perplexity AI is an AI-powered search engine that combines large language models with real-time web search to deliver accurate, cited answers with up-to-date information. Key features include Pro Search (access to GPT-5, Claude 4, Gemini 2.5 Pro, Grok 4, and proprietary Sonar models), Deep Research (autonomous multi-step research synthesizing hundreds of sources into comprehensive reports with PDF export), Comet Browser (free AI-powered Chromium browser with conversational web control and sidecar assistant for summaries and automation), and API access for enterprise integration.
Pro subscribers enjoy unlimited Deep Research queries, advanced model selection, domain-specific searches, file uploads, and image generation/editing tools, while free users get limited access to core features with real-time citations. Access at perplexity.ai with API documentation available for Pro users.
Step by step guide to use Perplexity API Key to chat with AI
1. Get Your Perplexity API Key
First, you'll need to obtain an API key from Perplexity. This key allows you to access their AI models directly and pay only for what you use.
- Visit Perplexity's API console
- Sign up or log in to your account
- Navigate to the API keys section
- Generate a new API key (copy it immediately as some providers only show it once)
- Save your API key in a secure password manager or encrypted note
2. Connect Your Perplexity API Key on TypingMind
Once you have your Perplexity API key, connecting it to TypingMind to chat with AI is straightforward:
- Open TypingMind in your browser
- Click the "Settings" icon (gear symbol)
- Navigate to "Models" section
- Click "Add Custom Model"
- Fill in the model information:Name:
sonar-reasoning via Perplexity(or your preferred name)Endpoint:https://api.perplexity.ai/chat/completionsModel ID:sonar-reasoningfor example (check Perplexity model list)Context Length: Enter the model's context window (e.g., 32000 for sonar-reasoning)
sonar-reasoninghttps://api.perplexity.ai/chat/completionssonar-reasoning via Perplexityhttps://www.typingmind.com/model-logo.webp32000 - Add custom headers by clicking "Add Custom Headers" in the Advanced Settings section:Authorization:
Bearer <PERPLEXITY_API_KEY>:X-Title:typingmind.comHTTP-Referer:https://www.typingmind.com - Enable "Support Plugins (via OpenAI Functions)" if the model supports the "functions" or "tool_calls" parameter, or enable "Support OpenAI Vision" if the model supports vision.
- Click "Test" to verify the configuration
- If you see "Nice, the endpoint is working!", click "Add Model"
3. Start Chatting with Perplexity models
Now you can start chatting with Perplexity models through TypingMind:
- Select your preferred Perplexity model from the model dropdown menu
- Start typing your message in the chat input
- Enjoy faster responses and better features than the official interface
- Switch between different AI models as needed

sonar-reasoning
- Use specific, detailed prompts for better responses (How to use Prompt Library)
- Create AI agents with custom instructions for repeated tasks (How to create AI Agents)
- Use plugins to extend Perplexity capabilities (How to use plugins)
- Upload documents and images directly to chat for AI analysis and discussion (Chat with documents)
4. Monitor Your AI Usage and Costs
One of the biggest advantages of using API keys with TypingMind is cost transparency and control. Unlike fixed subscriptions, you pay only for what you actually use. Visit https://www.perplexity.ai/account/api/billing to monitor your Perplexity API usage and set spending limits.
| Feature | Perplexity Subscription Plans | Using Perplexity API Keys |
|---|---|---|
| Cost Structure | ❌ Fixed monthly fee Pay even if you don't use it Pro Plan:$20/month or $200/year Enterprise Pro:$40/month per user | ✅ Pay only for actual usage $0 when you don't use it |
| Usage Limits | ❌ Hard daily/hourly caps You have to wait for the next period to use it again | ✅ Unlimited usage No limits. Only limited by your budget |
| Model Access | ❌ Platform decides available models Old models get discontinued | ✅ Access to all API models Including older & specialized versions |
- Use less expensive models for simple tasks
- Keep prompts concise but specific to reduce token usage
- Use TypingMind's prompt caching to reduce repeat costs (How to enable prompt caching)
- Using RAG (retrieval-augmented generation) for large documents to reduce repeat costs (How to use RAG)










