Ann NgAnn Ng
5 min read

How to use Qwen 3.5 35B A3B from DeepInfra with API Key on TypingMind

Learn how to access and use Qwen 3.5 35B A3B with your DeepInfra API key through TypingMind. Get started with this powerful AI model in minutes.

DeepInfra is a cloud inference platform providing fast and cost-effective access to a wide range of open-source AI models including Llama, Mistral, DeepSeek, and more through an OpenAI-compatible API.

Key features include serverless inference with pay-per-token pricing, support for text generation, embeddings, and image models, low-latency responses powered by optimized GPU infrastructure, and easy integration with any OpenAI-compatible client or SDK.

Official Documentation: https://deepinfra.com/models

Qwen 3.5 35B A3B Overview

Model NameQwen 3.5 35B A3B
ProviderDeepInfra
Model IDQwen/Qwen3.5-35B-A3B
Release DateFeb 1, 2026
Last UpdatedApr 20, 2026
Knowledge Cutoff2025-01
Context Window262,144 tokens
Max Output81,920 tokens
Pricing /1M tokens$0.2 input
$0.95 output
Input Modalitiestext, image, video
Output Modalitiestext
Capabilities
File UploadReasoningTool CallingTemperature ControlOpen Weights

Complete Setup Guide

1

Get Your DeepInfra API Key

First, you'll need to obtain an API key from DeepInfra. This key allows you to access their AI models directly and pay only for what you use.

  1. Visit DeepInfra's API console
  2. Sign up or log in to your account
  3. Navigate to the API keys section
  4. Generate a new API key (copy it immediately as some providers only show it once)
  5. Save your API key in a secure password manager or encrypted note

⚠️ Important: Keep your API key secure and never share it publicly. Store it safely as you'll need it to connect with TypingMind.

2

Configure TypingMind with DeepInfra API Key

  1. Open TypingMind in your browser
  2. Click the "Settings" icon (gear symbol)
  3. Navigate to "Models" section
  4. Click "Add Custom Model"
  5. Fill in the model information:
    Name: Qwen/Qwen3.5-35B-A3B via DeepInfra (or your preferred name)
    Endpoint: https://api.deepinfra.com/v1/chat/completions
    Model ID: Qwen/Qwen3.5-35B-A3B
    Context Length: Enter the model's context window (e.g., 262144 for Qwen/Qwen3.5-35B-A3B)
    DeepInfra Endpoint URL input fieldQwen/Qwen3.5-35B-A3Bhttps://api.deepinfra.com/v1/chat/completionsQwen/Qwen3.5-35B-A3B via DeepInfrahttps://www.typingmind.com/model-logo.webp262144
  6. Add custom headers by clicking "Add Custom Headers" in the Advanced Settings section:
    Authorization: Bearer <DEEPINFRA_API_KEY>:
    X-Title: typingmind.com
    HTTP-Referer: https://www.typingmind.com
  7. Enable "Support Plugins (via OpenAI Functions)" if the model supports the "functions" or "tool_calls" parameter, or enable "Support OpenAI Vision" if the model supports vision.
  8. Click "Test" to verify the configuration
  9. If you see "Nice, the endpoint is working!", click "Add Model"
3

Start chatting with Qwen 3.5 35B A3B

Now you can start chatting with Qwen 3.5 35B A3B through TypingMind:

  • Select Qwen 3.5 35B A3B from the model dropdown menu
  • Start typing your message in the chat input
  • Enjoy faster responses and better features than the official interface
  • Switch between different AI models as needed
The best frontend AI chat for Qwen/Qwen3.5-35B-A3B via OpenRouter API KeyThe best frontend AI chat for Qwen/Qwen3.5-35B-A3B via OpenRouter API KeyQwen/Qwen3.5-35B-A3BThe best frontend AI chat for Qwen/Qwen3.5-35B-A3B via OpenRouter API Key

💡 Pro tips for better results:

Frequently Asked Questions

Do I need a subscription to use Qwen 3.5 35B A3B?

No! With DeepInfra API, you pay only for what you use with no monthly subscription. Add credits to your DeepInfra account and pay as you go. TypingMind is also a one-time purchase, not a subscription.

How much will it cost to use Qwen 3.5 35B A3B?

Qwen 3.5 35B A3B costs $0.2/1M input tokens and $0.95/1M output tokens. A typical conversation might cost $0.01-0.10 depending on length.

Can I use other models besides Qwen 3.5 35B A3B?

Yes! With DeepInfra API + TypingMind, you can access all DeepInfra models. Switch between them instantly in TypingMind.

Is my data private and secure?

Yes! TypingMind stores conversations locally (web version in browser, desktop version on your device). DeepInfra handles API calls securely. Check DeepInfra's data policy for specifics.

Can I use Qwen 3.5 35B A3B for commercial projects?

Yes! Check DeepInfra's terms of service for specific commercial use policies. TypingMind supports commercial use.