How to use DeepSeek R1 Distill Llama 70B from Chutes with API Key on TypingMind

Learn how to access and use DeepSeek R1 Distill Llama 70B with your Chutes API key through TypingMind. Get started with this powerful AI model in minutes.

Chutes.ai is a decentralized serverless AI compute platform built on Bittensor Subnet 64, enabling developers to deploy, run, and scale AI models without managing infrastructure. The platform processes nearly 160 billion tokens daily serving over 400,000 users with up to 90% lower costs than traditional providers through a distributed network of GPU miners compensated with TAO tokens. Key features include always-hot serverless compute with instant inference, model-agnostic support for LLMs, image, and audio models plus custom code, fully abstracted infrastructure handling provisioning and scaling automatically, standardized API access with OpenRouter integration, and open pay-per-use pricing. The roadmap includes long-running jobs, fine-tuning capabilities, AI agents, and Trusted Execution Environments for enhanced privacy, with a startup accelerator offering up to $20,000 in credits.

Official Documentation: https://llm.chutes.ai/v1/models

DeepSeek R1 Distill Llama 70B Overview

Model Name	DeepSeek R1 Distill Llama 70B
Provider	Chutes
Model ID	`deepseek-ai/DeepSeek-R1-Distill-Llama-70B`
Release Date	Dec 29, 2025
Last Updated	Apr 25, 2026
Context Window	131,072 tokens
Max Output	131,072 tokens
Pricing /1M tokens	$0.027 input $0.109 output $0.014 cache read
Input Modalities	text
Output Modalities	text
Capabilities	ReasoningTool CallingTemperature ControlOpen Weights

Complete Setup Guide

Get Your Chutes API Key

First, you'll need to obtain an API key from Chutes. This key allows you to access their AI models directly and pay only for what you use.

Visit Chutes's API console
Sign up or log in to your account
Navigate to the API keys section
Generate a new API key (copy it immediately as some providers only show it once)
Save your API key in a secure password manager or encrypted note

⚠️ Important: Keep your API key secure and never share it publicly. Store it safely as you'll need it to connect with TypingMind.

Configure TypingMind with Chutes API Key

Open TypingMind in your browser
Click the "Settings" icon (gear symbol)
Navigate to "Models" section
Click "Add Custom Model"
Fill in the model information:
Name: deepseek-ai/DeepSeek-R1-Distill-Llama-70B via Chutes (or your preferred name)
Endpoint: https://llm.chutes.ai/v1/chat/completions
Model ID: deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Context Length: Enter the model's context window (e.g., 131072 for deepseek-ai/DeepSeek-R1-Distill-Llama-70B)
deepseek-ai/DeepSeek-R1-Distill-Llama-70Bhttps://llm.chutes.ai/v1/chat/completionsdeepseek-ai/DeepSeek-R1-Distill-Llama-70B via Chuteshttps://www.typingmind.com/model-logo.webp131072
Add custom headers by clicking "Add Custom Headers" in the Advanced Settings section:
Authorization: Bearer <CHUTES_API_KEY>:
X-Title: typingmind.com
HTTP-Referer: https://www.typingmind.com
Enable "Support Plugins (via OpenAI Functions)" if the model supports the "functions" or "tool_calls" parameter, or enable "Support OpenAI Vision" if the model supports vision.
Click "Test" to verify the configuration
If you see "Nice, the endpoint is working!", click "Add Model"

Start chatting with DeepSeek R1 Distill Llama 70B

Now you can start chatting with DeepSeek R1 Distill Llama 70B through TypingMind:

Select DeepSeek R1 Distill Llama 70B from the model dropdown menu
Start typing your message in the chat input
Enjoy faster responses and better features than the official interface
Switch between different AI models as needed

deepseek-ai/DeepSeek-R1-Distill-Llama-70B

💡 Pro tips for better results:

Use specific, detailed prompts for better responses (How to use Prompt Library)
Create AI agents with custom instructions for repeated tasks (How to create AI Agents)
Use plugins to extend DeepSeek R1 Distill Llama 70B capabilities (How to use plugins)

Frequently Asked Questions

Do I need a subscription to use DeepSeek R1 Distill Llama 70B?

No! With Chutes API, you pay only for what you use with no monthly subscription. Add credits to your Chutes account and pay as you go. TypingMind is also a one-time purchase, not a subscription.

How much will it cost to use DeepSeek R1 Distill Llama 70B?

DeepSeek R1 Distill Llama 70B costs $0.0272/1M input tokens and $0.1087/1M output tokens. A typical conversation might cost $0.01-0.10 depending on length.

Can I use other models besides DeepSeek R1 Distill Llama 70B?

Yes! With Chutes API + TypingMind, you can access all Chutes models. Switch between them instantly in TypingMind.

Is my data private and secure?

Yes! TypingMind stores conversations locally (web version in browser, desktop version on your device). Chutes handles API calls securely. Check Chutes's data policy for specifics.

Can I use DeepSeek R1 Distill Llama 70B for commercial projects?

Yes! Check Chutes's terms of service for specific commercial use policies. TypingMind supports commercial use.

How to use DeepSeek R1 Distill Llama 70B from Chutes with API Key on TypingMind

DeepSeek R1 Distill Llama 70B Overview

Complete Setup Guide

Get Your Chutes API Key

Configure TypingMind with Chutes API Key

Start chatting with DeepSeek R1 Distill Llama 70B

Frequently Asked Questions

Explore more

Use Hermes 4.3 36B from chutes with API Key

Use Hermes 4 70B from chutes with API Key

Use Hermes 4 14B from chutes with API Key

Use Hermes 4 405B FP8 TEE from chutes with API Key

Use DeepHermes 3 Mistral 24B Preview from chutes with API Key

Use dots.ocr from chutes with API Key

Use Kimi K2 Instruct 0905 from chutes with API Key

Use Kimi K2 Thinking TEE from chutes with API Key

Use MiniMax M2.1 TEE from chutes with API Key

Use NVIDIA Nemotron 3 Nano 30B A3B BF16 from chutes with API Key

Use DeepSeek R1T Chimera from chutes with API Key

Use DeepSeek TNG R1T2 Chimera TEE from chutes with API Key

Set up your own AI workspace now