Access and Use cogito-v2-preview-llama-109b-moe via OpenRouter

An instruction-tuned, hybrid-reasoning Mixture-of-Experts model built on Llama-4-Scout-17B-16E. Cogito v2 can answer directly or engage an extended “thinking” phase, with alignment guided by Iterated Distillation & Amplification (IDA). It targets coding, STEM, instruction following, and general helpfulness, with stronger multilingual, tool-calling, and reasoning performance than size-equivalent baselines. The model supports long-context use (up to 10M tokens) and standard Transformers workflows. Users can control the reasoning behaviour with the reasoning enabled boolean. Learn more in our docs

Cogito V2 Preview Llama 109B Overview

Full NameCogito V2 Preview Llama 109B
ProviderCogito V2 Preview Llama 109B
Model IDdeepcogito/cogito-v2-preview-llama-109b-moe
Release DateSep 2, 2025
Context Window32,767 tokens
Pricing /1M tokens$0.00000018 for input
$0.00000059 for output
Supported Input Typesimage, text
Supported Parametersfrequency_penaltyinclude_reasoninglogit_biasmax_tokensmin_ppresence_penaltyreasoningrepetition_penaltystoptemperaturetool_choicetoolstop_ktop_p

Complete Setup Guide

1

Create OpenRouter Account

  1. Visit openrouter.ai
  2. Click "Sign In" and create an account (free)
  3. Verify your email address
  4. You'll receive $1 in free credits to test models
2

Get Your OpenRouter API Key

  1. Log in to OpenRouter dashboard
  2. Go to "API Keys" section in the menu
  3. Click "Create API Key"
  4. Give it a name (e.g., "TypingMind")
  5. Copy your API key (starts with "sk-or-v1-...")
OpenRouter API Keys
3

Add Credits to OpenRouter (Optional)

  1. Go to "Credits" in OpenRouter dashboard
  2. Click "Add Credits"
  3. Choose amount ($5 minimum, $20 recommended for testing)
  4. Complete payment (credit card or crypto)
  5. Credits never expire!
4

Configure TypingMind with OpenRouter API Key

Method 1: Direct Import (Recommended)

  1. Open TypingMind in your browser
  2. Click the "Settings" icon (gear symbol)
  3. Navigate to "Manage Models" section
  4. Click "Add Custom Model"
  5. Select "Import OpenRouter" from the options
  6. Enter your OpenRouter API key from Step 1
  7. Click "Check API Key" to verify the connection
  8. Choose which models you want to add from the list (you can add multiple at once)
  9. Click "Import Models" to complete the setup
The best frontend AI chat for OpenRouter API Key

Method 2: Manual Custom Model Setup

  1. Open TypingMind in your browser
  2. Click the "Settings" icon (gear symbol)
  3. Navigate to "Models" section
  4. Click "Add Custom Model"
  5. Fill in the model information:
    Name: deepcogito/cogito-v2-preview-llama-109b-moe via OpenRouter (or your preferred name)
    Endpoint: https://openrouter.ai/api/v1/chat/completions
    Model ID: deepcogito/cogito-v2-preview-llama-109b-moe
    Context Length: Enter the model's context window (e.g., 32767 for deepcogito/cogito-v2-preview-llama-109b-moe)
    OpenRouter Endpoint URL input fielddeepcogito/cogito-v2-preview-llama-109b-moehttps://openrouter.ai/api/v1/chat/completionsdeepcogito/cogito-v2-preview-llama-109b-moe via OpenRouterhttps://www.typingmind.com/model-logo.webp32767
  6. Add custom headers by clicking "Add Custom Headers" in the Advanced Settings section:
    Authorization: Bearer <OPENROUTER_API_KEY>:
    X-Title: typingmind.com
    HTTP-Referer: https://www.typingmind.com
  7. Enable "Support Plugins (via OpenAI Functions)" if the model supports the "functions" or "tool_calls" parameter, or enable "Support OpenAI Vision" if the model supports vision.
  8. Click "Test" to verify the configuration
  9. If you see "Nice, the endpoint is working!", click "Add Model"
5

Start chatting with deepcogito/cogito-v2-preview-llama-109b-moe

Now you can start chatting with the deepcogito/cogito-v2-preview-llama-109b-moe model via OpenRouter on TypingMind:

  • Select your preferred deepcogito/cogito-v2-preview-llama-109b-moe model from the model dropdown menu
  • Start typing your message in the chat input
  • Enjoy faster responses and better features than the official interface
  • Switch between different AI models as needed
The best frontend AI chat for deepcogito/cogito-v2-preview-llama-109b-moe via OpenRouter API KeyThe best frontend AI chat for deepcogito/cogito-v2-preview-llama-109b-moe via OpenRouter API Keydeepcogito/cogito-v2-preview-llama-109b-moeThe best frontend AI chat for deepcogito/cogito-v2-preview-llama-109b-moe via OpenRouter API Key

Pro tips for better results:

Why TypingMind + OpenRouter?

  • Best-in-class UI: TypingMind's interface is far superior to standard chat UIs
  • Model flexibility: Switch between Cogito V2 Preview Llama 109B and 200+ models instantly
  • Cost control: Pay only for what you use through OpenRouter
  • One-time purchase: Buy TypingMind once, use forever with any OpenRouter model
  • Data privacy: Your conversations stored locally, not on external servers

Try TypingMind for free now!

Frequently Asked Questions

Do I need a subscription to use Cogito V2 Preview Llama 109B?

No! Through OpenRouter, you pay only for what you use with no monthly subscription. Add credits to your OpenRouter account and they never expire. TypingMind is also a one-time purchase, not a subscription.

How much will it cost to use Cogito V2 Preview Llama 109B?

It costs 0.00000018 for input and 0.00000059 for output via OpenRouter. A typical conversation might cost $0.01-0.10 depending on length. Start with $5-10 in credits to test.

Can I use other models besides Cogito V2 Preview Llama 109B?

Yes! With OpenRouter + TypingMind, you get access to 200+ models including GPT-4, Claude, Gemini, Llama, Mistral, and many more. Switch between them instantly in TypingMind.

Is my data private and secure?

Yes! TypingMind stores conversations locally (web version in browser, desktop version on your device). OpenRouter handles API calls securely and doesn't train on your data. Check each provider's data policy for specifics.

Can I use Cogito V2 Preview Llama 109B for commercial projects?

Yes! Check Cogito V2 Preview Llama 109B's terms of service for specific commercial use policies. OpenRouter and TypingMind both support commercial use.

What if Cogito V2 Preview Llama 109B is unavailable?

OpenRouter allows you to configure fallback models. If Cogito V2 Preview Llama 109B is down, it can automatically route to your backup choice. You can also manually switch models in TypingMind anytime.

How do I cancel or get a refund?

OpenRouter: No subscriptions to cancel. Unused credits remain in your account forever.