Skip to content
Seedance 2.0 Face is here — generate video from real-person reference photos.Try it now
Back to Models
qwenqwen·Apr 27, 2026

Qwen: Qwen3.6 Flash

qwen/qwen3.6-flash
context
1M
Max Output
65.5K
Input / 1M
$0.25
Output / 1M
$1.50

About

Qwen3.6 Flash is the speed-first variant in Alibaba's Qwen 3.6 series. It accepts text, image, and video input with a 1M token context window and includes an integrated thinking mode. The model supports tool use and structured output, and prompt caching is available with separate cache read and cache write pricing. Pricing is tiered, with higher rates applied to requests above the 256K token threshold.

Capabilities

Context Length
1M
Max Output
65.5K
Reasoning
Yes
In
text, image, video
Out
text
Type256K>256K
Input$0.25$1.00
Output$1.50$4.00
Cache Write$0.3125$0.3125

OpenAI-compatible · Model ID qwen/qwen3.6-flash

curl https://api.elliotgate.com/v1/chat/completions \
  -H "Authorization: Bearer sk-omg-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "qwen/qwen3.6-flash",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'