AI Models Directory
Browse and compare 180+ AI models from OpenAI, Anthropic, Google, and 30+ providers - filter by capabilities, pricing, and context size.
Use Case
Capabilities
Provider
Input Price ($/M tokens)
Output Price ($/M tokens)
Context Size (tokens)
109/268
Models
27/36
Providers
68
Vision Models (filtered)
106
Tool-enabled (filtered)
2
Free Models (filtered)
GLM-4.6V Flash
glm
glm-4.6v-flashStreaming
Vision
Tools
Reasoning
JSON Output
Z AI
Context: 128k
Input
$0.00
/M tokens
Cached
$0.00
/M tokens
Output
$0.00
/M tokens
GLM-4.6V FlashX
glm
glm-4.6v-flashxStreaming
Vision
Tools
Reasoning
JSON Output
Z AI
Context: 128k
Input
$0.04
/M tokens
Cached
$0.00
/M tokens
Output
$0.40
/M tokens
GLM-4.6V
glm
glm-4.6vStreaming
Vision
Tools
Reasoning
JSON Output
Z AI
Context: 128k
Input
$0.30
/M tokens
Cached
$0.05
/M tokens
Output
$0.90
/M tokens
GLM-4.6
glm
glm-4.6Streaming
Tools
Reasoning
JSON Output
Native Web Search
Z AI
Context: 200k
Input
$0.60
/M tokens
Cached
$0.11
/M tokens
Output
$2.20
/M tokens
+ $0.010 per search
GLM-4.7 Flash (Free)
glm
glm-4.7-flash-freeStreaming
Tools
Reasoning
JSON Output
Z AI
Context: 200k
Input
$0.00
/M tokens
Cached
$0.00
/M tokens
Output
$0.00
/M tokens
GLM-4.7 FlashX
glm
glm-4.7-flashxStreaming
Tools
Reasoning
JSON Output
Z AI
Context: 200k
Input
$0.07
/M tokens
Cached
$0.01
/M tokens
Output
$0.40
/M tokens
GLM-4.7
glm
glm-4.7Streaming
Tools
Reasoning
JSON Output
Native Web Search
Z AI
Context: 200k
Input
$0.60
/M tokens
Cached
$0.11
/M tokens
Output
$2.20
/M tokens
+ $0.010 per search
GLM-4.5 X
glm
glm-4.5-xStreaming
Tools
Reasoning
JSON Output
Z AI
Context: 128k
Input
$2.20
/M tokens
Cached
$0.45
/M tokens
Output
$8.90
/M tokens
GLM-4.5V
glm
glm-4.5vStreaming
Vision
Tools
Reasoning
JSON Output
Z AI
Context: 128k
Input
$0.60
/M tokens
Cached
$0.11
/M tokens
Output
$1.80
/M tokens
GLM-4.5
glm
glm-4.5Streaming
Tools
Reasoning
JSON Output
Native Web Search
Z AI
Context: 128k
Input
$0.60
/M tokens
Cached
$0.11
/M tokens
Output
$2.20
/M tokens
+ $0.010 per search
GLM-5
glm
glm-5Streaming
Tools
Reasoning
JSON Output
Native Web Search
Structured JSON Output
Z AI
Context: 202.8k
Input
$1.00
/M tokens
Cached
$0.20
/M tokens
Output
$3.20
/M tokens
+ $0.010 per search
GLM-5.1
glm
glm-5.1Streaming
Tools
Reasoning
JSON Output
Native Web Search
Structured JSON Output
Z AI
Context: 200k
Input
$1.40
/M tokens
Cached
$0.26
/M tokens
Output
$4.40
/M tokens
+ $0.010 per search
Seed 1.8 (251228)
bytedance
seed-1-8-251228Streaming
Vision
Tools
Reasoning
JSON Output
ByteDance
Context: 256k
Input
$0.25
/M tokens
Cached
$0.05
/M tokens
Output
$2.00
/M tokens
Seed 1.6 Flash (250715)
bytedance
seed-1-6-flash-250715Streaming
Vision
Tools
Reasoning
JSON Output
ByteDance
Context: 256k
Input
$0.07
/M tokens
Cached
$0.01
/M tokens
Output
$0.30
/M tokens
Seed 1.6 (250915)
bytedance
seed-1-6-250915Streaming
Vision
Tools
Reasoning
JSON Output
ByteDance
Context: 256k
Input
$0.25
/M tokens
Cached
$0.05
/M tokens
Output
$2.00
/M tokens
Seed 1.6 (250615)
bytedance
seed-1-6-250615Streaming
Vision
Tools
Reasoning
JSON Output
ByteDance
Context: 256k
Input
$0.25
/M tokens
Cached
$0.05
/M tokens
Output
$2.00
/M tokens
Qwen3.6 35B A3B
alibaba
qwen3.6-35b-a3bStreaming
Vision
Tools
Reasoning
JSON Output
Native Web Search
Alibaba Cloud
Context: 262.1k
Input
$0.25
/M tokens
Cached
—
/M tokens
Output
$1.48
/M tokens
+ $0.010 per search
Qwen3.6 Plus
alibaba
qwen3.6-plusStreaming
Vision
Tools
Reasoning
JSON Output
Native Web Search
Alibaba Cloud
Context: 262.1k
Input
$0.50
/M tokens
Cached
$0.05
/M tokens
Output
$3.00
/M tokens
+ $0.010 per search
Qwen3.6 Max Preview
alibaba
qwen3.6-max-previewStreaming
Tools
Reasoning
JSON Output
Alibaba Cloud
Context: 262.1k
Input
$1.30
/M tokens
Cached
$0.13
/M tokens
Output
$7.80
/M tokens
Qwen3 Max 2026-01-23
alibabaScheduled for Deactivation
qwen3-max-2026-01-23Streaming
Vision
Tools
Reasoning
JSON Output
Alibaba Cloud
Context: 262.1k
Deactivating on Jul 8, 2026
Input
$1.20
/M tokens
Cached
$0.24
/M tokens
Output
$6.00
/M tokens
Tiered pricing available
IN
Cached
OUT
<= 32K tokens
$1.20
$0.24
$6.00
<= 128K tokens
$2.40
$0.48
$12.00
<= 252K tokens
$3.00
$0.60
$15.00
Qwen3 VL 235B A22B Thinking
alibaba
qwen3-vl-235b-a22b-thinkingStreaming
Vision
Reasoning
Alibaba Cloud
Context: 131.1k
Deactivating on Jul 8, 2026
Input
$0.50
/M tokens
Cached
—
/M tokens
Output
$2.00
/M tokens
QwQ Plus
alibaba
qwq-plusStreaming
Reasoning
Alibaba Cloud
Context: 131.1k
Input
$0.80
/M tokens
Cached
—
/M tokens
Output
$2.40
/M tokens
Qwen3.5 397B A17B
alibaba
qwen35-397b-a17bStreaming
Vision
Tools
Reasoning
JSON Output
Native Web Search
Alibaba Cloud
Context: 262.1k
Input
$0.60
/M tokens
Cached
—
/M tokens
Output
$3.60
/M tokens
+ $0.010 per search
Qwen3 VL 30B A3B Thinking
alibaba
qwen3-vl-30b-a3b-thinkingStreaming
Vision
Tools
Reasoning
JSON Output
NovitaAI
Context: 131.1k
Input
$0.20
/M tokens
Cached
—
/M tokens
Output
$1.00
/M tokens
Qwen3.7 Plus
alibaba
qwen3.7-plusStreaming
Vision
Tools
Reasoning
JSON Output
Alibaba Cloud
Context: 1M
Input
$0.40
/M tokens
Cached
$0.08
/M tokens
Output
$1.60
/M tokens
Tiered pricing available
IN
Cached
OUT
<= 256K tokens
$0.40
$0.08
$1.60
>256K tokens
$1.20
$0.24
$4.80
Qwen3.7 Max
alibaba
qwen3.7-maxStreaming
Tools
Reasoning
JSON Output
Native Web Search
Alibaba Cloud
Context: 1M
Input
$2.50
/M tokens
Cached
$0.50
/M tokens
Output
$7.50
/M tokens
+ $0.010 per search
Qwen3 Max
alibaba
qwen3-maxStreaming
Vision
Tools
Reasoning
JSON Output
Alibaba Cloud
Context: 256k
Input
$3.00
/M tokens
Cached
$0.60
/M tokens
Output
$15.00
/M tokens
Qwen3 Next 80B A3B Thinking
alibaba
qwen3-next-80b-a3b-thinkingStreaming
Tools
Reasoning
NovitaAI
Context: 131.1k
Input
$0.15
/M tokens
Cached
—
/M tokens
Output
$1.50
/M tokens
Qwen3 30B A3B Thinking 2507
alibabaModel Deactivated
qwen3-30b-a3b-thinking-2507Streaming
Tools
Reasoning
JSON Output
Nebius AI
Context: 262k
Deactivated since Apr 25, 2026
Input
$0.10
/M tokens
Cached
—
/M tokens
Output
$0.30
/M tokens
Qwen3 235B A22B Thinking 2507
alibaba
qwen3-235b-a22b-thinking-2507Streaming
Tools
Reasoning
JSON Output
Nebius AI
Context: 262k
Input
$0.20
/M tokens
Cached
—
/M tokens
Output
$0.60
/M tokens
Kimi K2.7 Code
moonshot
kimi-k2.7-codeStreaming
Vision
Tools
Reasoning
JSON Output
Moonshot AI
Context: 262.1k
Input
$0.95
/M tokens
Cached
$0.19
/M tokens
Output
$4.00
/M tokens
Kimi K2.6
moonshot
kimi-k2.6Streaming
Vision
Tools
Reasoning
JSON Output
Moonshot AI
Context: 262.1k
Input
$0.95
/M tokens
Cached
$0.16
/M tokens
Output
$4.00
/M tokens
Kimi K2.5
moonshot
kimi-k2.5Streaming
Vision
Tools
Reasoning
JSON Output
Moonshot AI
Context: 262.1k
Input
$0.60
/M tokens
Cached
$0.10
/M tokens
Output
$3.00
/M tokens
Kimi K2 Thinking Turbo
moonshot
kimi-k2-thinking-turboStreaming
Tools
Reasoning
JSON Output
Moonshot AI
Context: 262.1k
Input
$1.15
/M tokens
Cached
$0.15
/M tokens
Output
$8.00
/M tokens
Kimi K2 Thinking
moonshot
kimi-k2-thinkingStreaming
Tools
Reasoning
JSON Output
ByteDance
Context: 256k
Input
$0.60
/M tokens
Cached
$0.12
/M tokens
Output
$2.50
/M tokens
MiniMax Text 01
minimax
minimax-text-01Streaming
Tools
Reasoning
MiniMax
Context: 1M
Input
$0.20
/M tokens
Cached
—
/M tokens
Output
$1.10
/M tokens
MiniMax M2.1 Lightning
minimax
minimax-m2.1-lightningStreaming
Tools
Reasoning
MiniMax
Context: 196.6k
Input
$0.12
/M tokens
Cached
—
/M tokens
Output
$0.48
/M tokens
MiniMax M2.1
minimax
minimax-m2.1Streaming
Tools
Reasoning
JSON Output
NovitaAI
Context: 204.8k
Input
$0.30
/M tokens
Cached
$0.03
/M tokens
Output
$1.20
/M tokens
MiniMax M2
minimax
minimax-m2Streaming
Tools
Reasoning
MiniMax
Context: 196.6k
Input
$0.20
/M tokens
Cached
$0.03
/M tokens
Output
$1.00
/M tokens
MiniMax M2.5 Highspeed
minimax
minimax-m2.5-highspeedStreaming
Tools
Reasoning
MiniMax
Context: 204.8k
Input
$0.60
/M tokens
Cached
$0.03
/M tokens
Output
$2.40
/M tokens
MiniMax M2.5
minimax
minimax-m2.5Streaming
Tools
Reasoning
JSON Output
Structured JSON Output
MiniMax
Context: 204.8k
Input
$0.30
/M tokens
Cached
$0.03
/M tokens
Output
$1.20
/M tokens
MiniMax M2.7 Highspeed
minimax
minimax-m2.7-highspeedStreaming
Tools
Reasoning
MiniMax
Context: 204.8k
Input
$0.60
/M tokens
Cached
$0.06
/M tokens
Output
$2.40
/M tokens
MiniMax M2.7
minimax
minimax-m2.7Streaming
Tools
Reasoning
JSON Output
Structured JSON Output
MiniMax
Context: 204.8k
Input
$0.30
/M tokens
Cached
$0.06
/M tokens
Output
$1.20
/M tokens
MiniMax M3
minimax
minimax-m3Streaming
Vision
Tools
Reasoning
JSON Output
MiniMax
Context: 1.0M
Input
$0.60
/M tokens
Cached
$0.12
/M tokens
Output
$2.40
/M tokens
DeepSeek V4 Flash
deepseek
deepseek-v4-flashStreaming
Tools
Reasoning
JSON Output
NovitaAI
Context: 1.1M
Input
$0.14
/M tokens
Cached
$0.03
/M tokens
Output
$0.28
/M tokens
DeepSeek V4 Pro
deepseek
deepseek-v4-proStreaming
Tools
Reasoning
JSON Output
Structured JSON Output
Together AI
Context: 163.8k
Input
$1.74
/M tokens
Cached
$0.20
/M tokens
Output
$3.48
/M tokens
DeepSeek V3.2
deepseek
deepseek-v3.2Streaming
Tools
JSON Output
Reasoning
DeepSeek
Context: 163.8k
Deactivated since May 1, 2026
Input
$0.28
/M tokens
Cached
$0.03
/M tokens
Output
$0.42
/M tokens
DeepSeek V3.1
deepseek
deepseek-v3.1Streaming
Tools
Reasoning
ByteDance
Context: 128k
Input
$0.56
/M tokens
Cached
$0.11
/M tokens
Output
$1.68
/M tokens
MiMo V2 Flash
xiaomiScheduled for Deactivation
mimo-v2-flashStreaming
Tools
Reasoning
JSON Output
Xiaomi
Context: 256k
Deactivating on Jun 18, 2026
Input
$0.10
/M tokens
Cached
$0.02
/M tokens
Output
$0.30
/M tokens
MiMo V2.5
xiaomi
mimo-v2.5Streaming
Vision
Tools
Reasoning
JSON Output
Xiaomi
Context: 1M
Input
$0.14
/M tokens
Cached
$0.03
/M tokens
Output
$0.28
/M tokens