GLM-4.7

Latest GLM with enhanced reasoning capabilities.

204,800-token context
Starting at $0.38/M input tokens (tiered)
Starting at $1.98/M output tokens (10% off, tiered)
Streaming
Tools
Reasoning
JSON Output

All Providers for GLM-4.7

LLM Gateway routes requests to the best providers that are able to handle your prompt size and parameters.
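The routing rule above can be illustrated with a small sketch. This is not LLM Gateway's actual implementation: the `Provider` class and `eligible_providers` function are hypothetical names, "best" is assumed here to mean the lowest listed input price, and the context sizes are read from this page's listing with "k" taken as 1,000 tokens. Alibaba Cloud is represented by its lowest tier only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Provider:
    name: str
    context_tokens: int      # assumption: "200k" on this page means 200,000 tokens
    input_usd_per_m: float   # listed input price per million tokens


# Values copied from the provider listing on this page (Z AI at its discounted price).
PROVIDERS = [
    Provider("Z AI", 200_000, 0.54),
    Provider("NovitaAI", 204_800, 0.60),
    Provider("Cerebras", 200_000, 2.25),
    Provider("ByteDance", 200_000, 0.60),
    Provider("Together AI", 202_800, 0.45),
    Provider("Alibaba Cloud", 202_800, 0.431),  # lowest tier only
    Provider("EmberCloud", 200_000, 0.38),
]


def eligible_providers(prompt_tokens: int, providers=PROVIDERS) -> list:
    """Keep providers whose context window fits the prompt, cheapest input first.

    Hypothetical ranking rule: the real gateway may weigh other factors.
    """
    fits = [p for p in providers if p.context_tokens >= prompt_tokens]
    return sorted(fits, key=lambda p: p.input_usd_per_m)


if __name__ == "__main__":
    for p in eligible_providers(201_000):
        print(p.name, p.input_usd_per_m)
```

Under these assumptions, a 201,000-token prompt leaves only the three providers with context windows above 200k, while a prompt larger than 204,800 tokens leaves none.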

Z AI (10% off)
Context: 200k
Input: $0.54/M tokens (was $0.6)
Cached: $0.099/M tokens (was $0.11)
Output: $1.98/M tokens (was $2.2)
Search: + $0.009 per search (was $0.010)
NovitaAI
Context: 204.8k
Input: $0.6/M tokens
Cached: $0.11/M tokens
Output: $2.2/M tokens
Cerebras
Context: 200k
Input: $2.25/M tokens
Cached: not listed
Output: $2.75/M tokens
ByteDance
Context: 200k
Input: $0.6/M tokens
Cached: $0.11/M tokens
Output: $2.2/M tokens
Together AI
Context: 202.8k
Input: $0.45/M tokens
Cached: not listed
Output: $2/M tokens
Alibaba Cloud
Context: 202.8k
Input: $0.431/M tokens
Cached: not listed
Output: $2.007/M tokens
Tiered Pricing (per M tokens)
≤32K tokens: $0.431 input, $2.007 output
>32K tokens: $0.574 input, $2.294 output
EmberCloud
Context: 200k
Input: $0.38/M tokens
Cached: $0.19/M tokens
Output: $1.98/M tokens
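
As a worked example of the tiered Alibaba Cloud prices listed above, the sketch below estimates the cost of a single request. Several things are assumptions rather than facts from this page: that the tier is chosen by the request's input token count, that "32K" means 32,000 tokens, that the whole request is billed at the selected tier's rates, and that no cached-token discount applies. The function name `alibaba_cost_usd` is hypothetical.

```python
TIER_LIMIT_TOKENS = 32_000  # assumption: "32K" on the page means 32,000 tokens

# (input, output) prices in USD per million tokens, from the Tiered Pricing rows.
LOW_TIER = (0.431, 2.007)   # requests with <= 32K input tokens
HIGH_TIER = (0.574, 2.294)  # requests with > 32K input tokens


def alibaba_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate one request's cost, assuming the tier is picked by input size
    and applied to the whole request (not billed marginally)."""
    in_rate, out_rate = LOW_TIER if input_tokens <= TIER_LIMIT_TOKENS else HIGH_TIER
    return (input_tokens * in_rate + output_tokens * out_rate) / 1_000_000


if __name__ == "__main__":
    # 10,000 input + 1,000 output tokens falls in the lower tier.
    print(f"{alibaba_cost_usd(10_000, 1_000):.6f}")
    # 50,000 input + 1,000 output tokens falls in the higher tier.
    print(f"{alibaba_cost_usd(50_000, 1_000):.6f}")
```

Under these assumptions the first request costs about $0.0063 and the second about $0.0310, so crossing the 32K boundary raises both the input and the output rate for that request.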