GLM-4.7

Latest GLM with enhanced reasoning capabilities.

glm-4.7

STABLEGet Started View uptime

204,800 context

Starting at $0.38/M input tokens (tiered)

Starting at $1.98/M (10% off) output tokens (tiered)

Streaming

Tools

Reasoning

JSON Output

Select Provider

All Providers for GLM-4.7

LLM Gateway routes requests to the best providers that are able to handle your prompt size and parameters.

Z AI

Context: 200k10% off

Input

$0.6$0.54

10% off

/M tokens

Cached

$0.11$0.099

10% off

/M tokens

Output

$2.2$1.98

10% off

/M tokens

+ $0.010$0.009 per search

Get Started

NovitaAI

Context: 204.8k

Input

$0.6

/M tokens

Cached

$0.11

/M tokens

Output

$2.2

/M tokens

Get Started

Cerebras

Context: 200k

Input

$2.25

/M tokens

Cached

—

/M tokens

Output

$2.75

/M tokens

Get Started

ByteDance

Context: 200k

Input

$0.6

/M tokens

Cached

$0.11

/M tokens

Output

$2.2

/M tokens

Get Started

Together AI

Context: 202.8k

Input

$0.45

/M tokens

Cached

—

/M tokens

Output

/M tokens

Get Started

Alibaba Cloud

Context: 202.8k

Input

$0.431

/M tokens

Cached

—

/M tokens

Output

$2.007

/M tokens

Tiered Pricing

OUT

≤32K tokens

$0.431

$2.007

>32K tokens

$0.574

$2.294

Get Started

EmberCloud

Context: 200k

Input

$0.38

/M tokens

Cached

$0.19

/M tokens

Output

$1.98

/M tokens

Get Started

GLM-4.7

Select Provider

All Providers for GLM-4.7

Stay ahead of the curve

Support

Welcome!