Z.AI · Single modality

glm-4.7-flash

Model id: zai/glm-4.7-flash

GLM-4.7-Flash balances high performance with efficiency, making it well suited to lightweight deployment. Beyond coding, it is also recommended for creative writing, translation, long-context tasks, and roleplay.

Type: Reasoning · Tool use
Use glm-4.7-flash
// Drop-in OpenAI-compatible client
import { generateText } from 'ai'

const { text } = await generateText({
  model: 'zai/glm-4.7-flash',
  baseURL: 'https://synapse.garden/api/v1',
  apiKey: process.env.MG_KEY,
  prompt: 'Why is the sky blue?',
})
CONTEXT WINDOW: 200K
MAX OUTPUT: 131K
INPUT · PER M: $0.077
OUTPUT · PER M: $0.440
PRICING

List prices, every modality.

Rate      Per million tokens · USD
Input     $0.077
Output    $0.440
Honest list prices · How we calculate prices
FAQ · GLM-4.7-FLASH

Frequently asked

01 / 04

How do I call glm-4.7-flash from my code?

Use the OpenAI or Anthropic SDK and point baseURL at https://synapse.garden/api/v1. Set model: 'zai/glm-4.7-flash' and supply your Synapse Garden API key. Beyond the base URL, model name, and key, no code changes are needed.
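
As a rough sketch of what such a call could look like at the HTTP level, the TypeScript below builds a chat request without sending it. The `/chat/completions` path, the `Bearer` authorization header, and the `buildChatRequest` helper name are assumptions based on the usual OpenAI-compatible convention; this page does not confirm them.

```typescript
// Hypothetical helper: assembles an OpenAI-style chat request without sending it.
const BASE_URL = 'https://synapse.garden/api/v1'
const MODEL_ID = 'zai/glm-4.7-flash'

interface ChatRequest {
  url: string
  init: { method: string; headers: Record<string, string>; body: string }
}

function buildChatRequest(apiKey: string, prompt: string): ChatRequest {
  return {
    // Assumed path, following the OpenAI chat completions convention.
    url: `${BASE_URL}/chat/completions`,
    init: {
      method: 'POST',
      headers: {
        'Content-Type': 'application/json',
        // Assumed auth scheme: bearer token carrying the API key.
        Authorization: `Bearer ${apiKey}`,
      },
      body: JSON.stringify({
        model: MODEL_ID,
        messages: [{ role: 'user', content: prompt }],
      }),
    },
  }
}
```

If the endpoint behaves as assumed, passing the result to `fetch(request.url, request.init)` would send the request.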

02 / 04

How much does glm-4.7-flash cost?

Input: $0.077 per million tokens. Output: $0.440 per million tokens. The free tier includes a million tokens every month at no cost.
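
To make the arithmetic concrete, here is a minimal TypeScript sketch that estimates the list-price cost of a request from its token counts. The `estimateCostUsd` name is hypothetical, and the sketch ignores the free tier, since this page does not say how free tokens are applied.

```typescript
// List prices quoted on this page, in USD per million tokens.
const INPUT_USD_PER_MILLION = 0.077
const OUTPUT_USD_PER_MILLION = 0.44

// Hypothetical helper: list-price estimate only, free tier not applied.
function estimateCostUsd(inputTokens: number, outputTokens: number): number {
  const inputCost = (inputTokens / 1_000_000) * INPUT_USD_PER_MILLION
  const outputCost = (outputTokens / 1_000_000) * OUTPUT_USD_PER_MILLION
  return inputCost + outputCost
}
```

For example, one million input tokens plus one million output tokens comes to about $0.517 at these rates.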

03 / 04

What's the context window for glm-4.7-flash?

glm-4.7-flash supports a context window of 200K tokens, with a maximum output of 131K tokens.
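
A client could check these limits before sending a request. The TypeScript sketch below makes two assumptions that this page does not confirm: that "200K" and "131K" mean 200,000 and 131,000 tokens, and that prompt and output tokens share the same context window. The `fitsBudget` name is hypothetical.

```typescript
// Assumption: "200K" and "131K" are read as 200,000 and 131,000 tokens.
const CONTEXT_WINDOW_TOKENS = 200_000
const MAX_OUTPUT_TOKENS = 131_000

// Hypothetical helper: true when the request fits both limits.
function fitsBudget(promptTokens: number, maxOutputTokens: number): boolean {
  if (promptTokens < 0 || maxOutputTokens < 0) return false
  // The requested output must not exceed the output cap.
  if (maxOutputTokens > MAX_OUTPUT_TOKENS) return false
  // Assumption: prompt and output share the same context window.
  return promptTokens + maxOutputTokens <= CONTEXT_WINDOW_TOKENS
}
```

Under these assumptions, a 100,000-token prompt leaves room for at most 100,000 output tokens, even though the output cap alone would allow more.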

04 / 04

Do I need a separate Anthropic or OpenAI account?

No. Synapse Garden is the single API surface — one key gives you OpenAI, Anthropic, Google, Meta, Mistral, DeepSeek, xAI, Cohere, and more. Billing, rate limits, and audit logs are unified.

READY

Try glm-4.7-flash in three minutes.

Sign up, create a key, drop our base URL into your existing client. The free tier includes a million tokens every month — no credit card.