GLM 4.5 FP8
submodelOpen weights
GLM 4.5 FP8 by submodel costs $0.20 per 1M input tokens and $0.80 per 1M output tokens, with a 131K-token context window.
Pricing
Input (per 1M tokens)
$0.20
Output (per 1M tokens)
$0.80
Cached input (per 1M)
—
Specifications
- Provider
- submodel
- Context window
- 131K tokens
- Parameters
- —
- Released
- Jul 2025
- Open weights
- Yes
- Frontier model
- No
Compare GLM 4.5 FP8 with…
FAQ
Pricing is per 1M tokens (USD); confirm with the provider before production use.