Skip to content

/

No models found

© 2026 OpenRouter, Inc

Product

Chat
Rankings
Apps
Models
Providers
Pricing
Enterprise
Labs

Company

About
Blog
CareersHiring
Privacy
Terms of Service
Support
State of AI
Works With OR
Data

Developer

Documentation
API Reference
SDK
Status

Connect

Discord
GitHub
LinkedIn
X
YouTube

Wafer

Browse models provided by Wafer (Terms of Service)

2 models

Tokens processed on OpenRouter

DeepSeek: DeepSeek V4 FlashDeepSeek V4 Flash
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and high-throughput workloads, while maintaining strong reasoning and coding performance. The model includes hybrid attention for efficient long-context processing. Reasoning efforts `high` and `xhigh` are supported; `xhigh` maps to max reasoning. It is well suited for applications such as coding assistants, chat systems, and agent workflows where responsiveness and cost efficiency are important.
by deepseekApr 24, 20261.05M context$0.09/M input tokens$0.18/M output tokens
Z.ai: GLM 5.1GLM 5.1
GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks. Unlike previous models built around minute-level interactions, GLM-5.1 can work independently and continuously on a single task for more than 8 hours, autonomously planning, executing, and improving itself throughout the process, ultimately delivering complete, engineering-grade results.
by z-aiApr 7, 2026203K context$1/M input tokens$3.20/M output tokens