- qwen/qwen3.7-max
qwen/qwen3.7-max
- context · $1.0000 / M input tokens · $2.9700 / M output tokens
Qwen3.7 Max is a Qwen route on OurToken for developers evaluating a higher-capability Qwen 3.7 option for chat, coding, reasoning, and production assistant workflows.
All systems operational.
Pricing
Pay-per-use
No upfront costs, pay only for what you use
API Usage
API Access Guide
Code examples
Use the OurToken API endpoint for this model. The examples below use direct HTTP requests and the recommended endpoint for the model family.
curl https://api.ourtoken.ai/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer YOUR_API_KEY" \
-d '{
"model": "qwen3.7-max",
"messages": [
{
"role": "user",
"content": "Hello!"
}
],
"max_tokens": 256
}'Chat Completions API Reference
Create a chat response with the OpenAI Chat Completions-compatible endpoint. Use https://api.ourtoken.ai/v1 as the SDK Base URL and POST /chat/completions as the endpoint.
Authorization
| Content-Type | application/json |
| Authorization | Bearer YOUR_API_KEY |
Request Body
| Field | Type | Required | Description |
|---|---|---|---|
| model | string | Required | Model ID to call. |
| messages | array<object> | Required | Conversation messages sent to the model. |
| max_tokens | integer | Optional | Maximum number of output tokens. |
| temperature | number | Optional | Sampling temperature. |
| top_p | number | Optional | Nucleus sampling parameter. |
| stream | boolean | Optional | Whether to return a streaming response. |
| stream_options | object | Optional | Additional options for streaming responses. |
| tools | array<object> | Optional | Tools available to the model. |
| tool_choice | string | object | Optional | Controls how the model selects tools. |
| response_format | object | Optional | Controls structured output, such as JSON object responses. |
Response Body
| Field | Type | Required | Description |
|---|---|---|---|
| id | string | Required | Unique chat completion identifier. |
| object | "chat.completion" | Required | Object type returned by the Chat Completions API. |
| created | integer | Required | Unix timestamp when the response was created. |
| model | string | Required | Model that produced the response. |
| choices | array<object> | Required | Candidate responses returned by the model. |
| choices[].message.role | string | Required | Role of the returned chat message. |
| choices[].message.content | string | Optional | Text content in the returned chat message. |
| choices[].finish_reason | string | Optional | Reason generation stopped. |
| usage | object | Optional | Token usage information for the chat completion. |
| usage.prompt_tokens | integer | Optional | Input token count. |
| usage.completion_tokens | integer | Optional | Output token count. |
| usage.total_tokens | integer | Optional | Total token count. |
| usage.prompt_tokens_details | object | Optional | Breakdown of input token usage. |
| usage.prompt_tokens_details.cached_tokens | integer | Optional | Tokens served from cache. |
Model Introduction
Qwen qwen3.7-max
Qwen3.7 Max is a Qwen route on OurToken for developers evaluating a higher-capability Qwen 3.7 option for chat, coding, reasoning, and production assistant workflows.
Use Qwen3.7 Max when your team wants to evaluate the higher-end Qwen route before choosing a production default. OurToken keeps the model ID, API examples, availability status, and qwen 3.7 max pricing close together so developers can test the model with real prompts.
Why It Looks Great
- Higher-capability Qwen 3.7 route for evaluation and production testing.
- OpenAI-compatible chat completions setup through the OurToken endpoint.
- Dedicated route page for model ID, code examples, and 60% of official price pricing review.
- Useful for comparing benchmark claims against real prompts and logs.
- Clean path from Qwen discovery into API implementation.
Key Features
- Model ID: qwen3.7-max
- Provider: Qwen
- Input price: $1.0000 per 1M tokens on OurToken
- Output price: $2.9700 per 1M tokens on OurToken
- Cache read price: $0.1980 per 1M tokens on OurToken
- Cache write price: $1.2380 per 1M tokens on OurToken
- API endpoint: chat completions
- Evaluation focus: reasoning, coding, multilingual chat, and benchmark validation
Specifications
qwen 3.7 max api Features for Developers
Use qwen 3.7 max api access to review qwen 3.7 max pricing at 60% of official price and test benchmark claims.
API Access
Call qwen 3.7 max api through the OurToken unified endpoint with the qwen3.7-max model ID. This gives developers a direct route for testing Qwen 3.7 prompts while keeping API keys, request examples, and usage review in one place.
Pricing Review
Review qwen 3.7 max pricing before scaling traffic. OurToken lists $1.0000 input, $2.9700 output, $0.1980 cache read, and $1.2380 cache write per 1M tokens, using official references of $1.65, $4.951, $0.33, and $2.063.
Free Claims
Searches for qwen 3.7 max free often mix official trials, provider credits, third-party playgrounds, and community claims. Treat those sources as access research, then confirm whether your OurToken account has applicable balance, route availability, or promotions before testing.
Benchmark Testing
Use qwen 3.7 max benchmark claims as prompts for your own evaluation instead of treating them as a guarantee. Compare coding quality, reasoning stability, latency, tool behavior, and cost against the tasks your product will actually run.
Coding Workflows
Qwen3.7 Max can be evaluated for code explanation, debugging plans, repository questions, and agent-style implementation prompts. Keep results in logs, compare output quality with other routes, and avoid choosing a default model from benchmark summaries alone.
Production Fit
Before routing production traffic, test qwen 3.7 max api with realistic prompt size, expected output length, retry behavior, and user-facing latency targets. The best model choice should reflect your workload, not only provider positioning or directory rankings.
How to Use qwen 3.7 max api on OurToken
Create an API key, use qwen3.7-max, compare 60% of official price pricing, run tests, and monitor usage.
Create Key
Create an OurToken API key from the dashboard and store it in a secure server-side environment variable. This gives your backend a stable way to test qwen 3.7 max api without exposing credentials in browser code.
01Copy Model
Use qwen3.7-max as the model value in your request body. Keeping the exact model ID in configuration helps developers avoid casing mistakes while comparing Qwen routes across local tests, staging traffic, and production deployments.
02Call Endpoint
Send requests to the OurToken chat completions endpoint with your API key, model ID, and prompt payload. Existing OpenAI-compatible request patterns can usually be reused after changing the base URL, credential, and model value.
03Review Pricing
Before scaling usage, review qwen 3.7 max pricing: $1.0000 input, $2.9700 output, $0.1980 cache read, and $1.2380 cache write per 1M tokens. Compare those rows with expected prompt size, output length, and request volume.
04Test Benchmark
Build your own qwen 3.7 max benchmark suite with real coding, reasoning, retrieval, and assistant prompts. Compare outputs against acceptance criteria, not only public leaderboard claims or one-off provider examples.
05Monitor Cost
After testing, review request count, token usage, failures, latency, and spend in OurToken history. This helps decide whether qwen 3.7 max api should become a default route or remain an evaluation option.
06qwen 3.7 max api FAQ
Answers about qwen 3.7 max pricing, qwen 3.7 max free access claims, benchmark evaluation, model ID, and provider comparison.