Qwen

qwen/qwen3.7-max

- context · $1.0000 / M input tokens · $2.9700 / M output tokens

Qwen3.7 Max is a Qwen route on OurToken for developers evaluating a higher-capability Qwen 3.7 option for chat, coding, reasoning, and production assistant workflows.

Get API Key
Availability
100% uptime
10 minutes agonow

All systems operational.

Pricing

Pay-per-use

No upfront costs, pay only for what you use

60% of official price
Input$1.65 / M$1.0000 / M Tokens
Output$4.951 / M$2.9700 / M Tokens

API Usage

API Access Guide

Base URLhttps://api.ourtoken.ai/v1
API Endpointchat/completions
Full URLhttps://api.ourtoken.ai/v1/chat/completions
Model IDqwen3.7-max
Get API Key

Code examples

Use the OurToken API endpoint for this model. The examples below use direct HTTP requests and the recommended endpoint for the model family.

curl https://api.ourtoken.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "qwen3.7-max",
    "messages": [
      {
        "role": "user",
        "content": "Hello!"
      }
    ],
    "max_tokens": 256
  }'

Chat Completions API Reference

Create a chat response with the OpenAI Chat Completions-compatible endpoint. Use https://api.ourtoken.ai/v1 as the SDK Base URL and POST /chat/completions as the endpoint.

Authorization

Content-Typeapplication/json
AuthorizationBearer YOUR_API_KEY

Request Body

FieldTypeRequiredDescription
modelstringRequiredModel ID to call.
messagesarray<object>RequiredConversation messages sent to the model.
max_tokensintegerOptionalMaximum number of output tokens.
temperaturenumberOptionalSampling temperature.
top_pnumberOptionalNucleus sampling parameter.
streambooleanOptionalWhether to return a streaming response.
stream_optionsobjectOptionalAdditional options for streaming responses.
toolsarray<object>OptionalTools available to the model.
tool_choicestring | objectOptionalControls how the model selects tools.
response_formatobjectOptionalControls structured output, such as JSON object responses.

Response Body

FieldTypeRequiredDescription
idstringRequiredUnique chat completion identifier.
object"chat.completion"RequiredObject type returned by the Chat Completions API.
createdintegerRequiredUnix timestamp when the response was created.
modelstringRequiredModel that produced the response.
choicesarray<object>RequiredCandidate responses returned by the model.
choices[].message.rolestringRequiredRole of the returned chat message.
choices[].message.contentstringOptionalText content in the returned chat message.
choices[].finish_reasonstringOptionalReason generation stopped.
usageobjectOptionalToken usage information for the chat completion.
usage.prompt_tokensintegerOptionalInput token count.
usage.completion_tokensintegerOptionalOutput token count.
usage.total_tokensintegerOptionalTotal token count.
usage.prompt_tokens_detailsobjectOptionalBreakdown of input token usage.
usage.prompt_tokens_details.cached_tokensintegerOptionalTokens served from cache.

Model Introduction

Qwen qwen3.7-max

Qwen3.7 Max is a Qwen route on OurToken for developers evaluating a higher-capability Qwen 3.7 option for chat, coding, reasoning, and production assistant workflows.

Use Qwen3.7 Max when your team wants to evaluate the higher-end Qwen route before choosing a production default. OurToken keeps the model ID, API examples, availability status, and qwen 3.7 max pricing close together so developers can test the model with real prompts.

Why It Looks Great

  • Higher-capability Qwen 3.7 route for evaluation and production testing.
  • OpenAI-compatible chat completions setup through the OurToken endpoint.
  • Dedicated route page for model ID, code examples, and 60% of official price pricing review.
  • Useful for comparing benchmark claims against real prompts and logs.
  • Clean path from Qwen discovery into API implementation.

Key Features

  • Model ID: qwen3.7-max
  • Provider: Qwen
  • Input price: $1.0000 per 1M tokens on OurToken
  • Output price: $2.9700 per 1M tokens on OurToken
  • Cache read price: $0.1980 per 1M tokens on OurToken
  • Cache write price: $1.2380 per 1M tokens on OurToken
  • API endpoint: chat completions
  • Evaluation focus: reasoning, coding, multilingual chat, and benchmark validation

Specifications

ProviderQwen
Model IDqwen3.7-max
Model TypeLarge Language Model (LLM)
OurToken Input Price$1.0000 / 1M tokens
OurToken Output Price$2.9700 / 1M tokens
OurToken Cache Read Price$0.1980 / 1M tokens
OurToken Cache Write Price$1.2380 / 1M tokens
Official Input Reference$1.65 / 1M tokens
Official Output Reference$4.951 / 1M tokens
Official Cache Read Reference$0.33 / 1M tokens
Official Cache Write Reference$2.063 / 1M tokens
Context WindowReview current provider and route documentation
API Endpointhttps://api.ourtoken.ai/v1/chat/completions

qwen 3.7 max api Features for Developers

Use qwen 3.7 max api access to review qwen 3.7 max pricing at 60% of official price and test benchmark claims.

API Access

Call qwen 3.7 max api through the OurToken unified endpoint with the qwen3.7-max model ID. This gives developers a direct route for testing Qwen 3.7 prompts while keeping API keys, request examples, and usage review in one place.

Pricing Review

Review qwen 3.7 max pricing before scaling traffic. OurToken lists $1.0000 input, $2.9700 output, $0.1980 cache read, and $1.2380 cache write per 1M tokens, using official references of $1.65, $4.951, $0.33, and $2.063.

Free Claims

Searches for qwen 3.7 max free often mix official trials, provider credits, third-party playgrounds, and community claims. Treat those sources as access research, then confirm whether your OurToken account has applicable balance, route availability, or promotions before testing.

Benchmark Testing

Use qwen 3.7 max benchmark claims as prompts for your own evaluation instead of treating them as a guarantee. Compare coding quality, reasoning stability, latency, tool behavior, and cost against the tasks your product will actually run.

Coding Workflows

Qwen3.7 Max can be evaluated for code explanation, debugging plans, repository questions, and agent-style implementation prompts. Keep results in logs, compare output quality with other routes, and avoid choosing a default model from benchmark summaries alone.

Production Fit

Before routing production traffic, test qwen 3.7 max api with realistic prompt size, expected output length, retry behavior, and user-facing latency targets. The best model choice should reflect your workload, not only provider positioning or directory rankings.

How to Use qwen 3.7 max api on OurToken

Create an API key, use qwen3.7-max, compare 60% of official price pricing, run tests, and monitor usage.

Create Key

Create an OurToken API key from the dashboard and store it in a secure server-side environment variable. This gives your backend a stable way to test qwen 3.7 max api without exposing credentials in browser code.

01

Copy Model

Use qwen3.7-max as the model value in your request body. Keeping the exact model ID in configuration helps developers avoid casing mistakes while comparing Qwen routes across local tests, staging traffic, and production deployments.

02

Call Endpoint

Send requests to the OurToken chat completions endpoint with your API key, model ID, and prompt payload. Existing OpenAI-compatible request patterns can usually be reused after changing the base URL, credential, and model value.

03

Review Pricing

Before scaling usage, review qwen 3.7 max pricing: $1.0000 input, $2.9700 output, $0.1980 cache read, and $1.2380 cache write per 1M tokens. Compare those rows with expected prompt size, output length, and request volume.

04

Test Benchmark

Build your own qwen 3.7 max benchmark suite with real coding, reasoning, retrieval, and assistant prompts. Compare outputs against acceptance criteria, not only public leaderboard claims or one-off provider examples.

05

Monitor Cost

After testing, review request count, token usage, failures, latency, and spend in OurToken history. This helps decide whether qwen 3.7 max api should become a default route or remain an evaluation option.

06

qwen 3.7 max api FAQ

Answers about qwen 3.7 max pricing, qwen 3.7 max free access claims, benchmark evaluation, model ID, and provider comparison.

01

What is qwen 3.7 max api?

qwen 3.7 max api is the OurToken route page for calling the qwen3.7-max model through a unified API workflow. Developers can copy the model ID, create an API key, run chat completions requests, review current pricing at 60% of official price, and compare real outputs before choosing it for production traffic.
02

How should I check qwen 3.7 max pricing?

qwen 3.7 max pricing on OurToken is $1.0000 per 1M input tokens and $2.9700 per 1M output tokens. Cache read is $0.1980 per 1M tokens, and cache write is $1.2380 per 1M tokens. Official references are $1.65 input, $4.951 output, $0.33 cache read, and $2.063 cache write.
03

Is qwen 3.7 max free on OurToken?

qwen 3.7 max free searches often refer to trials, credits, playground access, or third-party promotions. Do not assume free production access on OurToken unless the dashboard or account terms show it. Treat free-access claims as research signals, then confirm route availability and billing before testing.
04

How should I interpret a qwen 3.7 max benchmark?

A qwen 3.7 max benchmark can help decide what to test, but it should not replace your own evaluation. Run representative prompts for coding, multilingual chat, reasoning, tool use, and latency, then compare quality, stability, token usage, and cost against your product requirements.
05

Which model ID should I use for Qwen3.7 Max?

Use qwen3.7-max as the model value when calling this route through OurToken. Keep it in configuration rather than hard-coding it across many files, because that makes it easier to compare qwen 3.7 max api with Qwen3.7 Plus or other provider routes later.
06

How does Qwen3.7 Max compare with other providers?

Compare Qwen3.7 Max by availability, integration effort, current pricing, latency, quality on your prompts, and support needs. Official Alibaba Cloud pages, OpenRouter, Fireworks, and analysis sites can inform research, but the final decision should come from OurToken route tests and production-like traffic.