minimax/minimax-m3

$0.2400 / M इनपुट टोकन · $0.9600 / M आउटपुट टोकन

MiniMax M3 OurToken पर MiniMax model route है, उन developers के लिए जिन्हें coding, agent workflows, long-context tasks, multimodal evaluation और production assistants के लिए hosted API access चाहिए।

API Key प्राप्त करें

24H स्थिति मॉनिटर

100% अपटाइम

8 घंटे पहलेअभी

उपलब्ध

2026-07-24 10:39:09 UTC

मूल्य निर्धारण

उपयोग के अनुसार भुगतान

कोई अग्रिम लागत नहीं, केवल उतने के लिए भुगतान करें जितना आप उपयोग करते हैं

40% of official price

इनपुट$0.60 / M$0.2400 / M टोकन

आउटपुट$2.40 / M$0.9600 / M टोकन

कैश किया गया इनपुट$0.12 / M$0.0480 / M टोकन

कैश लेखन$0 / M$0 / M टोकन

API उपयोग

API एक्सेस गाइड

बेस URLhttps://api.ourtoken.ai/v1

API एंडपॉइंटchat/completions

पूरा URLhttps://api.ourtoken.ai/v1/chat/completions

मॉडल IDminimax-m3

API Key प्राप्त करें

कोड उदाहरण

इस मॉडल के लिए OurToken API endpoint का उपयोग करें। नीचे दिए गए उदाहरण direct HTTP requests और मॉडल परिवार के लिए recommended endpoint का उपयोग करते हैं।

curl https://api.ourtoken.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "minimax-m3",
    "messages": [
      {
        "role": "user",
        "content": "Hello!"
      }
    ],
    "max_tokens": 256
  }'

Chat Completions API संदर्भ

OpenAI Chat Completions-संगत endpoint के साथ chat response बनाएँ। SDK Base URL के रूप में https://api.ourtoken.ai/v1 और endpoint के रूप में POST /chat/completions का उपयोग करें।

प्राधिकरण

Content-Type	application/json
Authorization	Bearer YOUR_API_KEY

अनुरोध सामग्री

फ़ील्ड	प्रकार	आवश्यक	विवरण
model	string	आवश्यक	कॉल करने के लिए Model ID।
messages	array<object>	आवश्यक	model को भेजे गए conversation messages।
max_tokens	integer	वैकल्पिक	output tokens की अधिकतम संख्या।
temperature	number	वैकल्पिक	Sampling temperature।
top_p	number	वैकल्पिक	Nucleus sampling parameter।
stream	boolean	वैकल्पिक	क्या streaming response लौटाना है।
stream_options	object	वैकल्पिक	streaming responses के लिए अतिरिक्त options।
tools	array<object>	वैकल्पिक	model के लिए उपलब्ध tools।
tool_choice	string \| object	वैकल्पिक	model tools कैसे चुनता है, इसे नियंत्रित करता है।
response_format	object	वैकल्पिक	structured output को नियंत्रित करता है, जैसे JSON object responses।

प्रतिक्रिया सामग्री

फ़ील्ड	प्रकार	आवश्यक	विवरण
id	string	आवश्यक	unique chat completion identifier।
object	"chat.completion"	आवश्यक	Chat Completions API द्वारा लौटाया गया object type।
created	integer	आवश्यक	response बनाए जाने का Unix timestamp।
model	string	आवश्यक	वह model जिसने response बनाया।
choices	array<object>	आवश्यक	model द्वारा लौटाए गए candidate responses।
choices[].message.role	string	आवश्यक	लौटाए गए chat message की role।
choices[].message.content	string	वैकल्पिक	लौटाए गए chat message में text content।
choices[].finish_reason	string	वैकल्पिक	generation रुकने का कारण।
usage	object	वैकल्पिक	chat completion के लिए token usage information।
usage.prompt_tokens	integer	वैकल्पिक	Input token count।
usage.completion_tokens	integer	वैकल्पिक	Output token count।
usage.total_tokens	integer	वैकल्पिक	Total token count।
usage.prompt_tokens_details	object	वैकल्पिक	input token usage का breakdown।
usage.prompt_tokens_details.cached_tokens	integer	वैकल्पिक	cache से served tokens।

मॉडल परिचय

MiniMax minimax-m3

MiniMax M3 teams को application work के लिए MiniMax route देता है, जहां long context, coding workflows, multimodal prompts और predictable API pricing मायने रखते हैं। जब आप model IDs, usage logs, cache costs और price review को एक dashboard में रखते हुए OurToken unified API के माध्यम से MiniMax workflows test करना चाहते हों, तब MiniMax M3 API का उपयोग करें।

यह बेहतरीन क्यों है

Input, output और cache read tokens के लिए official MiniMax M3 reference price का 40%।
अन्य supported models द्वारा उपयोग किए जाने वाले उसी OurToken endpoint के माध्यम से OpenAI-compatible API setup।
Cache write $0 के रूप में listed है, जबकि standard input, output और cache read tokens paid categories रहते हैं।
Separate provider-specific integration के बिना coding agents, long-context tasks, tool-use experiments और multimodal workflows evaluate करने के लिए उपयोगी।
Dashboard logs और usage visibility teams को launch के बाद request cost review करने में मदद करते हैं।

मुख्य विशेषताएँ

Model ID: minimax-m3
Input price: $0.2400 per 1M tokens on OurToken
Output price: $0.9600 per 1M tokens on OurToken
Cache read price: $0.0480 per 1M tokens on OurToken
Cache write price: $0 per 1M tokens on OurToken
Provider: MiniMax

विशिष्टताएँ

ProviderMiniMax

Model TypeLarge Language Model (LLM)

Model IDminimax-m3

Context Length1M tokens

Max Output512K tokens

InputText, image, video

OutputText

OurToken Input Price$0.2400 / 1M tokens

OurToken Output Price$0.9600 / 1M tokens

OurToken Cache Read Price$0.0480 / 1M tokens

OurToken Cache Write Price$0 / 1M tokens

Official Input Reference$0.60 / 1M tokens

Official Output Reference$2.40 / 1M tokens

Official Cache Read Reference$0.12 / 1M tokens

MiniMax M3 API Features

Unified MiniMax API access, transparent MiniMax M3 pricing, cache visibility, multimodal evaluation और production agent workflows के लिए MiniMax M3 API का उपयोग करें।

Unified Access

Model access, API key management और usage history को एक जगह रखते हुए OurToken के unified endpoint के माध्यम से MiniMax M3 API call करें। Model ID के रूप में minimax-m3 का उपयोग करें और coding agents, chat systems तथा long-context workflows के लिए OpenAI-compatible request patterns दोबारा उपयोग करें।

Pricing Clarity

Rollout से पहले MiniMax M3 pricing review करें। OurToken $0.2400 input और $0.9600 output per 1M tokens list करता है, ताकि teams coding, multimodal prompts और high-volume assistant workloads के लिए MiniMax M3 price estimate कर सकें।

Cache Costs

Explicit cache pricing के साथ cache behavior को normal prompt spend से अलग करें। OurToken पर MiniMax M3 API cache read $0.0480 per 1M tokens listed है, जबकि cache write $0 है, जो MiniMax M3 free case है जिसे users को स्पष्ट रूप से समझना चाहिए।

Agent Workflows

Coding agents, tool-use experiments और multi-step automation के लिए MiniMax M3 model evaluation का उपयोग करें। Competitor material agentic capability और OpenCode-style workflows highlight करता है, लेकिन teams को अपने prompts और acceptance criteria से Opencode MiniMax M3 behavior validate करना चाहिए।

Multimodal Context

Document review, repository analysis, visual inputs, video-grounded prompts और multi-turn collaboration जैसे long-context और multimodal tasks evaluate करें। Competitor material 1M context और native multimodality describe करता है, जिसे अपने production-like workload में test किया जाना चाहिए।

Deployment Choices

MiniMax M3 HuggingFace और MiniMax M3 Ollama जैसी searches के साथ hosted API access compare करें। OurToken local model hosting के बजाय managed API keys, usage logs, pricing visibility और simple integration पर focused है।

OurToken पर MiniMax M3 API का उपयोग कैसे करें

API key बनाएं, minimax-m3 copy करें, MiniMax M3 pricing compare करें, unified endpoint call करें, और real usage monitor करें।

API Key बनाएं

Dashboard से OurToken API key बनाएं और उसे secure server-side environment variable में store करें। इससे आपका backend client code, notebooks और public repositories से credentials बाहर रखते हुए MiniMax M3 API access कर सकता है।

Model ID Copy करें

अपने request body में model value के रूप में minimax-m3 का उपयोग करें। Exact MiniMax M3 model ID को configuration में रखने से developers local tests, staging traffic और production deployments में MiniMax API routes compare करते समय naming mistakes से बचते हैं।

Endpoint Call करें

अपनी API key, model ID और prompt payload के साथ OurToken unified API endpoint पर requests भेजें। Base URL, credential और model value बदलने के बाद existing OpenAI-compatible chat request patterns आमतौर पर दोबारा उपयोग किए जा सकते हैं।

Pricing Compare करें

Rollout से पहले MiniMax M3 API pricing compare करें: OurToken $0.2400 input, $0.9600 output और $0.0480 cache read per 1M tokens list करता है। Cache write $0 है, जो paid input और output से अलग MiniMax M3 free token category है।

Workflows Test करें

Scale करने से पहले representative coding, agent, long-context, image और video-input prompts run करें। यदि आप Opencode MiniMax M3 workflows evaluate कर रहे हैं, तो tool behavior, response quality, latency और token usage को अपने production acceptance criteria के विरुद्ध compare करें।

Cost Monitor करें

Launch के बाद request count, input tokens, output tokens, cache read tokens और spend के लिए history logs review करें। Real usage data teams को सिर्फ benchmark pages या provider listings पर निर्भर रहने के बजाय actual traffic के विरुद्ध MiniMax M3 price compare करने में मदद करता है।

MiniMax M3 API FAQ

MiniMax M3 API pricing, MiniMax API access, free cache-write usage, model setup, OpenCode workflows और deployment comparisons के बारे में उत्तर।

MiniMax M3 API क्या है?

MiniMax M3 API OurToken के माध्यम से उपलब्ध MiniMax M3 model route है, उन teams के लिए जो coding, agent, long-context और multimodal model तक hosted access चाहती हैं। Developers minimax-m3 model ID का उपयोग कर सकते हैं, OurToken API key बना सकते हैं और अन्य supported models द्वारा उपयोग किए जाने वाले उसी unified API flow से इसे call कर सकते हैं।

OurToken पर MiniMax M3 API pricing क्या है?

OurToken पर MiniMax M3 API pricing $0.2400 per 1M input tokens और $0.9600 per 1M output tokens है। Catalog cache read को $0.0480 per 1M tokens और cache write को $0 पर भी list करता है, ताकि teams traffic scale करने से पहले token category के अनुसार MiniMax M3 pricing estimate कर सकें।

क्या OurToken पर MiniMax M3 free है?

OurToken पर MiniMax M3 free usage cache write pricing को refer करता है, जो $0 per 1M tokens के रूप में listed है। Standard input, output और cache read tokens अभी भी paid categories हैं। Free cache write को useful cost detail मानें, free MiniMax M3 API plan नहीं।

Cache read और cache write के लिए MiniMax M3 price क्या है?

OurToken पर MiniMax M3 price for cache read $0.0480 per 1M cache read tokens है, official $0.12 reference की तुलना में। Cache write $0 per 1M tokens के रूप में listed है। इससे cache behavior को normal input और output spend से अलग track करना महत्वपूर्ण हो जाता है।

क्या मैं OurToken के माध्यम से Opencode MiniMax M3 workflows उपयोग कर सकता हूं?

आप OurToken के माध्यम से minimax-m3 call करके और coding, tool-use तथा multi-step agent prompts test करके Opencode MiniMax M3 workflows evaluate कर सकते हैं। Competitor material agentic और coding capability highlight करता है, लेकिन production decisions को repository prompts, latency, tool behavior और output quality की तुलना अपने acceptance criteria से करनी चाहिए।

MiniMax M3 HuggingFace, MiniMax M3 Ollama और OurToken API access कैसे compare होते हैं?

MiniMax M3 HuggingFace और MiniMax M3 Ollama model discovery या local-style deployment paths के लिए common searches हैं। OurToken API keys, usage logs और pricing visibility के साथ hosted MiniMax M3 API access पर focused है। आपकी priority managed API integration, local experimentation या model research है, इसके आधार पर चुनें।