Moonshot AI / Kimi API Guide for Overseas Developers — Long-Context LLM Access
Complete guide to accessing Moonshot AI and Kimi API from overseas. Covers 128K+ context windows, Moonshot K2 model capabilities, TokenPAPA relay access, pricing, and Python code examples without a Chinese phone.
Moonshot AI / Kimi API Guide for Overseas Developers — Access China's Long-Context LLM Powerhouse
Published: June 24, 2026 · 10 min read
Why Moonshot AI Matters for Overseas Developers
Moonshot AI is one of the most innovative AI startups to emerge from China in 2024-2026, known for pioneering the longest context windows in the industry. Founded by Yang Zhilin (a Tsinghua PhD and former Google AI researcher), Moonshot AI quickly became a major player in China's rapidly expanding LLM ecosystem with their flagship product Kimi — a long-context AI assistant that rivals ChatGPT for document-intensive workflows.
What makes Moonshot AI stand out is their relentless focus on context window size. While most LLMs standardize around 128K tokens (roughly 200 pages of text), Moonshot AI has pushed beyond 1 million tokens in certain configurations, enabling use cases that other models simply cannot handle in a single pass — entire legal case files, multi-hundred-page research papers, full codebases, and complete book-length documents.
For overseas developers, Moonshot AI's models are accessible through API relay platforms like TokenPAPA without the traditional barriers of Chinese-only registration and payment systems.
What makes Moonshot AI especially interesting for overseas developers:
- Industry-leading context windows — Moonshot AI native 128K context with experimental support for up to 1M tokens, exceeding both GPT-4o and DeepSeek V3 on maximum context length
- Long-document optimized architecture — The model architecture is specifically designed for retrieval and reasoning over extended contexts, with attention mechanisms optimized for long sequences
- Competitive pricing — At $0.22/1M input tokens via TokenPAPA, Moonshot K2 is cheaper than both GPT-4o ($2.50/1M) and DeepSeek V3 ($0.27/1M)
- OpenAI-compatible API — Standard chat completions format works with any OpenAI SDK client
- Strong Chinese-English bilingual performance — Matches top Chinese LLMs on bilingual tasks while offering superior long-context handling
According to independent evaluations from the LMSYS Chatbot Arena and community benchmarks (June 2026), Moonshot AI's models achieve competitive scores on standard NLP benchmarks while leading in metrics specifically measuring long-context comprehension and retrieval — tasks where most LLMs degrade significantly beyond 32K tokens.
Key insight: Moonshot AI is the only Chinese LLM provider that has made long-context the core differentiator rather than an afterthought. While GPT-4o and DeepSeek V3 both support 128K context, Moonshot AI's architecture is designed from the ground up for extended sequences, meaning it retains accuracy and coherence far longer than models that add context as a retrofitted feature. For document-heavy workloads, Moonshot AI is the specialist choice.
What Are Moonshot AI and Kimi?
Moonshot AI (月之暗面 — literally "Dark Side of the Moon") was founded in March 2023 by Yang Zhilin, who previously led research at Google AI and holds a PhD from Tsinghua University. The company raised significant funding from Alibaba, Sequoia Capital China, and other major investors, reaching a valuation exceeding $3 billion by 2025.
Kimi (kimi.moonshot.cn) is their consumer-facing product — a long-context AI assistant that operates similarly to ChatGPT but with a specific focus on processing and reasoning over extremely long documents. Kimi has become widely popular in China for research, legal document review, financial analysis, and academic work.
The Moonshot Model Family
| Model | Context Window | Best For |
|---|---|---|
| Moonshot K2 (flagship) | 128K tokens | General-purpose, long-document analysis, research |
| Moonshot v1-128k | 128K tokens | Stable production-grade long-context |
| Moonshot v1-32k | 32K tokens | Lighter, faster inference for standard tasks |
Moonshot AI Pricing Comparison
| Model | Input (per 1M tokens) | Output (per 1M tokens) |
|---|---|---|
| Moonshot K2 (via TokenPAPA) | $0.22 | $0.88 |
| Moonshot v1-128k (via TokenPAPA) | $0.18 | $0.72 |
| Direct Moonshot AI via kimi.moonshot.cn | Varies (CNY pricing) | Varies (CNY pricing) |
Direct pricing from Moonshot AI's official platform is denominated in Chinese Yuan and requires a Chinese bank card, Alipay, or WeChat Pay. Relay platforms like TokenPAPA provide fixed USD pricing without any Chinese payment requirements — US credit cards and PayPal accepted.
Key insight: Moonshot K2 at $0.22/1M input tokens is the most competitively priced long-context specialist LLM available to overseas developers. It undercuts DeepSeek V3 on input pricing while offering the same or better context handling, and is roughly 11x cheaper than GPT-4o. For developers building document analysis pipelines, this pricing unlocks production use cases that would be cost-prohibitive with Western API providers.
Moonshot K2 vs GPT-4o vs DeepSeek V3: Head-to-Head Comparison
Here is a direct comparison of Moonshot K2 against its main competitors, based on published benchmark data and community evaluations as of June 2026:
| Dimension | Moonshot K2 | DeepSeek V3 | GPT-4o |
|---|---|---|---|
| Input price/1M tokens | $0.22 | $0.27 | $2.50 |
| Output price/1M tokens | $0.88 | $1.10 | $10.00 |
| General knowledge (MMLU) | 83% | 88% | 89% |
| Long-context retrieval (>64K) | ★★★★★ | ★★★★☆ | ★★★★☆ |
| Coding (HumanEval) | 80% | 92% | 89% |
| Document summarization | ★★★★★ | ★★★★☆ | ★★★★☆ |
| Bilingual (Chinese-English) | ★★★★☆ | ★★★★☆ | ★★★☆☆ |
| Instruction following | ★★★★☆ | ★★★★☆ | ★★★★★ |
| Context window | 128K (up to 1M experimental) | 128K | 128K |
| Chatbot Arena ELO | ~1,220 | ~1,350 | ~1,380 |
When to Choose Moonshot K2
- Your application requires processing extremely long documents in a single pass — research papers, legal contracts, technical manuals over 200 pages
- You need retrieval accuracy at depth — Moonshot AI architecture maintains comprehension quality even in the middle and tail of very long contexts, where most models experience degradation
- You are building document-heavy AI workflows — due diligence analysis, academic research tools, compliance document review
- Cost is a primary concern for high-volume document processing — Moonshot K2 at $0.22/1M input is 19% cheaper than DeepSeek V3 and 91% cheaper than GPT-4o
When to Choose DeepSeek V3 or GPT-4o
- Coding and software development is your primary use case — DeepSeek V3 leads by a significant margin on HumanEval and other coding benchmarks
- You need state-of-the-art general reasoning — GPT-4o and DeepSeek R1 outperform Moonshot K2 on complex multi-step reasoning and math tasks
- Your application is English-only and you want maximum benchmark performance — GPT-4o remains the overall leader on standard evaluations
- You need multimodal vision capabilities — DeepSeek V3 lacks vision entirely, but GPT-4o offers strong image understanding support
According to comparative analysis from independent benchmarks, Moonshot K2's standout advantage is not raw benchmark score on standard tests, but sustained performance as context length increases. While GPT-4o and DeepSeek V3 show measurable degradation when retrieving information from the middle of a 100K+ token context, Moonshot AI models maintain more consistent accuracy across the full context window.
Key insight: Moonshot K2 is not trying to beat GPT-4o or DeepSeek V3 on every metric — it is building a moat around long-context document understanding. If your workflow involves 50+ page documents, legal contracts, academic papers, or multi-file codebase analysis, Moonshot K2 is likely the best model for the job regardless of its lower scores on generic benchmarks. For short-context chat and coding, use DeepSeek V3. For general English tasks, use GPT-4o. For long documents, use Moonshot K2.
How to Access Moonshot AI / Kimi API from Overseas
The primary barrier for overseas developers wanting to use Moonshot AI is the same as for most Chinese LLM platforms: direct registration on kimi.moonshot.cn requires a Chinese phone number and a Chinese payment method. Here are the practical approaches:
Method 1: TokenPAPA (Recommended — Fastest Setup)
TokenPAPA provides Moonshot AI API access to overseas developers without any Chinese phone verification, Chinese ID, or local payment method. You get a standard OpenAI-compatible endpoint with a single API key.
Setup time: Under 3 minutes
- Visit tokenpapa.ai and create an account with your email
- Add funds using a US credit card, international card, or PayPal
- Generate an API key from the dashboard (starts with
tp-sk-) - Use the endpoint
https://api.tokenpapa.ai/v1with any OpenAI-compatible client
Available Moonshot AI models via TokenPAPA:
| Model ID | Description |
|---|---|
moonshot-k2 | Moonshot K2 — flagship long-context model (128K context) |
moonshot-v1-128k | Moonshot v1-128k — stable production-grade model |
moonshot-v1-32k | Moonshot v1-32k — lighter, faster for standard tasks |
Method 2: Direct Moonshot AI Registration
You can register directly on Moonshot AI's platform (kimi.moonshot.cn or open.moonshot.cn). However, this path has significant hurdles for overseas developers:
- Visit the Moonshot AI developer platform and create an account
- Verify with a Chinese phone number — international numbers are not accepted
- Add a Chinese payment method — Alipay, WeChat Pay, or Chinese bank card
- Navigate the console — the interface is primarily in Chinese with limited English support
Drawbacks: The registration barrier is substantial. Chinese phone numbers are difficult to obtain overseas. Billing requires Chinese payment infrastructure. Customer support operates during Chinese business hours. For most overseas developers, the direct path is impractical.
Method 3: Access via TokenPAPA Multi-Model Platform
One of the strongest advantages of using TokenPAPA is that the same API key and endpoint give you access to multiple Chinese LLMs — not just Moonshot AI. You can switch between Moonshot K2, DeepSeek V3, DeepSeek R1, Qwen 2.5, GLM-4, MiniMax, and others by changing only the model parameter in your API calls. This makes multi-model routing strategies trivially simple to implement.
Code Examples: Using Moonshot AI API via TokenPAPA
The Moonshot AI API via TokenPAPA is fully OpenAI-compatible. Any existing OpenAI SDK code works by simply changing the base URL and API key.
Python: Basic Chat with Moonshot K2
from openai import OpenAI
# Configure the client with TokenPAPA endpoint
client = OpenAI(
api_key="tp-sk-your-api-key-here",
base_url="https://api.tokenpapa.ai/v1"
)
# Moonshot K2 — General Chat
response = client.chat.completions.create(
model="moonshot-k2",
messages=[
{"role": "system", "content": "You are a helpful assistant specialized in document analysis."},
{"role": "user", "content": "Explain how Moonshot AI's long-context architecture differs from standard transformer models."}
],
temperature=0.7,
max_tokens=1000
)
print(response.choices[0].message.content)Python: Streaming Response
from openai import OpenAI
client = OpenAI(
api_key="tp-sk-your-api-key-here",
base_url="https://api.tokenpapa.ai/v1"
)
# Streaming chat with Moonshot K2
stream = client.chat.completions.create(
model="moonshot-k2",
messages=[
{"role": "user", "content": "Write a detailed analysis of the advantages of long-context LLMs for legal document review."}
],
stream=True,
max_tokens=500
)
for chunk in stream:
if chunk.choices[0].delta.content is not None:
print(chunk.choices[0].delta.content, end="")Python: Long-Document Analysis (The Killer Use Case)
This is where Moonshot AI truly shines — processing and analyzing documents that would overwhelm most other models:
from openai import OpenAI
client = OpenAI(
api_key="tp-sk-your-api-key-here",
base_url="https://api.tokenpapa.ai/v1"
)
# Load a very long document (e.g., research paper, legal contract, technical manual)
with open("long_document.txt", "r") as f:
long_text = f.read()
print(f"Document length: {len(long_text)} characters")
# Moonshot K2 — Long-Context Summarization
response = client.chat.completions.create(
model="moonshot-k2",
messages=[
{
"role": "system",
"content": (
"You are an expert document analyst. Your task is to provide "
"a comprehensive yet concise analysis of the provided document. "
"Include: (1) executive summary, (2) key findings, "
"(3) important details by section, and (4) potential issues or gaps."
)
},
{
"role": "user",
"content": f"Please analyze the following document:\n\n{long_text}"
}
],
max_tokens=2000,
temperature=0.3
)
print("=== Document Analysis ===")
print(response.choices[0].message.content)Python: Multi-Turn Conversation with Long-Context Memory
One of the most powerful features of Moonshot AI is maintaining coherent multi-turn conversations across very long exchanges:
from openai import OpenAI
client = OpenAI(
api_key="tp-sk-your-api-key-here",
base_url="https://api.tokenpapa.ai/v1"
)
# Start a conversation about a long document
messages = [
{"role": "system", "content": "You are a research assistant. Maintain context across the entire conversation."},
{"role": "user", "content": "I am analyzing a 150-page financial report. Here is the executive summary section: [pasted 5000 tokens of text]. What are the top 3 risks mentioned?"}
]
# First query
response = client.chat.completions.create(
model="moonshot-k2",
messages=messages,
temperature=0.3,
max_tokens=800
)
assistant_reply = response.choices[0].message.content
print(f"Q1 Analysis: {assistant_reply[:200]}...\n")
# Follow-up — model remembers the full document context
messages.append({"role": "assistant", "content": assistant_reply})
messages.append({"role": "user", "content": "For each risk you identified, what mitigation strategies does the report recommend?"})
response = client.chat.completions.create(
model="moonshot-k2",
messages=messages,
temperature=0.3,
max_tokens=1000
)
print(f"Q2 Analysis: {response.choices[0].message.content[:200]}...")Python: Batch Document Processing
from openai import OpenAI
import time
client = OpenAI(
api_key="tp-sk-your-api-key-here",
base_url="https://api.tokenpapa.ai/v1"
)
documents = [
"contract_1.txt",
"contract_2.txt",
"research_paper.txt",
]
def analyze_document(filepath: str) -> str:
"""Analyze a document using Moonshot K2 long-context capabilities."""
with open(filepath, "r") as f:
content = f.read()
response = client.chat.completions.create(
model="moonshot-k2",
messages=[
{
"role": "system",
"content": "Analyze this document. Provide key points, findings, and recommendations."
},
{
"role": "user",
"content": f"Analyze:\n\n{content[:50000]}" # Send first ~50K tokens
}
],
max_tokens=1000,
temperature=0.3
)
return response.choices[0].message.content
# Process documents sequentially
for doc in documents:
print(f"Analyzing {doc}...")
analysis = analyze_document(doc)
print(f"Analysis complete: {analysis[:100]}...\n")
time.sleep(0.5) # Rate limitingcURL: Quick Test
# Moonshot K2 Chat
curl https://api.tokenpapa.ai/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer tp-sk-...key" \
-d '{
"model": "moonshot-k2",
"messages": [
{"role": "user", "content": "What is Moonshot AI and what makes Kimi different from ChatGPT?"}
],
"temperature": 0.7,
"max_tokens": 500
}'
# Moonshot K2 Long-Context Analysis
curl https://api.tokenpapa.ai/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer tp-sk-...key" \
-d '{
"model": "moonshot-k2",
"messages": [
{"role": "system", "content": "You are a document analysis expert."},
{"role": "user", "content": "Summarize this document in 3 bullet points: $(cat my_report.txt)"}
],
"max_tokens": 500,
"temperature": 0.3
}'Key Integrations
The Moonshot AI API integrates seamlessly with popular developer tools via the OpenAI-compatible interface:
| Tool/Platform | Setup | Notes |
|---|---|---|
| LangChain | Set base_url to https://api.tokenpapa.ai/v1 | Full support for chains, agents, tools |
| LlamaIndex | Change OpenAI base URL | Works with all RAG patterns |
| Vercel AI SDK | Set baseURL in provider config | Streaming and edge support |
| Open WebUI | Add as OpenAI-compatible provider | Chat interface for Moonshot models |
| Continue.dev | Add model config in config.json | IDE code assistant integration |
Moonshot AI in the Chinese LLM Ecosystem
To help you understand where Moonshot AI fits in the broader Chinese LLM landscape, here is a comparison with other Chinese models available through TokenPAPA:
| Chinese LLM | Developer | Input/Output per 1M tokens | Key Strength | Best Use Case |
|---|---|---|---|---|
| Moonshot K2 | Moonshot AI | $0.22 / $0.88 | Long-context, document analysis | Research, legal, document review |
| DeepSeek V3 | DeepSeek | $0.27 / $1.10 | Coding, reasoning | Developer tools, code assistants |
| DeepSeek R1 | DeepSeek | $0.55 / $2.19 | Chain-of-thought reasoning | Complex logic, math problems |
| Qwen 2.5 72B | Alibaba | $0.18 / $0.72 | Multilingual, instruction following | General-purpose with Asian language support |
| GLM-4 | Zhipu AI | $0.15 / $0.60 | Bilingual, cost efficiency | Chinese-English translation, classification |
| MiniMax Text-01 | MiniMax | $0.20 / $1.10 | Long context (256K), creative writing | Long-form content, storytelling |
Moonshot AI occupies a unique position as the long-context specialist among Chinese LLMs. While DeepSeek V3 leads on coding and Qwen 2.5 leads on multilingual performance, Moonshot K2 is the best choice for workflows that demand sustained comprehension across very long documents and conversations.
Multi-Model Strategy: Routing with Moonshot K2 and Other Chinese LLMs
The most cost-effective approach for production applications is to route different query types to the optimal model. Here is a recommended strategy using models available through TokenPAPA:
from openai import OpenAI
client = OpenAI(
api_key="tp-sk-your-api-key",
base_url="https://api.tokenpapa.ai/v1"
)
def route_query(task_type: str, prompt: str) -> str:
"""Route a query to the optimal model based on task type."""
model_map = {
"long_doc": "moonshot-k2", # Best long-context analysis
"research": "moonshot-k2", # Document-heavy research
"legal": "moonshot-k2", # Legal document review
"coding": "deepseek-v3", # Best coding performance
"reasoning": "deepseek-r1", # Best complex reasoning
"chat": "qwen-2.5-72b", # Best general-purpose chat
"translate": "glm-4", # Best bilingual translation
"creative": "minimax-text-01", # Good at creative writing
}
model = model_map.get(task_type, "deepseek-v3")
response = client.chat.completions.create(
model=model,
messages=[{"role": "user", "content": prompt}],
max_tokens=1000,
temperature=0.7
)
return response.choices[0].message.content
# Example usage
print(route_query("long_doc", "Summarize this 300-page technical manual and extract key specifications."))
print(route_query("legal", "Review this 50-page contract for unfavorable terms in sections 12-18."))
print(route_query("research", "Analyze the methodology section of this paper and identify potential flaws."))
print(route_query("coding", "Write a Python script that extracts tables from PDF documents."))This multi-model routing approach typically achieves 40-60% cost savings compared to using a single premium model like GPT-4o, while matching or exceeding quality across different task types by leveraging each model's unique strengths.
Use Cases for Moonshot AI Long-Context Capabilities
Moonshot AI's long-context architecture unlocks several use cases that are difficult or expensive with other models:
Legal Document Analysis
Entire legal case files, contracts, and regulatory documents can be processed in a single pass. Moonshot K2 maintains coherent understanding across 100+ page contracts, identifying cross-references, conflicting clauses, and compliance issues that shorter-context models would miss.
Academic Research
Research papers (including full paper with references, appendices, and supplementary materials) can be analyzed holistically. Moonshot K2 can compare methodology across multiple sections, evaluate conclusions against supporting data, and generate comprehensive literature review summaries.
Financial Due Diligence
Annual reports, prospectuses, and financial filings running hundreds of pages can be analyzed in one query. The model can extract financial metrics, identify risk factors, and cross-reference information across different sections of the document.
Technical Documentation
Product manuals, API documentation, and technical specifications spanning hundreds of pages can be indexed and queried conversationally. This is particularly valuable for enterprise knowledge management and customer support automation.
Codebase Analysis
Entire codebases or multi-file projects can be loaded into context for holistic code review, architecture analysis, and refactoring suggestions. While DeepSeek V3 may write better individual functions, Moonshot K2 can understand how all the pieces fit together across a large repository.
Frequently Asked Questions
1. Can I access Moonshot AI / Kimi API from overseas without a Chinese phone?
Yes. The easiest method is through an API relay platform like TokenPAPA, which provides Moonshot AI API access with no phone verification. You sign up with your email, fund your account with a US credit card or PayPal, and get your API key in minutes. Direct registration on Moonshot AI's platform requires a Chinese phone number and a Chinese payment method.
2. How long is the Moonshot AI context window?
Moonshot AI models support a 128K token native context window — equivalent to roughly 200 pages of text. This matches GPT-4o and DeepSeek V3. Some experimental configurations support up to 1 million tokens (1M), making Moonshot AI one of the most capable models for extreme long-context tasks. The 128K window is sufficient for most production use cases including entire legal case files, full research papers, and large codebases.
3. What is the difference between Moonshot AI and Kimi?
Moonshot AI is the company that develops the underlying large language models (Moonshot K2, Moonshot v1). Kimi (kimi.moonshot.cn) is their consumer-facing AI assistant product — similar to how OpenAI develops GPT models and offers ChatGPT as a consumer product. The API provides direct programmatic access to the underlying Moonshot AI models without going through the Kimi interface.
4. How does Moonshot K2 compare to GPT-4o and DeepSeek V3?
Moonshot K2 is competitive with both models on long-context tasks, where it leads due to its architecture being optimized for extended sequences. On standard benchmarks like MMLU (83% vs 89% for GPT-4o), HumanEval (80% vs 92% for DeepSeek V3), and GSM8K, Moonshot K2 trails the leaders. However, for document analysis, legal review, and research workflows that require sustained comprehension across very long inputs, Moonshot K2 is the strongest option. At $0.22/1M input tokens, it is also cheaper than both alternatives.
5. What is the pricing for Moonshot AI API via TokenPAPA?
Moonshot K2 costs $0.22 per 1M input tokens and $0.88 per 1M output tokens via TokenPAPA. Moonshot v1-128k costs $0.18/1M input and $0.72/1M output. This is significantly cheaper than GPT-4o ($2.50/1M input, 91% savings) and slightly cheaper than DeepSeek V3 ($0.27/1M input, 19% savings). TokenPAPA accepts US credit cards, international cards, and PayPal — no Chinese payment method needed.
6. Is Moonshot AI API compatible with the OpenAI Python SDK?
Yes — fully compatible. Any existing OpenAI client code works by changing the base_url to https://api.tokenpapa.ai/v1 and swapping the API key. The same applies to LangChain, LlamaIndex, Vercel AI SDK, and any other tool that supports OpenAI-compatible APIs. Switching from Moonshot K2 to DeepSeek V3 or any other Chinese LLM requires changing only the model parameter.
7. What Chinese LLM models are available besides Moonshot AI via TokenPAPA?
TokenPAPA provides access to DeepSeek V3, DeepSeek R1, Qwen 2.5 (72B, Coder 32B, Math 72B), GLM-4 (including GLM-4V and GLM-4 32K), MiniMax Text-01, and Moonshot AI — all through a single API key and the same OpenAI-compatible endpoint at https://api.tokenpapa.ai/v1.
Conclusion
Moonshot AI represents a unique and valuable addition to the Chinese LLM ecosystem for overseas developers. While it may not top every benchmark, its laser focus on long-context capability fills a critical gap that neither GPT-4o nor DeepSeek V3 covers as effectively. For developers building document analysis, legal review, academic research, financial due diligence, or any application that processes long-form content, Moonshot K2 is the specialist choice.
Here is the summary:
- Moonshot K2 is the best Chinese LLM for long-document analysis at 128K native context (up to 1M experimental)
- Access via TokenPAPA — no Chinese phone needed, US credit cards accepted, single API key for the entire Moonshot AI family
- At $0.22/1M input tokens, Moonshot K2 is 19% cheaper than DeepSeek V3 and 91% cheaper than GPT-4o
- Long-context architecture maintains comprehension quality across very long documents where other models degrade
- Multi-model routing — combine Moonshot K2 for documents with DeepSeek V3 for coding and Qwen 2.5 for chat, all through one TokenPAPA API key
- OpenAI-compatible — any existing SDK client works with a simple base URL change
Whether you are building a document analysis pipeline, a legal research tool, an academic literature review system, or a financial due diligence platform, Moonshot AI deserves a place in your AI toolkit — and getting started takes just 3 minutes with a single relay platform account.
Ready to try Moonshot AI API from overseas? Sign up at tokenpapa.ai — no Chinese phone required, US credit cards accepted, and you will have access to Moonshot K2, DeepSeek, Qwen, GLM-4, MiniMax, and more in under 3 minutes.
Sources:
- Moonshot AI Official Platform: https://kimi.moonshot.cn [accessed June 2026]
- Moonshot AI Developer Documentation: https://open.moonshot.cn [accessed June 2026]
- LMSYS Chatbot Arena: https://chat.lmsys.org [accessed June 2026]
- Open LLM Leaderboard (Hugging Face): https://huggingface.co/spaces/open-llm-leaderboard [accessed June 2026]
- TokenPAPA API Reference: https://tokenpapa.ai/docs [accessed June 2026]
- DeepSeek V3 Technical Report: https://arxiv.org/abs/2412.19437 [accessed June 2026]
这篇文档对您有帮助吗?
最后更新于
