模型

GPT-5.2 Pro
O

GPT-5.2 Pro

20% 优惠
上下文:400,000
输入:$21/M
输出:$168/M
gpt-5.2-pro is the highest-capability, production-oriented member of OpenAI’s GPT-5.2 family, exposed through the Responses API for workloads that demand maximal fidelity, multi-step reasoning, extensive tool use and the largest context/throughput budgets OpenAI offers.
O

GPT-5.2 Chat

O

GPT-5.2 Chat

上下文:128,000
输入:$1.75/M
输出:$14/M
gpt-5.2-chat-latest is the Chat-optimized snapshot of OpenAI’s GPT-5.2 family (branded in ChatGPT as GPT-5.2 Instant). It is the model for interactive/chat use cases that need a blend of speed, long-context handling, multimodal inputs and reliable conversational behaviour.
GPT-5.2
O

GPT-5.2

上下文:400,000
输入:$1.75/M
输出:$14/M
GPT-5.2 is a multi-flavored model suite (Instant, Thinking, Pro) engineered for better long-context understanding, stronger coding and tool use, and materially higher performance on professional “knowledge-work” benchmarks.
O

GPT-5.1 Chat

O

GPT-5.1 Chat

上下文:400.0k
输入:$1.25/M
输出:$10/M
GPT-5.1 Chat is an instruction-tuned conversational language model for general-purpose chat, reasoning, and writing. It supports multi-turn dialogue, summarization, drafting, knowledge-base QA, and lightweight code assistance for in-app assistants, support automation, and workflow copilots. Technical highlights include chat-optimized alignment, controllable and structured outputs, and integration paths for tool invocation and retrieval workflows when available.
O

GPT-5.1

O

GPT-5.1

输入:$1.25/M
输出:$10/M
GPT-5.1 is a general-purpose instruction-tuned language model focused on text generation and reasoning across product workflows. It supports multi-turn dialogue, structured output formatting, and code-oriented tasks such as drafting, refactoring, and explanation. Typical uses include chat assistants, retrieval-augmented QA, data transformation, and agent-style automation with tools or APIs when supported. Technical highlights include text-centric modality, instruction following, JSON-style outputs, and compatibility with function calling in common orchestration frameworks.
D

Doubao Seedream 4-5

D

Doubao Seedream 4-5

每次请求:$0.04
Seedream 4.5 is ByteDance/Seed’s multimodal image model (text→image + image editing) that focuses on production-grade image fidelity, stronger prompt adherence, and much-improved editing consistency (subject preservation, text/typography rendering, and facial realism).
F

FLUX 2 PRO

F

FLUX 2 PRO

免费
每次请求:$0.1
FLUX 2 PRO is the flagship commercial model in the FLUX 2 series, delivering state-of-the-art image generation with unprecedented quality and detail. Built for professional and enterprise applications, it offers superior prompt adherence, photorealistic outputs, and exceptional artistic capabilities. This model represents the cutting edge of AI image synthesis technology.
F

FLUX 2 FLEX

F

FLUX 2 FLEX

免费
每次请求:$0.01
FLUX 2 FLEX is the versatile, adaptable model designed for flexible deployment across various use cases and hardware configurations. It offers scalable performance with adjustable quality settings, making it ideal for applications requiring dynamic resource allocation. This model provides the best balance between quality, speed, and resource efficiency.
R

Black Forest Labs/FLUX 2 PRO

R

Black Forest Labs/FLUX 2 PRO

每次请求:$0.075
FLUX 2 PRO is the flagship commercial model in the FLUX 2 series, delivering state-of-the-art image generation with unprecedented quality and detail. Built for professional and enterprise applications, it offers superior prompt adherence, photorealistic outputs, and exceptional artistic capabilities. This model represents the cutting edge of AI image synthesis technology.
R

Black Forest Labs/FLUX 2 FLEX

R

Black Forest Labs/FLUX 2 FLEX

每次请求:$0.24
FLUX 2 FLEX is the versatile, adaptable model designed for flexible deployment across various use cases and hardware configurations. It offers scalable performance with adjustable quality settings, making it ideal for applications requiring dynamic resource allocation. This model provides the best balance between quality, speed, and resource efficiency.
R

Black Forest Labs/FLUX 2 DEV

R

Black Forest Labs/FLUX 2 DEV

每次请求:$0.075
FLUX 2 DEV is the development-friendly version optimized for research, experimentation, and non-commercial applications. It provides developers with powerful image generation capabilities while maintaining a balance between quality and computational efficiency. Perfect for prototyping, academic research, and personal creative projects.
G

Veo 3.1 Pro

G

Veo 3.1 Pro

每秒:$0.3125
Veo 3.1-Pro refers to the high-capability access/configuration of Google’s Veo 3.1 family — a generation of short-form, audio-enabled video models that add richer native audio, improved narrative/editing controls and scene-extension tools.
G

Veo 3.1

G

Veo 3.1

每秒:$0.0625
Veo 3.1 is Google’s incremental-but-significant update to its Veo text-and-image→video family, adding richer native audio, longer and more controllable video outputs, and finer editing and scene-level controls.
G

Veo 3 Pro

G

Veo 3 Pro

每秒:$0.3125
Veo 3 pro denotes the production-grade Veo 3 video model experience (high fidelity, native audio, and extended tooling)
G

Veo 3 Fast

G

Veo 3 Fast

每秒:$0.0625
Veo 3 Fast is Google’s speed-optimized variant of the Veo family of generative video models (Veo 3 / Veo 3.1 etc.). It is engineered to produce short, high-quality video clips with natively generated audio while prioritizing throughput and cost per second—trading some top-end visual fidelity and/or longer single-shot duration for much faster generation and lower price. What is Veo 3 Fast — concise introduction
G

Veo 3

G

Veo 3

每秒:$0.0625
Google DeepMind’s Veo 3 represents the cutting edge of text-to-video generation, marking the first time a large-scale generative AI model seamlessly synchronizes high-fidelity video with accompanying audio—including dialogue, sound effects, and ambient soundscapes.
O

GPT Image 1.5

O

GPT Image 1.5

输入:$8/M
输出:$16/M
GPT-Image-1.5 is OpenAI’s image model in the GPT Image family . It is a natively multimodal GPT model designed to generate images from text prompts and to perform high-fidelity edits of input images while following user instructions closely.
G

Gemini 2.5 Flash

G

Gemini 2.5 Flash

上下文:1M
输入:$0.3/M
输出:$7/M
Gemini 2.5 Flash is an AI model developed by Google, designed to provide fast and cost-effective solutions for developers, especially for applications requiring enhanced Inference capabilities. According to the Gemini 2.5 Flash preview announcement, the model was released in preview on April 17, 2025, supports Multimodal input, and has a context window of 1 million tokens. This model supports a maximum context length of 65,536 tokens.
G

Nano Banana

G

Nano Banana

每次请求:$0.039
Gemini 2.5 Flash Image (aka nano-banana), Google's most advanced image generation and editing model. This update enables you to blend multiple images into a single one, maintain character consistency to tell rich stories, perform targeted transformations using natural language, and leverage Gemini's world knowledge to generate and edit images.
G

Gemini 2.5 Pro DeepSearch

G

Gemini 2.5 Pro DeepSearch

输入:$10/M
输出:$80/M
Deep search model, with enhanced deep search and information retrieval capabilities, an ideal choice for complex knowledge integration and analysis.
G

Gemini 2.5 Pro (All)

G

Gemini 2.5 Pro (All)

输入:$1.25/M
输出:$2.5/M
Gemini 2.5 Pro (All) is a multimodal model for text and media understanding, designed for general-purpose assistants and grounded reasoning. It handles instruction following, analytical writing, code comprehension, and image/audio understanding with reliable tool/function calling and RAG-friendly behavior. Typical uses include enterprise chat agents, document and UI analysis, visual question answering, and workflow automation. Technical highlights include unified image‑text‑audio inputs, long-context support, structured JSON output, streaming responses, and system-instruction control.
G

Gemini 2.5 Flash DeepSearch

G

Gemini 2.5 Flash DeepSearch

输入:$6/M
输出:$50/M
Deep search model, with enhanced deep search and information retrieval capabilities, an ideal choice for complex knowledge integration and analysis.
G

Gemini 2.5 Flash (All)

G

Gemini 2.5 Flash (All)

输入:$0.3/M
输出:$0.6/M
Gemini 2.5 Flash is an AI model developed by Google, designed to provide fast and cost-effective solutions for developers, especially for applications requiring enhanced Inference capabilities. According to the Gemini 2.5 Flash preview announcement, the model was released in preview on April 17, 2025, supports Multimodal input, and has a context window of 1 million tokens. This model supports a maximum context length of 65,536 tokens.
G

Gemini 2.0 Flash Lite

G

Gemini 2.0 Flash Lite

输入:$0.1/M
输出:$0.4/M
Gemini 2.0 Flash Lite is a compact, instruction-tuned multimodal model optimized for low-latency, high-throughput inference. It handles text and image understanding, summarization, classification, and lightweight reasoning, with tool/function calling and structured output control. Typical uses include conversational agents, rapid content drafting, metadata extraction from documents or screenshots, and retrieval-augmented workflows. Technical highlights include text-image inputs, streaming generation, function/tool calling, and deployment options suited to latency-sensitive services.
X

Grok Code Fast 1

X

Grok Code Fast 1

上下文:256K
输入:$0.2/M
输出:$0.8/M
Grok Code Fast 1 is an AI programming model launched by xAI, designed for fast and efficient basic coding tasks. The model can process 92 tokens per second, has a 256k context window, and is suitable for rapid prototyping, code debugging, and generating simple visual elements.
X

Grok 3 DeepSearch

X

Grok 3 DeepSearch

输入:$3/M
输出:$12/M
Grok-3 deep networked search model. This model supports a maximum context length of 100,000 tokens.
X

Grok 3 DeeperSearch

X

Grok 3 DeeperSearch

输入:$3/M
输出:$12/M
Grok-3 deep networked search model, superior to grok-3-deepsearch. This model supports a maximum context length of 100,000 tokens.
G

Nano Banana Pro

G

Nano Banana Pro

输入:$1.952/M
输出:$11.712/M
Nano Banana Pro is an AI model for general-purpose assistance in text-centric workflows. It is suitable for instruction-style prompting to generate, transform, and analyze content with controllable structure. Typical uses include chat assistants, document summarization, knowledge QA, and workflow automation. Public technical details are limited; integration aligns with common AI assistant patterns such as structured outputs, retrieval-augmented prompts, and tool or function calling.
O

GPT-5 nano

O

GPT-5 nano

上下文:400K
输入:$0.05/M
输出:$0.4/M
GPT-5 Nano is an artificial intelligence model provided by OpenAI.
O

GPT-5 mini

O

GPT-5 mini

上下文:400K
输入:$0.25/M
输出:$2/M
GPT-5 mini is OpenAI’s cost- and latency-optimized member of the GPT-5 family, intended to deliver much of GPT-5’s multimodal and instruction-following strengths at substantially lower cost for large-scale production use. It targets environments where throughput, predictable per-token pricing, and fast responses are the primary constraints while still providing strong general-purpose capabilities.