Currency and Billing Information
Currency and Billing Information
This site uniformly uses US dollars for billing, with a USD to CNY exchange rate of 1:7.3.
Pricing
For specific pricing, please check the pricing page. Discounts vary for different channel sources, please check user benefits for details. The final price is: Model Price × Channel Discount × User Discount.
User Benefits
User benefits refer to the privileges users enjoy when using the service, including request rates, discounts, etc. Currently, user benefits are automatically upgraded based on cumulative recharge amount. For details, please check the benefits page. If you have higher requirements for rates or need enterprise-level services, please contact our customer service.
Model Billing
Model billing is divided into the following types:
Per-Request Billing
Per-request billing means charging once for each request.
- Billing Formula: Model Input Price × Channel Discount × User Discount
Token-Based Billing
Token-based billing charges based on the number of tokens used in the request.
- Billing Formula: Cost = (Input tokens / 1000 × Input Price) + (Output tokens / 1000 × Output Price) Final Charge = Cost × Channel Discount × User Discount
Per-Second Billing
Per-second billing generally appears in video generation models.
- Billing Formula: Cost = Input Price × Seconds Final Charge = Cost × Channel Discount × User Discount
Per-Megapixel Billing
Per-megapixel billing generally appears in image generation models.
- Billing Formula: Cost = (Width × Height / 1000000) × Input Price Final Charge = Cost × Channel Discount × User Discount
Per-Image Billing
Per-image billing generally appears in image generation models, charging based on the number of generated images.
- Billing Formula: Cost = Number of Images × Input Price Final Charge = Cost × Channel Discount × User Discount
Other Billing (Cache/Audio/Inference Additional Charges)
If the model supports cache/audio/inference, etc., these requests may also be charged. The system's billing method is: charging based on multiples of input/output prices.
- Core Calculation Formula: Input/Output tokens = Input/Output tokens + ((Cache/Audio/Inference tokens) × (Multiple - 1))
- Subsequent Billing: Based on the adjusted actual input/output tokens, calculate according to the corresponding model's billing method (such as token-based billing).
Billing Items and Corresponding Relationships
| Billing Item | Corresponding Relationship | Description |
|---|---|---|
| Cache Tokens | Input tokens × Rate | This value appears when automatic caching is triggered |
| Input Audio Tokens | Input tokens × Rate | If applicable, this value appears when inputting audio |
| Output Audio Tokens | Output tokens × Rate | If applicable, this value appears when outputting audio |
| Input Image Tokens | Input tokens × Rate | If applicable, this value appears when inputting images |
| Output Image Tokens | Output tokens × Rate | If applicable, this value appears when outputting images |
| Cache Write 5M Tokens | Input tokens × Rate | Generally exists in Claude models |
| Cache Write 1H Tokens | Input tokens × Rate | Generally exists in Claude models |
| Cache Read Tokens | Input tokens × Rate | If applicable, this value appears when cache is triggered |
| Inference Tokens | Output tokens × Rate | Billing for some Gemini models |
Field Multipliers
Some models adjust prices based on request field values, generally existing in video models.
For example: The Veo model can use priority pay go. By adding a header when making requests, you can get priority queuing, with each request price ×1.8.