Skip to content

Currency and Billing Information

Currency and Billing Information

This site uniformly uses US dollars for billing, with a USD to CNY exchange rate of 1:7.3.

Pricing

For specific pricing, please check the pricing page. Discounts vary for different channel sources, please check user benefits for details. The final price is: Model Price × Channel Discount × User Discount.

User Benefits

User benefits refer to the privileges users enjoy when using the service, including request rates, discounts, etc. Currently, user benefits are automatically upgraded based on cumulative recharge amount. For details, please check the benefits page. If you have higher requirements for rates or need enterprise-level services, please contact our customer service.

Model Billing

Model billing is divided into the following types:

Per-Request Billing

Per-request billing means charging once for each request.

  • Billing Formula: Model Input Price × Channel Discount × User Discount

Token-Based Billing

Token-based billing charges based on the number of tokens used in the request.

  • Billing Formula: Cost = (Input tokens / 1000 × Input Price) + (Output tokens / 1000 × Output Price) Final Charge = Cost × Channel Discount × User Discount

Per-Second Billing

Per-second billing generally appears in video generation models.

  • Billing Formula: Cost = Input Price × Seconds Final Charge = Cost × Channel Discount × User Discount

Per-Megapixel Billing

Per-megapixel billing generally appears in image generation models.

  • Billing Formula: Cost = (Width × Height / 1000000) × Input Price Final Charge = Cost × Channel Discount × User Discount

Per-Image Billing

Per-image billing generally appears in image generation models, charging based on the number of generated images.

  • Billing Formula: Cost = Number of Images × Input Price Final Charge = Cost × Channel Discount × User Discount

Other Billing (Cache/Audio/Inference Additional Charges)

If the model supports cache/audio/inference, etc., these requests may also be charged. The system's billing method is: charging based on multiples of input/output prices.

  • Core Calculation Formula: Input/Output tokens = Input/Output tokens + ((Cache/Audio/Inference tokens) × (Multiple - 1))
  • Subsequent Billing: Based on the adjusted actual input/output tokens, calculate according to the corresponding model's billing method (such as token-based billing).

Billing Items and Corresponding Relationships

Billing ItemCorresponding RelationshipDescription
Cache TokensInput tokens × RateThis value appears when automatic caching is triggered
Input Audio TokensInput tokens × RateIf applicable, this value appears when inputting audio
Output Audio TokensOutput tokens × RateIf applicable, this value appears when outputting audio
Input Image TokensInput tokens × RateIf applicable, this value appears when inputting images
Output Image TokensOutput tokens × RateIf applicable, this value appears when outputting images
Cache Write 5M TokensInput tokens × RateGenerally exists in Claude models
Cache Write 1H TokensInput tokens × RateGenerally exists in Claude models
Cache Read TokensInput tokens × RateIf applicable, this value appears when cache is triggered
Inference TokensOutput tokens × RateBilling for some Gemini models

Field Multipliers

Some models adjust prices based on request field values, generally existing in video models.

For example: The Veo model can use priority pay go. By adding a header when making requests, you can get priority queuing, with each request price ×1.8.

This documentation is licensed under CC BY-SA 4.0.