Flexible pay-as-you-go credit system for LLM token consumption

Flexible Payments

Purchase credits in advance and use them across any service or model. Credits never expire.

Efficient Storage*

Pay only for the storage you use. Indexing improves search speed and accuracy.

*Storage usage counts uploaded files and conversations. Each user has a free tier of 25 GB of storage. Additional charges apply for usage beyond this limit.
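As a rough illustration of the free-tier rule above, the sketch below computes how much storage falls beyond the 25 GB allowance. The function name is hypothetical, and no per-GB rate is applied because this page does not state one.

```python
FREE_TIER_GB = 25  # free storage allowance per user, as stated above


def billable_storage_gb(used_gb: float) -> float:
    """Return the storage (in GB) that exceeds the free tier.

    Hypothetical helper: it only reports the chargeable amount,
    since the page does not list a price for extra storage.
    """
    return max(0.0, used_gb - FREE_TIER_GB)


if __name__ == "__main__":
    for used in (18.5, 25, 40):
        print(f"{used} GB used, {billable_storage_gb(used)} GB beyond the free tier")
```

Usage at or below 25 GB yields zero chargeable storage; only the excess is counted.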

Scalable Compute*

Select from a range of compute instances with varying CPU and memory configurations, and choose temporary or permanent instances to suit your computational needs.

*Charges apply only to compute instances that are deployed permanently, or when usage exceeds the free tier limits.

Token Consumption and Cost
  • Each time you interact with a language model or agent in Vitral, the system calculates the tokens consumed in real time and deducts the corresponding cost from your credits. The cost per token depends on the specific model and provider, as shown in the following tables.
  • You can purchase credits from your Vitral account dashboard. Credits apply to any service or model in Vitral and do not expire.
| Model | Input Token | Output Token | Cache Token |
| --- | --- | --- | --- |
| Llama 3.3 | $0.00071 | $0.00071 | N/A |
| Llama 4 Scout | $0.00020 | $0.00078 | N/A |
| Llama 4 Maverick | $0.00141 | $0.00035 | N/A |

| Model | Input Token | Output Token | Cache Token |
| --- | --- | --- | --- |
| Gemini 2.5 Lite | $0.00010 | $0.00040 | $0.00003 |
| Gemini 2.5 Flash | $0.00030 | $0.00250 | $0.00007 |
| Gemini 2.5 Pro | $0.00031 | $0.01000 | N/A |

| Model | Input Token | Output Token | Cache Token |
| --- | --- | --- | --- |
| DeepSeek R1 | $0.00055 | $0.00219 | $0.00014 |
| DeepSeek V3 | $0.00014 | $0.00028 | $0.00001 |

| Model | Input Token | Output Token | Cache Token |
| --- | --- | --- | --- |
| Mistral Small | $0.00010 | $0.00030 | N/A |
| Mistral Large 2 (2407) | $0.00200 | $0.00600 | N/A |
| Pixtral Large | $0.00200 | $0.00600 | N/A |

| Model | Input Token | Output Token | Cache Token |
| --- | --- | --- | --- |
| Qwen 3 | $0.00070 | $0.00280 | N/A |

| Model | Input Token | Output Token | Cache Token |
| --- | --- | --- | --- |
| Grok 3 | $0.00300 | $0.01500 | $0.00075 |
| Grok 3 Mini | $0.00030 | $0.00050 | $0.00075 |
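
The per-interaction deduction described under Token Consumption and Cost can be sketched as follows. This is a minimal illustration, not Vitral's actual billing code. The function names are hypothetical, and two points are assumptions because the page does not state them: the number of tokens each listed price covers (set to 1,000 here as a guess) and the idea that cached tokens are billed separately at the cache rate. Only a few rows from the pricing tables are included.

```python
from decimal import Decimal

# (input, output, cache) prices copied from the pricing tables.
# None marks a cache price listed as N/A.
PRICES = {
    "Llama 3.3": (Decimal("0.00071"), Decimal("0.00071"), None),
    "Gemini 2.5 Flash": (Decimal("0.00030"), Decimal("0.00250"), Decimal("0.00007")),
    "DeepSeek V3": (Decimal("0.00014"), Decimal("0.00028"), Decimal("0.00001")),
}

# ASSUMPTION: the page does not state the billing unit; 1,000 tokens is a guess.
TOKENS_PER_PRICE_UNIT = 1000


def interaction_cost(model, input_tokens, output_tokens, cached_tokens=0):
    """Estimate the credit cost of one interaction (hypothetical helper)."""
    input_price, output_price, cache_price = PRICES[model]
    if cached_tokens and cache_price is None:
        raise ValueError(f"{model} has no cache token price (N/A)")
    total = input_tokens * input_price + output_tokens * output_price
    if cached_tokens:
        # ASSUMPTION: cached tokens are billed on their own at the cache rate.
        total += cached_tokens * cache_price
    return total / TOKENS_PER_PRICE_UNIT


def deduct(balance, cost):
    """Return the credit balance left after one interaction."""
    if cost > balance:
        raise ValueError("insufficient credits")
    return balance - cost


if __name__ == "__main__":
    cost = interaction_cost("Gemini 2.5 Flash", 2000, 500, cached_tokens=1000)
    print("cost:", cost)
    print("remaining:", deduct(Decimal("10"), cost))
```

`Decimal` is used instead of floats so that many small deductions do not accumulate rounding error. Changing `TOKENS_PER_PRICE_UNIT` adapts the sketch if the prices turn out to cover a different number of tokens.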