Services Billing:
An explanation of how parasail bills its customers.
You can find the pricing examples also on our pricing page: https://www.saas.parasail.io/price
Serverless Billing:
Pricing for the “serverless” Use Case is token-based (total amount of tokens), and the amount you owe will change depending on which Model(s) you choose to use.
Dedicated Billing:
Our dedicated instances are priced by GPU per hour. We offer various configurations of our hardware fleet to hit your indicated cost, performance, and latency targets. Our dedicated instances autoscale the number of GPUs as your workload fluctuates, but we offer scale-down policy configuration to meet your needs.
Batch Billing:
Pricing for the “batch” Use Case is token-based (total amount of tokens, discounted to reflect the fact that your queries will not be processed in real time), and the amount you owe will change depending on which Model(s) you choose to use.
Last updated