Quota Management

GPU Quota Limits

By default, all users have a GPU quota limit of 4 GPUs across both Batch and Dedicated services. This quota applies collectively, regardless of GPU type. The Quota Increase Form is Here.

Examples:

  • If your current GPU usage is 2 H100 GPUs and you attempt to submit a Batch job requiring 8 H100 GPUs (e.g., DeepSeek V3), the submission will be rejected due to exceeding your 4 GPU quota.

  • If your autoscaling is configured for a range of 1–6 GPUs, and your model currently utilizes 4 GPUs, the quota limit will prevent the model from scaling up to a fifth GPU.

Requesting Increased Quotas

If you need additional GPU capacity, you can request an increased quota using our Quota Request Form. Quota increases up to 8 GPUs may be requested without additional justification. Requests exceeding 8 GPUs will require a detailed explanation of your usage needs.

Last updated