Quota Management
GPU Quota Limits
By default, every user has a quota of 4 GPUs, shared across the Batch and Dedicated services. The quota counts all GPUs together, regardless of GPU type. To request a higher limit, use the Quota Increase Form.
Examples:
If you are currently using 2 H100 GPUs and submit a Batch job that requires 8 H100 GPUs (e.g., DeepSeek V3), the submission is rejected because the combined total of 10 GPUs exceeds your 4-GPU quota.
If your autoscaling is configured for a range of 1–6 GPUs and your model is currently using 4 GPUs, the quota prevents the model from scaling up to a fifth GPU.
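The two examples above can be expressed as a short check. This is only an illustrative sketch of the arithmetic, not the platform's actual enforcement logic; the function names fits_quota and max_scale_target are hypothetical and are not part of any published API.

```python
DEFAULT_GPU_QUOTA = 4  # default per-user limit described above


def fits_quota(current_gpus: int, requested_gpus: int,
               quota: int = DEFAULT_GPU_QUOTA) -> bool:
    """Return True if the new request keeps total GPU usage within the quota.

    GPU type is ignored because the quota applies to all GPUs collectively.
    """
    return current_gpus + requested_gpus <= quota


def max_scale_target(autoscale_max: int, other_gpus_in_use: int = 0,
                     quota: int = DEFAULT_GPU_QUOTA) -> int:
    """Return the largest GPU count an autoscaled model can reach.

    The result is capped both by the configured autoscaling maximum and by
    whatever quota remains after other workloads (hypothetical parameter).
    """
    return max(0, min(autoscale_max, quota - other_gpus_in_use))


# Example 1: 2 GPUs in use, Batch job asks for 8 more.
print(fits_quota(current_gpus=2, requested_gpus=8))  # False

# Example 2: autoscaling range 1-6, no other GPU usage.
print(max_scale_target(autoscale_max=6))  # 4
```

In the second case the configured maximum of 6 is never reached: the effective ceiling is the smaller of the autoscaling maximum and the remaining quota.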
Requesting Increased Quotas
If you need additional GPU capacity, you can request an increased quota using the Quota Request Form. Quota increases up to 8 GPUs may be requested without additional justification. Requests exceeding 8 GPUs require a detailed explanation of your usage needs.
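As a rough sketch, the justification rule can be written as a single threshold check. It assumes the 8-GPU figure refers to the total quota being requested, which the text above does not state explicitly; needs_justification is a hypothetical helper, not part of the request form or any API.

```python
NO_JUSTIFICATION_LIMIT = 8  # requests up to this size need no extra explanation


def needs_justification(requested_quota: int) -> bool:
    """Return True if a quota request must include a detailed explanation.

    Assumption: requested_quota is the total GPU quota being asked for.
    """
    return requested_quota > NO_JUSTIFICATION_LIMIT


print(needs_justification(8))   # False
print(needs_justification(16))  # True
```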