GPUPool
API documentation for GPUPool
Resource Information
| Field | Value |
|---|---|
| API Version | tensor-fusion.ai/v1 |
| Kind | GPUPool |
| Scope | Cluster |
Spec
GPUPoolSpec defines the desired state of GPUPool.
| Property | Type | Description |
|---|---|---|
| capacityConfig | object | |
| componentConfig | object | Customize system components for seamless onboarding. |
| nodeManagerConfig | object | |
| qosConfig | object | Defines the QoS levels and their prices. |
| schedulingConfigTemplate | string | |
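
As a rough illustration of how the resource information and spec properties above fit together, the sketch below assembles a minimal GPUPool manifest as a plain Python dict. The helper name `build_gpupool_manifest` and the values `example-pool` and `example-template` are hypothetical. The nested config objects are left empty because their fields are not listed in this section, and whether empty objects pass validation is not stated here.

```python
import json


def build_gpupool_manifest(name, scheduling_config_template=""):
    """Return a minimal GPUPool manifest as a plain dict.

    Hypothetical helper for illustration only. The nested config
    objects are left empty because this section does not list
    their fields.
    """
    spec = {
        "capacityConfig": {},
        "componentConfig": {},
        "nodeManagerConfig": {},
        "qosConfig": {},
    }
    # schedulingConfigTemplate is a plain string; include it only when given.
    if scheduling_config_template:
        spec["schedulingConfigTemplate"] = scheduling_config_template
    return {
        "apiVersion": "tensor-fusion.ai/v1",
        "kind": "GPUPool",
        # GPUPool is cluster-scoped, so metadata carries no namespace.
        "metadata": {"name": name},
        "spec": spec,
    }


if __name__ == "__main__":
    # "example-pool" and "example-template" are placeholder names.
    manifest = build_gpupool_manifest("example-pool", "example-template")
    print(json.dumps(manifest, indent=2))
```

The JSON output could be applied with a Kubernetes client once the nested objects are filled in from their own reference pages.
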
Status
GPUPoolStatus defines the observed state of GPUPool.
| Property | Type | Description |
|---|---|---|
| availableTFlops * | any | `pattern: ^(+…` |
| availableVRAM * | any | `pattern: ^(+…` |
| budgetExceeded | string | If the budget is exceeded, this field is set to a comma-separated string indicating which periods caused the overrun. While this field is not empty, the scheduler does not schedule new AI workloads and stops scale-up checks. |
| cluster | string | |
| componentStatus * | object | When any component version or config is updated, the pool controller performs a rolling update. The status is refreshed periodically (every 5s by default), with progress reported from 0 to 100. A progress of 100 means the component version or config is fully updated. |
| conditions | array | |
| lastCompactionTime | string<date-time> | |
| notReadyNodes * | integer<int32> | |
| phase * | string | (default: Pending) Allowed values: Pending, Running, Updating, Destroying, Unknown |
| potentialSavingsPerMonth | string | |
| provisioningPhase | string | (default: None) Allowed values: None, Initializing, Provisioning, Completed |
| readyNodes * | integer<int32> | |
| runningAppsCnt | integer<int32> | |
| savedCostsPerMonth | string | |
| totalGPUs | integer<int32> | |
| totalNodes | integer<int32> | |
| totalTFlops * | any | `pattern: ^(+…` |
| totalVRAM * | any | `pattern: ^(+…` |
| virtualAvailableTFlops | any | `pattern: ^(+…` |
| virtualAvailableVRAM | any | `pattern: ^(+…` |
| virtualTFlops * | any | `pattern: ^(+…` |
| virtualVRAM * | any | `pattern: ^(+…` |
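
The status fields above can be read programmatically. The following is a minimal sketch, assuming the status has already been fetched and decoded into a Python dict. The function names are hypothetical, and the period names in the example (`daily`, `monthly`) are placeholders, since this section does not list the actual period values. The defaults for `phase` and `provisioningPhase` follow the table.

```python
def exceeded_periods(status):
    """Split the comma-separated budgetExceeded string into a list."""
    raw = status.get("budgetExceeded") or ""
    return [part.strip() for part in raw.split(",") if part.strip()]


def accepts_new_workloads(status):
    """Per the table, a non-empty budgetExceeded blocks new AI workloads."""
    return not exceeded_periods(status)


def summarize(status):
    """Condense a GPUPool status dict into a few readable fields."""
    ready = status.get("readyNodes", 0)
    not_ready = status.get("notReadyNodes", 0)
    return {
        "phase": status.get("phase", "Pending"),
        "provisioningPhase": status.get("provisioningPhase", "None"),
        "nodes": f"{ready} ready / {not_ready} not ready",
        "exceededPeriods": exceeded_periods(status),
        "acceptsNewWorkloads": accepts_new_workloads(status),
    }


if __name__ == "__main__":
    # Hypothetical status payload; the period names are placeholders.
    example = {
        "phase": "Running",
        "readyNodes": 3,
        "notReadyNodes": 1,
        "budgetExceeded": "daily, monthly",
    }
    print(summarize(example))
```

Treating a missing or empty `budgetExceeded` as "no budget exceeded" matches the field description, which ties the scheduling block to the field being non-empty.
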