GPUPool

Resource Information

Field	Value
API Version	`tensor-fusion.ai/v1`
Kind	`GPUPool`
Scope	`Cluster`

Spec

GPUPoolSpec defines the desired state of GPUPool.

Property	Type	Description
capacityConfig	`object`
componentConfig	`object`	Customize system components for seamless onboarding.
nodeManagerConfig	`object`
qosConfig	`object`	Define different QoS and their price.
schedulingConfigTemplate	`string`

Status

GPUPoolStatus defines the observed state of GPUPool.

Property	Type	Description
availableTFlops *	`any`	`pattern: ^(+
availableVRAM *	`any`	`pattern: ^(+
budgetExceeded	`string`	If the budget is exceeded, the set value in comma separated string to indicate which period caused the exceeding. If this field is not empty, scheduler will not schedule new AI workloads and stop scaling-up check.
cluster	`string`
componentStatus *	`object`	when updating any component version or config, pool controller will perform rolling update. the status will be updated periodically, default to 5s, progress will be 0-100. when the progress is 100, the component version or config is fully updated.
conditions	`array`
lastCompactionTime	`string<date-time>`
notReadyNodes *	`integer<int32>`
phase *	`string`	(default: `Pending`) Allowed values: `Pending`, `Running`, `Updating`, `Destroying`, `Unknown`
potentialSavingsPerMonth	`string`
provisioningPhase	`string`	(default: `None`) Allowed values: `None`, `Initializing`, `Provisioning`, `Completed`
readyNodes *	`integer<int32>`
runningAppsCnt	`integer<int32>`
savedCostsPerMonth	`string`
totalGPUs	`integer<int32>`
totalNodes	`integer<int32>`
totalTFlops *	`any`	`pattern: ^(+
totalVRAM *	`any`	`pattern: ^(+
virtualAvailableTFlops	`any`	`pattern: ^(+
virtualAvailableVRAM	`any`	`pattern: ^(+
virtualTFlops *	`any`	`pattern: ^(+
virtualVRAM *	`any`	`pattern: ^(+

GPUPool

API documentation for GPUPool

Resource Information

Field	Value
API Version	`tensor-fusion.ai/v1`
Kind	`GPUPool`
Scope	`Cluster`

Spec

GPUPoolSpec defines the desired state of GPUPool.

Property	Type	Description
capacityConfig	`object`
componentConfig	`object`	Customize system components for seamless onboarding.
nodeManagerConfig	`object`
qosConfig	`object`	Define different QoS and their price.
schedulingConfigTemplate	`string`

Status

GPUPoolStatus defines the observed state of GPUPool.

Property	Type	Description
availableTFlops *	`any`	`pattern: ^(+
availableVRAM *	`any`	`pattern: ^(+
budgetExceeded	`string`	If the budget is exceeded, the set value in comma separated string to indicate which period caused the exceeding. If this field is not empty, scheduler will not schedule new AI workloads and stop scaling-up check.
cluster	`string`
componentStatus *	`object`	when updating any component version or config, pool controller will perform rolling update. the status will be updated periodically, default to 5s, progress will be 0-100. when the progress is 100, the component version or config is fully updated.
conditions	`array`
lastCompactionTime	`string<date-time>`
notReadyNodes *	`integer<int32>`
phase *	`string`	(default: `Pending`) Allowed values: `Pending`, `Running`, `Updating`, `Destroying`, `Unknown`
potentialSavingsPerMonth	`string`
provisioningPhase	`string`	(default: `None`) Allowed values: `None`, `Initializing`, `Provisioning`, `Completed`
readyNodes *	`integer<int32>`
runningAppsCnt	`integer<int32>`
savedCostsPerMonth	`string`
totalGPUs	`integer<int32>`
totalNodes	`integer<int32>`
totalTFlops *	`any`	`pattern: ^(+
totalVRAM *	`any`	`pattern: ^(+
virtualAvailableTFlops	`any`	`pattern: ^(+
virtualAvailableVRAM	`any`	`pattern: ^(+
virtualTFlops *	`any`	`pattern: ^(+
virtualVRAM *	`any`	`pattern: ^(+

Resource Information

Spec

Status

目录

GPUPool

Resource Information

Spec

Status

目录

GPUPool

Resource Information

Spec

capacityConfig

componentConfig

nodeManagerConfig

qosConfig

Status

componentStatus

conditions (array items)

目录

GPUPool

Resource Information

Spec

capacityConfig

componentConfig

nodeManagerConfig

qosConfig

Status

componentStatus

conditions (array items)

目录