LogoTensorFusion Docs
LogoTensorFusion Docs
HomepageDocumentation

Getting Started

OverviewKubernetes InstallVM/Server Install(K3S)Helm On-premises InstallHost/GuestVM InstallTensorFusion Architecture

Application Operations

Create WorkloadConfigure AutoScalingMigrate Existing WorkloadBest Practices

Customize AI Infra

Production-Grade DeploymentConfig QoS and BillingBring Your Own CloudManaging License

Maintenance & Optimization

Upgrade ComponentsSetup AlertsGPU Live MigrationPreload ModelOptimize GPU Efficiency

Troubleshooting

HandbookTracing/ProfilingQuery Metrics & Logs

Reference

Comparison

Compare with NVIDIA vGPUCompare with MIG/MPSCompare with Run.AICompare with HAMi

GPUNode

GPUNode is the Schema for the gpunodes API.

GPUNode is the Schema for the gpunodes API.

Resource Information

FieldValue
API Versiontensor-fusion.ai/v1
KindGPUNode
ScopeCluster

Spec

GPUNodeSpec defines the desired state of GPUNode.

PropertyTypeDescription
cloudVendorParamstring
costPerHourstring
gpuCardIndicesarrayif not all GPU cards should be used, specify the GPU card indices, default to empty, onboard all GPU cards to the pool
manageModestring(default: AutoSelect) Allowed values: Manual, AutoSelect, Provisioned

Status

GPUNodeStatus defines the observed state of GPUNode.

PropertyTypeDescription
allocationInfoarray
availableTFlopsany`pattern: ^(+
availableVRAMany`pattern: ^(+
conditionsarray
hypervisorStatusobject
loadedModelsarray
managedGPUDeviceIDsarray
managedGPUs *integer<int32>
nodeInfoobject
observedGenerationinteger<int64>
phase *string(default: Pending) Allowed values: Pending, Provisioning, Migrating, Running, Succeeded, Failed, Unknown, Destroying
totalGPUs *integer<int32>
totalTFlops *any`pattern: ^(+
totalVRAM *any`pattern: ^(+
virtualAvailableTFlopsany`pattern: ^(+
virtualAvailableVRAMany`pattern: ^(+
virtualTFlopsany`pattern: ^(+
virtualVRAMany`pattern: ^(+

Table of Contents

Resource Information
Spec
Status