LogoTensorFusion Docs
LogoTensorFusion Docs
HomepageDocumentation

Getting Started

OverviewKubernetes InstallVM/Server Install(K3S)Helm On-premises InstallHost/GuestVM InstallTensorFusion Architecture

Application Operations

Create WorkloadConfigure AutoScalingMigrate Existing WorkloadBest Practices

Customize AI Infra

Production-Grade DeploymentConfig QoS and BillingBring Your Own CloudManaging License

Maintenance & Optimization

Upgrade ComponentsSetup AlertsGPU Live MigrationPreload ModelOptimize GPU Efficiency

Troubleshooting

HandbookTracing/ProfilingQuery Metrics & Logs

Reference

Comparison

Compare with NVIDIA vGPUCompare with MIG/MPSCompare with Run.AICompare with HAMi

GPUResourceQuota

GPUResourceQuota is the Schema for the gpuresourcequotas API

GPUResourceQuota is the Schema for the gpuresourcequotas API

Resource Information

FieldValue
API Versiontensor-fusion.ai/v1
KindGPUResourceQuota
ScopeNamespaced

Spec

GPUResourceQuotaSpec defines the desired state of GPUResourceQuota

PropertyTypeDescription
singleobjectPer-workload limits (similar to LimitRanges)
totalobjectTotal namespace limits (similar to ResourceQuotas)

Status

GPUResourceQuotaStatus defines the observed state of GPUResourceQuota

PropertyTypeDescription
availablePercentobjectAvailable percentage for each resource type
conditionsarrayConditions represent the latest available observations of the quota's state
lastUpdateTimestring<date-time>LastUpdateTime is the last time the status was updated
usedobjectCurrent resource usage in the namespace

Table of Contents

Resource Information
Spec
Status