LogoTensorFusion 文档
LogoTensorFusion 文档
首页

快速开始

TensorFusion概览在Kubernetes安装在虚拟机/服务器安装(K3S)Helm本地安装在宿主机/虚拟机安装TensorFusion架构

应用操作

创建AI应用配置自动扩缩容迁移现有应用最佳实践

自定义AI基础设施

生产级部署指南QoS级别与计价云厂商集成(BYOC)管理许可证

维护与优化

组件更新配置告警GPU热迁移预加载模型优化GPU效率

故障排除

问题处理手册链路追踪/性能分析查询指标和日志

参考

Helm安装Values配置
TensorFusionClusterGPUPoolGPUNodeGPUGPUNodeClassSchedulingConfigTemplateGPUResourceQuota
Kubernetes 事件监控指标定义性能测试命令行参考GPU/驱动/操作系统支持矩阵TensorFusion 安全白皮书

对比

与NVIDIA vGPU比较与MIG/MPS对比与趋动科技对比与 Run.AI 对比与HAMi的对比
系统管理员参考Kubernetes资源定义

GPU

GPU is the Schema for the gpus API.

GPU is the Schema for the gpus API.

Resource Information

FieldValue
API Versiontensor-fusion.ai/v1
KindGPU
ScopeCluster

Status

GPUStatus defines the observed state of GPU.

PropertyTypeDescription
available *object
capacity *object
gpuModel *string
message *string
nodeSelector *objectThe host match selector to schedule worker pods
phase *string(default: Pending) Allowed values: Pending, Provisioning, Running, Unknown, Destroying, Migrating
runningAppsarray
usedBystringGPU is used by tensor-fusion or nvidia-operator This is the key to be compatible with nvidia-device-plugin to avoid resource overlap Hypervisor will watch kubelet device plugin to report all GPUs already used by nvidia-device-plugin GPUs will be grouped by usedBy to be used by different Pods, tensor-fusion annotation or nvidia-device-plugin resource block Allowed values: tensor-fusion, nvidia-device-plugin
uuid *string

GPUNode

GPUNode is the Schema for the gpunodes API.

GPUNodeClass

GPUNodeClass is the Schema for the gpunodeclasses API.

目录

Resource Information
Status