Build for On-Device Training

Prerequisites

Python 3.x
CMake

Build Instructions for the Training Phase

Clone the repository

 git clone --recursive https://github.com/Microsoft/onnxruntime.git
 cd onnxruntime

Build ONNX Runtime for on-device training

a. For Windows
```
 .\build.bat --config RelWithDebInfo --cmake_generator "Visual Studio 17 2022" --build_shared_lib --parallel --enable_training_apis
```
b. For Linux
```
 ./build.sh --config RelWithDebInfo --build_shared_lib --parallel --enable_training_apis
```
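c. For Android

Refer to the Android build instructions and add the --enable_training_apis build flag.

d. For macOS

Refer to the macOS inference build instructions and add the --enable_training_apis build flag.

e. For iOS

Refer to the iOS build instructions and add the --enable_training_apis build flag.

f. For Web

Refer to the Web build instructions.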

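Notes

To build the C# bindings, add the --build_nuget flag to the build command above.

To build the Python wheel

Add the --build_wheel flag to the build command above.

Install the wheel with python -m pip install build/Linux/RelWithDebInfo/dist/*.whl (adjust the path to match your platform and --config value).

The config flag can be one of Debug, RelWithDebInfo, Release, or MinSizeRel. Use the configuration that suits your use case.

The --enable_training_apis flag can be combined with the --minimal_build flag.

The offline phase that generates the training artifacts can only be done with Python (using the --build_wheel flag).

The build commands above build for the CPU execution provider only. To build for the CUDA execution provider, add the following flags:

--use_cuda

--cuda_home {path to your CUDA home directory, e.g. /usr/local/cuda/}

--cudnn_home {path to your cuDNN home directory, e.g. /usr/local/cuda/}

--cuda_version={version number, e.g. 11.8}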
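The CUDA flags listed above can be combined with the Linux on-device training command as in the following sketch. The default paths and the version number are the example values from this page, not detected values, and the `run` helper is a hypothetical wrapper added here so the command can be printed and reviewed before it is executed.

```shell
# Assumed locations and version; adjust them to match your build machine.
CUDA_HOME="${CUDA_HOME:-/usr/local/cuda}"
CUDNN_HOME="${CUDNN_HOME:-/usr/local/cuda}"
CUDA_VERSION="${CUDA_VERSION:-11.8}"

# Hypothetical helper: print the command by default, execute it only when DRY_RUN=0.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

# On-device training build for Linux with the CUDA execution provider enabled.
run ./build.sh --config RelWithDebInfo --build_shared_lib --parallel \
  --enable_training_apis \
  --use_cuda \
  --cuda_home "$CUDA_HOME" \
  --cudnn_home "$CUDNN_HOME" \
  --cuda_version="$CUDA_VERSION"
```

Run the script from the repository root with DRY_RUN=0 to start the build instead of printing the command.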

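Build for Large Model Training

GPU / CUDA

Prerequisites

The default NVIDIA GPU build requires the CUDA runtime libraries to be installed on the system:

CUDA
cuDNN

Build Instructions

Check out this code repository: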

 git clone https://github.com/microsoft/onnxruntime
 cd onnxruntime

Set the environment variables, adjusting the paths to the locations on your build machine:

 export CUDA_HOME=<location for CUDA libs> # e.g. /usr/local/cuda
 export CUDNN_HOME=<location for cuDNN libs> # e.g. /usr/local/cuda
 export CUDACXX=<location for NVCC> # e.g. /usr/local/cuda/bin/nvcc
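A misconfigured path usually surfaces only partway through the build, so it can help to check the three variables first. The following is a minimal sketch; `check_cuda_env` is a hypothetical helper name, and it only tests that the directories exist and that the compiler path is executable, not that the versions are compatible.

```shell
# Hypothetical helper: check that the CUDA-related variables point at real locations.
check_cuda_env() {
  status=0
  for dir_var in CUDA_HOME CUDNN_HOME; do
    # Look up the value of the variable whose name is stored in dir_var.
    eval "dir_value=\${$dir_var:-}"
    if [ ! -d "$dir_value" ]; then
      echo "$dir_var is unset or not a directory: '$dir_value'" >&2
      status=1
    fi
  done
  if [ ! -x "${CUDACXX:-}" ]; then
    echo "CUDACXX is unset or not an executable: '${CUDACXX:-}'" >&2
    status=1
  fi
  return "$status"
}

check_cuda_env || echo "Fix the variables above before running build.sh" >&2
```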

Create the ONNX Runtime Python wheel

./build.sh --config=RelWithDebInfo --enable_training --build_wheel --use_cuda --cuda_home {location of CUDA libs, e.g. /usr/local/cuda/} --cudnn_home {location of cuDNN libs, e.g. /usr/local/cuda/} --cuda_version={version, e.g. 11.8}

Install the .whl file located in ./build/Linux/RelWithDebInfo/dist for ONNX Runtime training.
```
 python -m pip install build/Linux/RelWithDebInfo/dist/*.whl
```
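After installing the wheel, a quick import check confirms that the package loads and shows which device and execution providers the build reports. This is a sketch that assumes `python` on your PATH is the interpreter the wheel was installed into; `verify_ort_install` is a hypothetical helper name.

```shell
# Hypothetical helper: import the installed package and print what it reports.
verify_ort_install() {
  python - <<'EOF'
import onnxruntime as ort

print("version:", ort.__version__)
print("device:", ort.get_device())
print("providers:", ort.get_available_providers())
EOF
}

verify_ort_install || echo "onnxruntime is not importable in this Python environment" >&2
```

For a CUDA build, the provider list would be expected to include CUDAExecutionProvider; if only the CPU provider appears, the wheel that was installed is likely not the one built above.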

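That's it! Once the build completes, you can start using the ONNX Runtime libraries and executables in your project. These steps are general and may need to be adjusted for your environment and requirements. For more help, ask the ONNX Runtime community on GitHub.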

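GPU / ROCm

Prerequisites

The default AMD GPU build requires the ROCm software toolkit to be installed on the system:

ROCm 6.0.0

Build Instructions

Check out this code repository: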

 git clone https://github.com/microsoft/onnxruntime
 cd onnxruntime

Create the ONNX Runtime Python wheel

./build.sh --config Release --enable_training --build_wheel --parallel --skip_tests --use_rocm --rocm_home /opt/rocm

Install the .whl file located in ./build/Linux/Release/dist for ONNX Runtime training. The path follows the --config Release value used in the command above.
```
 python -m pip install build/Linux/Release/dist/*.whl
```

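DNNL and MKLML

Build Instructions

Linux

./build.sh --enable_training --use_dnnl

Windows

.\build.bat --enable_training --use_dnnl

Add --build_wheel to build the ONNX Runtime wheel.

On Linux, this produces a .whl file for ONNX Runtime training in build/Linux/<config>/dist, where <config> is the build configuration (for example, build/Linux/RelWithDebInfo/dist when building with --config RelWithDebInfo).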
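Because the wheel location depends on the configuration, it can be convenient to derive the install path from the same value passed to --config. The sketch below assumes the build/Linux/<config>/dist layout that the install commands on this page use; `wheel_dist_dir` is a hypothetical helper name, and the script only prints the commands rather than running them.

```shell
# Hypothetical helper: map a --config value to the wheel output directory on Linux.
wheel_dist_dir() {
  case "$1" in
    Debug|RelWithDebInfo|Release|MinSizeRel)
      printf 'build/Linux/%s/dist\n' "$1"
      ;;
    *)
      echo "unknown config: $1" >&2
      return 1
      ;;
  esac
}

CONFIG="${CONFIG:-RelWithDebInfo}"

# Print the build and install commands for the chosen configuration.
echo "./build.sh --config $CONFIG --enable_training --use_dnnl --build_wheel"
echo "python -m pip install $(wheel_dist_dir "$CONFIG")/*.whl"
```

Passing the configuration explicitly to both steps avoids installing a stale wheel left over from a build with a different --config value.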