# Model Preparation & Conversion

For optimizing and deploying AI models efficiently on Intel hardware commonly used in edge computing environments (CPUs, GPUs, VPUs, and FPGAs), Open Edge Platform relies heavily on the [OpenVINO™ Toolkit](https://docs.openvino.ai/). OpenVINO (short for Open Visual Inference and Neural Network Optimization) is an open-source toolkit from Intel that:

- Optimizes trained AI models (from frameworks such as PyTorch, TensorFlow, ONNX, etc.)
- Deploys them efficiently on Intel edge devices
- Maximizes inference speed and resource utilization

Note that OpenVINO is NOT for training, but for running AI models quickly and efficiently once they are trained.

OpenVINO supports the following model formats: PyTorch, TensorFlow, TensorFlow Lite, ONNX, and PaddlePaddle.

Although PyTorch, TensorFlow, ONNX, and PaddlePaddle are capable AI frameworks in their own right, the [OpenVINO™ Toolkit](https://docs.openvino.ai/) can provide additional inference-performance benefits on Intel hardware, as it is heavily optimized for this task.

## OpenVINO IR Format

OpenVINO IR is the proprietary model format of OpenVINO, benefiting from the full extent of its features. It is obtained by converting a model from one of the supported formats (PyTorch, TensorFlow, TensorFlow Lite, ONNX, PaddlePaddle) using the model conversion API or OpenVINO Converter. The process translates common deep learning operations of the original network to their counterpart representations in OpenVINO and tunes them with the associated weights and biases.

The resulting OpenVINO IR format consists of two files:

- `.xml` - describes the model topology.
- `.bin` - contains the weights and binary data.

## Conversion Benefits

- Saving to IR improves first-inference latency.
- Saving to IR in FP16 reduces model size on disk.
- Saving to IR avoids large framework dependencies in inference code.

For more information about OpenVINO IR, refer to the [OpenVINO IR format article](https://docs.openvino.ai/2025/openvino-workflow/model-preparation/convert-model-to-ir.html).

## Model preparation

Before a model can be converted and deployed, it first has to be built: data collected and annotated, the model trained, and the result exported in one of the supported formats. The platforms below cover this stage of the workflow.

### Geti

The Geti™ platform enables enterprise teams to rapidly build computer vision AI models. Through an intuitive graphical interface, users add image or video data, make annotations, train, retrain, export, and optimize AI models for deployment. Equipped with state-of-the-art technology such as active learning, task chaining, and smart annotations, the Geti™ platform reduces labor-intensive tasks, enables collaborative model development, and speeds up model creation.

![model lifecycle](./images/model-lifecycle-infinity-dark.png)

For more information about Intel Geti, please refer to the [Geti Documentation](https://docs.geti.intel.com/).

### Roboflow, etc.

Roboflow and similar third-party platforms can also be used to manage datasets, annotate data, and train models before they are exported and converted for deployment with OpenVINO.

## Model conversion

### Conversion from Open Formats

AI models are often portable: PyTorch, TensorFlow, ONNX, and other frameworks work with OpenVINO out of the box. Below we briefly describe the porting options and provide links to helpful resources.
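Regardless of the source framework, the overall flow is the same: convert the model to an `ov.Model`, save it to IR, and later read the IR back for inference. Below is a minimal sketch of that flow, assuming a hypothetical `model.onnx` file; the sections that follow cover each source format in more detail.

```python
import openvino as ov

core = ov.Core()

# Convert a model from a supported format (here, a hypothetical ONNX file)
ov_model = ov.convert_model("model.onnx")

# Save to OpenVINO IR; compress_to_fp16=True stores the weights in FP16 to save space
ov.save_model(ov_model, "model/exported_model.xml", compress_to_fp16=True)

# Later, read the IR (.xml + .bin) back and compile it for a target device
model_ir = core.read_model("model/exported_model.xml")
compiled_model = core.compile_model(model_ir, device_name="CPU")
```

Saving with `compress_to_fp16=True` is what produces the smaller FP16 IR mentioned in the conversion benefits above.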
#### Converting PyTorch models

[PyTorch](https://pytorch.org/) is an open-source machine learning framework developed primarily by Meta AI. It is widely used for building and training deep learning models, including neural networks for tasks like computer vision, natural language processing, and generative AI.

PyTorch models cannot be passed directly to `core.read_model`. An `ov.Model` for model objects from this framework can be obtained using the `ov.convert_model` API. Here is the simplest example of PyTorch model conversion, using a model from torchvision:

```python
import torch
import torchvision
import openvino as ov

# Load a pretrained ResNet-50 from torchvision and convert it to an ov.Model
model = torchvision.models.resnet50(weights='DEFAULT')
ov_model = ov.convert_model(model)
```

For more information, please refer to the [Converting a PyTorch Model article](https://docs.openvino.ai/2025/openvino-workflow/model-preparation/convert-model-pytorch.html) in the OpenVINO documentation and the [pytorch-to-openvino notebook](https://github.com/openvinotoolkit/openvino_notebooks/blob/latest/notebooks/pytorch-to-openvino/pytorch-to-openvino.ipynb) on GitHub.

#### Converting TensorFlow models

[TensorFlow](https://www.tensorflow.org/) is an open-source machine learning framework developed by Google for building, training, and deploying AI and deep learning models. It is one of the most popular tools in the AI world, used for everything from simple linear regression to massive neural networks powering image recognition, speech processing, and large language models.

TensorFlow models saved in frozen graph format can also be passed to `read_model`. As an example, download the [classification.pb](https://storage.openvinotoolkit.org/repositories/openvino_notebooks/models/002-example-models/classification.pb) model from our model repository to the *model* folder:

```python
import openvino as ov

core = ov.Core()

tf_model_path = "model/classification.pb"
model_tf = core.read_model(model=tf_model_path)

# Compile for a target device, e.g. "CPU"
compiled_model_tf = core.compile_model(model=model_tf, device_name="CPU")

# Save the model in OpenVINO IR format
ov.save_model(model_tf, output_model="model/exported_tf_model.xml")
```

For more information, including examples in C++, C, and the CLI, please refer to the [Converting a TensorFlow Model article](https://docs.openvino.ai/2025/openvino-workflow/model-preparation/convert-model-tensorflow.html) in the OpenVINO documentation.

#### Converting ONNX models

[ONNX (Open Neural Network Exchange)](https://onnx.ai/) is an open format built to represent machine learning models. ONNX defines a common set of operators - the building blocks of machine learning and deep learning models - and a common file format that enables AI developers to use models with a variety of frameworks, tools, runtimes, and compilers.

OpenVINO supports reading models in ONNX format directly, which means they can be used with OpenVINO Runtime without any prior conversion. To convert an ONNX model explicitly, run model conversion with the path to the input `.onnx` file:

```python
import openvino as ov

ov.convert_model('your_model_file.onnx')
```
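As noted above, an ONNX file can also be read directly by the runtime, without an explicit conversion step. A minimal sketch, assuming the same hypothetical `your_model_file.onnx`:

```python
import openvino as ov

core = ov.Core()

# An ONNX file can be read directly; no prior conversion to IR is required
model_onnx = core.read_model("your_model_file.onnx")
compiled_model = core.compile_model(model_onnx, device_name="CPU")
```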
For more information, including the CLI approach, please refer to the [Converting an ONNX Model article](https://docs.openvino.ai/2025/openvino-workflow/model-preparation/convert-model-onnx.html) in the OpenVINO documentation.

#### Converting PaddlePaddle models

[PaddlePaddle (PArallel Distributed Deep LEarning)](https://www.paddlepaddle.org.cn/en) is a simple, efficient, and extensible deep learning framework, open-sourced to professional communities since 2016. It is an industrial platform with advanced technologies and rich features that cover core deep learning frameworks, basic model libraries, end-to-end development kits, tools and components, as well as service platforms.

To convert a model, use `openvino.convert_model` and specify the path to the input `.pdmodel` file:

```python
import openvino as ov

ov.convert_model('your_model_file.pdmodel')
```

For more information, including CLI options, please refer to the [Converting a PaddlePaddle Model article](https://docs.openvino.ai/2025/openvino-workflow/model-preparation/convert-model-paddle.html) in the OpenVINO documentation and the [paddle-to-openvino notebook](https://github.com/openvinotoolkit/openvino_notebooks/blob/latest/notebooks/paddle-to-openvino/paddle-to-openvino-classification.ipynb) on GitHub.

### Conversion from TAO

The NVIDIA TAO Toolkit (short for Train, Adapt, Optimize) is a low-code AI model development framework that helps build, fine-tune, and deploy deep learning models without writing complex training code from scratch. It is part of NVIDIA's DeepStream and TensorRT ecosystem and is designed mainly for computer vision and speech AI tasks.

Common model formats used by TAO at various stages of model development:

| Stage      | Format    | Description                                         |
| ---------- | --------- | --------------------------------------------------- |
| Training   | `.tlt`    | TAO internal model checkpoint                       |
| Export     | `.etlt`   | Encrypted TensorRT-ready model                      |
| Deployment | `.engine` | TensorRT optimized inference engine                 |
| Optional   | `.onnx`   | Open format for interoperability (optional export)  |

There is no option to directly convert `.etlt` → `.onnx`, because `.etlt` (encrypted TAO model) is a proprietary, encrypted format meant for TensorRT deployment only.

#### Option 1: Export ONNX directly from TAO Toolkit

When using the `tao export` command, specify the output ONNX file:

```bash
tao export --onnx_file output_model.onnx
```

:::info[Note]
This is supported for most networks, though some, especially NVIDIA's proprietary backbones, may not allow ONNX export.
:::

#### Option 2: Use TensorRT Engine Conversion (Not Recommended for ONNX)

Converting `.etlt` to a TensorRT engine (`.engine`) is possible using `tao-converter`, but conversion to ONNX is not. The encryption key, output node names, and input dimensions must be supplied:

```bash
tao-converter model.etlt -k <encryption_key> -o <output_node_names> -d <input_dims> -t fp16 -e model.engine
```

Once you have an `.engine` file, it is an optimized binary for inference and cannot be converted back to ONNX.
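To tie this back to OpenVINO: the ONNX file exported in Option 1 can then be converted to OpenVINO IR like any other ONNX model. A minimal sketch, assuming the hypothetical `output_model.onnx` from the example above:

```python
import openvino as ov

# Convert the ONNX model exported from TAO and save it as OpenVINO IR
ov_model = ov.convert_model("output_model.onnx")
ov.save_model(ov_model, "output_model.xml")
```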