Azure IoT Edge is an Azure service that deploys cloud workloads to Internet of Things (IoT) edge devices as standard containers.
If you didn't convert the models to ONNX, each model's own inference engine must be included in your IoT Edge module to run inference.
ONNX provides a common format for representing neural network models from different training frameworks, so they can all be executed with the same runtime, ONNX Runtime (onnxruntime), on various edge device platforms. The same base Docker image can be used to run ONNX models across x86 and arm64 devices with different accelerators, such as GPUs and VPUs.
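As a rough illustration of how one module could adapt to different accelerators, the sketch below asks ONNX Runtime which execution providers are available on the device and picks from a preference list. The preference order, the helper names `pick_providers` and `create_session`, and the model path are assumptions made for this example, not part of the original guidance; which providers are actually present depends on the onnxruntime build installed in the image.

```python
# Preferred execution providers, most specialized first.
# This ordering is an illustrative assumption, not a recommendation.
PREFERRED_PROVIDERS = [
    "CUDAExecutionProvider",      # NVIDIA GPUs
    "OpenVINOExecutionProvider",  # Intel hardware via OpenVINO
    "CPUExecutionProvider",       # CPU fallback
]


def pick_providers(available, preferred=PREFERRED_PROVIDERS):
    """Return the preferred providers that this device offers, in order."""
    chosen = [p for p in preferred if p in available]
    # Fall back to the CPU provider if nothing preferred is available.
    return chosen or ["CPUExecutionProvider"]


def create_session(model_path):
    """Create an ONNX Runtime session using the providers found on this device."""
    # Imported lazily so pick_providers can be used without onnxruntime installed.
    import onnxruntime as ort

    providers = pick_providers(ort.get_available_providers())
    return ort.InferenceSession(model_path, providers=providers)
```

With a hypothetical model file, usage might look like `session = create_session("model.onnx")` followed by `session.run(None, {session.get_inputs()[0].name: input_array})`, where `input_array` is a NumPy array matching the model's input shape. Keeping the provider choice in one small function means the same module code can run unchanged on each device type.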