Unexpected Output Shape After ONNX Conversion #171

Open
amarwingxpand opened this issue Feb 13, 2025 · 3 comments
Labels
bug Something isn't working

Comments

@amarwingxpand

I'm converting the YOLOv9 model to ONNX for use with NVIDIA DeepStream. Inside FastModelLoader, the _create_onnx_model function appears to handle the PyTorch-to-ONNX conversion. However, when I run this function, it outputs a list of 18 tensors with shapes like:

Output[0] shape: (1, 80, 80, 80)
Output[1] shape: (1, 16, 4, 80, 80)
Output[2] shape: (1, 4, 80, 80)
Output[3] shape: (1, 80, 40, 40)
Output[4] shape: (1, 16, 4, 40, 40)
Output[5] shape: (1, 4, 40, 40)
Output[6] shape: (1, 80, 20, 20)
Output[7] shape: (1, 16, 4, 20, 20)
Output[8] shape: (1, 4, 20, 20)
Output[9] shape: (1, 80, 80, 80)
Output[10] shape: (1, 16, 4, 80, 80)
Output[11] shape: (1, 4, 80, 80)
Output[12] shape: (1, 80, 40, 40)
Output[13] shape: (1, 16, 4, 40, 40)
Output[14] shape: (1, 4, 40, 40)
Output[15] shape: (1, 80, 20, 20)
Output[16] shape: (1, 16, 4, 20, 20)
Output[17] shape: (1, 4, 20, 20)

This is unexpected, as DeepStream typically expects a single output tensor or structured outputs containing bounding boxes (batch_size, num_boxes, 4), class confidence scores (batch_size, num_boxes, num_classes), and objectness scores (batch_size, num_boxes, 1).

How should I interpret these tensors and correctly format them for inference in DeepStream?

@amarwingxpand amarwingxpand added the bug Something isn't working label Feb 13, 2025
@henrytsui000
Member

Hi,

You may check out the usage of PostProcess.

Currently, the model outputs predictions at three resolutions (20×20, 40×40, and 80×80 grids). For each resolution, it produces three types of outputs:
• 80 → per-class predictions (one channel per class)
• 16×4 → distribution over 16 bins for each of the four box sides (DFL-style grid information)
• 4 → bounding box coordinates

Additionally, these outputs come from two branches: auxiliary and main. This results in a total of:
3 levels × 3 output types × 2 branches = 18 outputs.
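The enumeration above can be sketched with a small numpy snippet (not the repository's code) that partitions the 18 tensors by branch and grid level. Which half of the output list is the auxiliary branch is an assumption here; verify the ordering against the ONNX graph's output names.

```python
import numpy as np

# One branch's shapes, as listed in the issue; the exporter emits this set
# twice (assumed here: auxiliary branch first, then main branch).
shapes = [
    (1, 80, 80, 80), (1, 16, 4, 80, 80), (1, 4, 80, 80),   # 80x80 grid
    (1, 80, 40, 40), (1, 16, 4, 40, 40), (1, 4, 40, 40),   # 40x40 grid
    (1, 80, 20, 20), (1, 16, 4, 20, 20), (1, 4, 20, 20),   # 20x20 grid
]
outputs = [np.zeros(s, dtype=np.float32) for s in shapes * 2]  # 18 tensors

aux, main = outputs[:9], outputs[9:]   # keep only the main branch for inference
for level in range(3):                 # class, DFL, box tensors per grid level
    cls, dfl, box = main[3 * level : 3 * level + 3]
    print(cls.shape, dfl.shape, box.shape)
```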

Typically, we use PostProcess to select the main branch’s outputs and apply NMS to the predictions. The shape (batch_size, num_boxes, 4) is not robust because the number of boxes varies for each image. Customizing PostProcess with padding may help address this issue.
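As a rough illustration of the decoding step (a numpy sketch, not the PostProcess API): a softmax over the 16 DFL bins gives a probability distribution per box side, and its expectation yields the four distances, which NMS-pruned boxes are then built from.

```python
import numpy as np

def dfl_decode(dfl):
    """Decode one (16, 4, H, W) DFL tensor into (4, H, W) expected distances."""
    bins = np.arange(dfl.shape[0], dtype=np.float32)      # bin values 0..15
    p = np.exp(dfl - dfl.max(axis=0, keepdims=True))      # stable softmax
    p /= p.sum(axis=0, keepdims=True)                     # over the 16 bins
    return np.tensordot(bins, p, axes=(0, 0))             # expectation per side

dfl = np.random.randn(16, 4, 20, 20).astype(np.float32)   # dummy 20x20 level
dist = dfl_decode(dfl)
print(dist.shape)  # (4, 20, 20)
```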

Best regards,
Henry Tsui

@ramonhollands
Contributor

Hi @henrytsui000,

Would be nice if we could add some additional model layers on export to do the post processing inside the model. What do you think about this?
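For reference, a hedged numpy sketch of what such an in-graph head could emit: a single tensor of shape (batch, num_boxes, 4 + num_classes), which is the layout DeepStream-style parsers consume. A real export would implement this as model layers; the reshaping logic is the same.

```python
import numpy as np

num_classes = 80
levels = [(80, 80), (40, 40), (20, 20)]                   # main-branch grids
cls_outs = [np.zeros((1, num_classes, h, w), np.float32) for h, w in levels]
box_outs = [np.zeros((1, 4, h, w), np.float32) for h, w in levels]

flat = []
for cls, box in zip(cls_outs, box_outs):
    b, _, h, w = cls.shape
    cls = cls.reshape(b, num_classes, h * w).transpose(0, 2, 1)
    box = box.reshape(b, 4, h * w).transpose(0, 2, 1)
    flat.append(np.concatenate([box, cls], axis=-1))      # (b, h*w, 4 + classes)
fused = np.concatenate(flat, axis=1)                      # all levels stacked
print(fused.shape)  # (1, 8400, 84)
```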

Best regards,
Ramon

@ramonhollands
Contributor

Did some experiments, and it seems to work fine for CoreML support. It saves tons of Swift code ;-)
