Bring the C++ ONNX importer on par with `onnx_importer.py` #3960

giacs-epic · 2025-01-15T09:34:06Z

This PR heavily refactors the C++ importer to make it a viable (and faster) alternative to the Python one for those who need to import ONNX models and want to avoid depending on the python ecosystem.

The C++ importer now outputs the same exact mlir as the Python one (tested on alt_e2eshark test suite). Achieving perfect output matches required to introduce an associative map iterable according to insertion order (to mimic Dict in Python).
The code tries to mirror 1-to-1 the Python counterpart whenever possible/convenient.
Adds support for embedding ONNX external data in the mlir. This functionality is not part of torch-mlir's onnx_importer.py but of IREE's import_onnx.
Efforts have been made to remove the direct dependency on LLVM support lib. There is however a transitive dependency on such lib through MLIRCAPIIR and TorchMLIRCAPI (MLIR libraries uniformly depend on LLVMSupport).

…Shape special handler

stellaraccident · 2025-01-15T18:59:19Z

projects/onnx_c_importer/OnnxImporter.h

@@ -18,19 +18,29 @@
 // this kind of style.

 #include "mlir-c/IR.h"
+#include "mlir/Support/LLVM.h"


Don't expose LLVM support lib internals as part of the public API.

Looks to be just for FailureOr. Best to find a simpler plain C++ way to do that.

I'd prefer to not introduce the LLVM support into the implementation either. The reason is based on experience: This code will likely end up getting used in a variety of places, some of which will be late binding. The LLVM support library is poorly factored and every time we've used it in "highly traveled" utility code like this, I end up fighting an endless list of ODR violations and other bugs. I don't think it pays for itself when used as part of a standalone library like this. Glancing through the implementation, it is using a few conveniences but most could just be inlined and not introduce the dep. The rest (i.e. ArrayRef, string stuff, etc) have easy alternatives in C++17 and 20. I'd consider using C++20 features for code like this to be a less bad way to go than pulling in the support library for some minor ergonomic stuff.

zjgarvey · 2025-02-20T01:03:50Z

@vinayakdsci I'm going to take a look at this tomorrow. If you also have a free cycle to review, I'd appreciate it.

zjgarvey · 2025-02-21T00:09:26Z

Thanks for all of this work on bringing up the C++ importer.

I took a first pass, and I'll need a little more time to review. I'm not sure how long it will take to get this merged, so I was wondering if you would be willing to factor out the python importer bug fixes to a different PR so we can expedite merging those separately (they seem rather important in their own right).

I'm going to take a closer look, then clone and test some of it out tomorrow.

Do you have a branch with some of the SHARK-TestSuite changes you are using to compare the mlir output against the python importer?

giacs-epic · 2025-02-21T14:02:10Z

Thanks for all of this work on bringing up the C++ importer.

I took a first pass, and I'll need a little more time to review. I'm not sure how long it will take to get this merged, so I was wondering if you would be willing to factor out the python importer bug fixes to a different PR so we can expedite merging those separately (they seem rather important in their own right).

I'm going to take a closer look, then clone and test some of it out tomorrow.

Do you have a branch with some of the SHARK-TestSuite changes you are using to compare the mlir output against the python importer?

Thanks for starting looking at this, I understand it will take a while given its size.
I opened this PR with the onnx_importer.py fixes: #4037
Here's the changes to SHARK-TestSuite I've been using to compare outputs: giacs-epic/SHARK-TestSuite#1

giacs-epic added 6 commits December 3, 2024 08:57

Add Constant value attr handler, add none constant, remove ConstantOf…

c9e38c6

…Shape special handler

Port up to ImportGeneralNode()

85a8c2f

Finish port

b3e1a15

Add checks before map insertions

b56bae2

Port command line arguments and loadOnnxModel()

e9b7e11

Update SanitizeNameAsIdentifier() routine

840b280

stellaraccident reviewed Jan 15, 2025

View reviewed changes

giacs-epic added 23 commits January 16, 2025 15:39

Remove most onnx calls in import-onnx-main

523cd45

Switch to c++20

84ed038

Switch to SimpleArgParser. Enable exceptions for ONNX.

db756b0

Fix usage reporting

16ee4ad

Remove usage of LLVM support lib

94fcf0e

Fix name sanitization and accessibility of module's func.

90d16a6

Add Dict to replace unordered_map usages

f72aef0

Remove debug print

732e96a

Fix a couple of out-of-scope memory accesses bugs

7eecd13

Fix ordering of sub-blocks. Fix TypeProto nullptr handling.

d3b13d0

Fix missing calls to Initialize()

fc7841e

Comment out debug check_model

288999f

Merge branch 'main' into update_c_onnx_importer

fd8a351

Fix several bugs arising from operator tests

e29aa97

Fix missing implementation in GraphInfo and minor bugs.

c52f4cd

Change comment of dense bool conversion

c8cd855

Fix shapes not being inferred before version conversion.

a126229

onnx_importer.py: fix dim_value None not correctly processed

154739c

Removed check_model() (not actually used in onnx_importer.py)

f319fe3

Add external data loading.

bdf64cd

Merge commit 'c9694c6' into update_c_onnx_importer

9f7257e

Clean up

0655791

Fix InferShapes uncaught exceptions

6326092

giacs-epic changed the title ~~Update C onnx importer~~ Bring the C++ ONNX importer on par with onnx_importer.py Feb 17, 2025

giacs-epic marked this pull request as ready for review February 17, 2025 13:36

Merge branch 'main' into update_c_onnx_importer

944ab77

zjgarvey self-requested a review February 20, 2025 01:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bring the C++ ONNX importer on par with `onnx_importer.py` #3960

Bring the C++ ONNX importer on par with `onnx_importer.py` #3960

giacs-epic commented Jan 15, 2025 •

edited

Loading

stellaraccident Jan 15, 2025

zjgarvey commented Feb 20, 2025

zjgarvey commented Feb 21, 2025

giacs-epic commented Feb 21, 2025

Bring the C++ ONNX importer on par with onnx_importer.py #3960

Are you sure you want to change the base?

Bring the C++ ONNX importer on par with onnx_importer.py #3960

Conversation

giacs-epic commented Jan 15, 2025 • edited Loading

stellaraccident Jan 15, 2025

Choose a reason for hiding this comment

zjgarvey commented Feb 20, 2025

zjgarvey commented Feb 21, 2025

giacs-epic commented Feb 21, 2025

Bring the C++ ONNX importer on par with `onnx_importer.py` #3960

Bring the C++ ONNX importer on par with `onnx_importer.py` #3960

giacs-epic commented Jan 15, 2025 •

edited

Loading