
Model file export #4305

Open · wants to merge 11 commits into main

Conversation

omer-candan commented Jul 8, 2024

Write models to .mps files

We can export linear solver models and get a string as a result. However, when working with very large models, the process (.NET in my case) fails when the export method is called, due to the string type's size limitation. In our use case we don't actually need the exported model in memory; we write the string to a file. A method that writes directly to a file helps us overcome the limitation.
The file write method has since been added by @lperron, which simplifies this PR.
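A minimal self-contained sketch of the motivation (illustrative only; the section strings and file name are placeholders, not the OR-Tools exporter API): building the whole export in one string means the full text must fit in memory at once, while streaming each piece to a file keeps memory use bounded.

```cpp
#include <fstream>
#include <string>
#include <vector>

int main() {
  // Stand-ins for the pieces an MPS exporter emits (header, rows, columns, ...).
  const std::vector<std::string> sections = {"NAME model\n", "ROWS\n", "ENDATA\n"};

  // String-based export: the entire text lives in memory, and the .NET wrapper
  // additionally has to fit it inside a single string, which fails for very
  // large models.
  std::string as_string;
  for (const auto& s : sections) as_string += s;

  // File-based export: each piece is written as it is produced, so memory use
  // stays bounded no matter how large the model is.
  std::ofstream out("model.mps");
  for (const auto& s : sections) out << s;
  return 0;
}
```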

lperron (Collaborator) commented Jul 9, 2024

This is interesting, but I cannot use it as is: we cannot use iostream. The complex path is to implement gzip reading/writing in the File interface, which is a pain.

omer-candan (Author):

Thanks @lperron. My first attempt did not use iostream: I had to duplicate the ExportModelAsLpFormat, AppendComments, and AppendConstraint functions to write directly to .gz files. I guess that would also not be OK, because of too much duplication.
Did you mean something similar?

lperron (Collaborator) commented Jul 9, 2024

So, the best way to integrate this is to add a field to the File class (ortools/base/file.h) that indicates whether the file is a gzip one.
In that case, the relevant methods need a duplicated code path that uses the gzip methods instead of the C FILE* methods.

For instance, the MPS reader uses the FileLine class, which in turn calls the raw File::Read() API.

For the MPS writer, I would make sure that we flush the output string to the file regularly, and that this flush is done by the File API, which internally would use the gzip API.

Am I clear?
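A minimal sketch of this design, assuming zlib's gzFile API; the class name, the is_gzip flag, and the ".gz" suffix check are assumptions for illustration, not the actual ortools/base/file.h code. The point is that a caller such as an MPS writer flushing its buffer regularly only ever calls Write(), and the gzip-vs-plain branching stays inside the File abstraction.

```cpp
#include <cstdio>
#include <string>
#include <zlib.h>

class SketchFile {
 public:
  // Treat any path ending in ".gz" as a gzip file (assumption for this sketch).
  // Error handling (failed open, short writes) is omitted to keep it short.
  explicit SketchFile(const std::string& path)
      : is_gzip_(path.size() > 3 &&
                 path.compare(path.size() - 3, 3, ".gz") == 0) {
    if (is_gzip_) gz_ = gzopen(path.c_str(), "wb");
    else fp_ = std::fopen(path.c_str(), "wb");
  }

  // Single entry point for callers; internally branches to gzwrite() or fwrite().
  bool Write(const std::string& data) {
    if (is_gzip_) {
      return gzwrite(gz_, data.data(), static_cast<unsigned>(data.size())) ==
             static_cast<int>(data.size());
    }
    return std::fwrite(data.data(), 1, data.size(), fp_) == data.size();
  }

  void Close() {
    if (is_gzip_) gzclose(gz_);
    else std::fclose(fp_);
  }

 private:
  const bool is_gzip_;
  gzFile gz_ = nullptr;
  std::FILE* fp_ = nullptr;
};
```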

omer-candan (Author):

Yes, thanks again! I'll try my best to work on those parts.

lperron (Collaborator) commented Jul 12, 2024

Please sync with main; you will have a conflict.
I have implemented MpModelProtoExporter::WriteModelToMpsFile() and hooked it up to model_builder (Python, Java, .NET).
The implementation is not that robust to very large models, but it is better than before.

The only missing piece is gzip support in File (and hooking it up to linear_solver C# if you want it).

omer-candan (Author):

@lperron Thank you so much. I resolved the conflicts, called your function from linear_solver, and made gzip export optional via a bool parameter.
I had a difficult time with the compression part. Your method can write the 13 GB (1.2 GB compressed) model file from my tests in a single call, but that was not possible with the new gzip write (gzwrite): the compressed output files were cut off and usually could not be decompressed. Writing in multiple chunks solved the problem. Is this an acceptable approach? If so, I'd like your opinion on the chunk size and compression level (currently 1 = fastest) that I chose.
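A sketch of the chunked-write approach described above, assuming zlib's gzFile API; the 16 MiB chunk size and the level-1 "wb1" mode are examples of the kind of values being asked about, not the PR's actual choices. On typical platforms gzwrite() takes a 32-bit unsigned length, so a single call cannot describe a 13 GB buffer; looping over bounded chunks keeps every call in range.

```cpp
#include <algorithm>
#include <cstddef>
#include <string>
#include <zlib.h>

// Writes `data` to a gzip file in bounded chunks. Returns false on any failure.
bool WriteStringToGzipFile(const std::string& path, const std::string& data) {
  gzFile gz = gzopen(path.c_str(), "wb1");  // "wb1" = write, compression level 1 (fastest).
  if (gz == nullptr) return false;

  constexpr std::size_t kChunkSize = 16 << 20;  // 16 MiB per gzwrite() call.
  for (std::size_t offset = 0; offset < data.size(); offset += kChunkSize) {
    const std::size_t len = std::min(kChunkSize, data.size() - offset);
    if (gzwrite(gz, data.data() + offset, static_cast<unsigned>(len)) !=
        static_cast<int>(len)) {
      gzclose(gz);
      return false;
    }
  }
  // gzclose() flushes the remaining compressed data; skipping it is one way
  // to end up with truncated, non-decompressable output.
  return gzclose(gz) == Z_OK;
}
```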

omer-candan marked this pull request as ready for review on August 21, 2024.