Performance of PER codec #303

dudycz · 2024-04-10T12:05:33Z

dudycz
Apr 10, 2024

Hi. I have been playing with two most popular PER codecs: rasn and asn1-codecs and made some benchmark comparing their performance. One thing I have noticed is that rasn can be ~10x slower in encoding in some complex cases:

Codec	Encoding (µs)	Decoding (µs)
rasn	1916.7	826.8
asn1-codecs	208	145.5

I looked into flamegraphs and callgrinds but I couldn't figure out what contributes to this big difference. If you're interested I could try to collect some and attach here.

Link to repo with benchmark: https://github.com/dudycz/asn1_codecs_bench

XAMPPRocky · 2024-04-10T13:22:00Z

XAMPPRocky
Apr 10, 2024
Maintainer

Thank you for your issue! Would you be able to attach the flamegraphs? That would be very helpful. In general the initial version I made prioritised correctness over performance so there's likely a lot of places where it could be more efficient. I already know that I haven't added the empty struct optimisation that was added to BER to PER.

0 replies

dudycz · 2024-04-10T13:51:57Z

dudycz
Apr 10, 2024
Author

I had to encode sample.asn 256 times to get something recorded in flamegraph. Here it is:

0 replies

Nicceboy · 2024-04-14T04:02:08Z

Nicceboy
Apr 14, 2024

Seems like your test bench uses mostly integers. asn1-codecs crate seems to use i128 internally for Integer type while rasn uses BigInts. I would guess that this plays big factor at least.

I think there was a plan to make inner implementation of integer types as feature, so that it would be possible to select different internal type for performance reasons if very big numbers are not required.

0 replies

XAMPPRocky · 2024-04-14T05:14:19Z

XAMPPRocky
Apr 14, 2024
Maintainer

Yeah, that's still a todo, I should write up an issue giving details in case someone else who has more time is interested.

0 replies

Nicceboy · 2024-04-22T11:38:54Z

Nicceboy
Apr 22, 2024

Yeah, that's still a todo, I should write up an issue giving details in case someone else who has more time is interested.

I think this is rather important optimisation in general and should not take too much work to implement, if we have, for example, just the i128 and BigInts as initial options. If you have time to write what you had on your mind, maybe I can try to do it.

0 replies

Nicceboy · 2024-05-28T15:14:18Z

Nicceboy
May 28, 2024

I have been reworking integer type (by using primitives (i128) by default, and switching to larger ones on overflows, or if big one is created manually) Hopefully I can open draft PR in the end of week. The resulting integer will be enum, and the type of the big integers does not matter that much anymore.

Not sure if it is the best approach, but it is the one I chose after trying quite many different things. Let's leave those comments for the PR. It can be completely changed still.

Seems like integers are not the only problem with UPER. The extensive use of new vector buffers and moving this data contributes more. Default low capacity's in vectors, a lot single pushes and overall creation of new buffers instead of sharing pointer of one or reusing existing allocations slows down quite much.

Some initial differences on M2 Pro from the integer change:

UPER:

COER (The difference was much more impactful)

I have also made initial rework for optimising allocations in COER (the results below are based on the int remake) Maybe UPER will follow if I have time.

So by changing the integer type and reducing allocations, it was already possible to get 3x speedup at least for COER, based on the benchmark base of @dudycz .

Allocations could be improved further but I am having painful issues with lifetimes.

0 replies

repnop · 2024-06-14T14:48:05Z

repnop
Jun 14, 2024

@dudycz since #256 got merged a little while ago, are you able to rerun your benchmark, at least for the decoding side of things? it should be quite a bit better now.

0 replies

dudycz · 2024-06-14T20:06:01Z

dudycz
Jun 14, 2024
Author

Yes, I can confirm big improvement in decoding time (on my "bench" setup I observed decrease from 457us to 341us!). Good job!

0 replies

dudycz · 2024-08-27T22:04:14Z

dudycz
Aug 27, 2024
Author

Meanwhile I had updated my benchmark with third uper codec - asn1rs.

Codec	Encoding (µs)	Decoding (µs)
rasn	1759	259
asn1-codecs	187	122
asn1rs	72	65

0 replies

XAMPPRocky · 2024-08-28T07:13:43Z

XAMPPRocky
Aug 28, 2024
Maintainer

Separately I think we should add continuous profiling to the CI and I've created an issue for that. Since this issue doesn't have a specific goal/end, I'm going to move this to a discussion.

#302

0 replies

dudycz · 2024-10-07T09:52:35Z

dudycz
Oct 7, 2024
Author

FYI: I have made small improvement of this benchmark, by introducing automated asn1->rs code generation for each codec. So now it easier to update their versions.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

librasn

Performance of PER codec #303

{{title}}

Replies: 11 comments

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Select a reply

librasn

Performance of PER codec #303

dudycz Apr 10, 2024

Replies: 11 comments

XAMPPRocky Apr 10, 2024 Maintainer

dudycz Apr 10, 2024 Author

Nicceboy Apr 14, 2024

XAMPPRocky Apr 14, 2024 Maintainer

Nicceboy Apr 22, 2024

Nicceboy May 28, 2024

repnop Jun 14, 2024

dudycz Jun 14, 2024 Author

dudycz Aug 27, 2024 Author

XAMPPRocky Aug 28, 2024 Maintainer

dudycz Oct 7, 2024 Author

dudycz
Apr 10, 2024

XAMPPRocky
Apr 10, 2024
Maintainer

dudycz
Apr 10, 2024
Author

Nicceboy
Apr 14, 2024

XAMPPRocky
Apr 14, 2024
Maintainer

Nicceboy
Apr 22, 2024

Nicceboy
May 28, 2024

repnop
Jun 14, 2024

dudycz
Jun 14, 2024
Author

dudycz
Aug 27, 2024
Author

XAMPPRocky
Aug 28, 2024
Maintainer

dudycz
Oct 7, 2024
Author