
[BUG] Extreme latency in BulkIndexer #113

Open
dokterbob opened this issue Jun 2, 2022 · 7 comments
Labels
bug (Something isn't working), good first issue (Good for newcomers), performance (Make it fast!)

Comments

@dokterbob

What is the bug?
With a BulkIndexer configured with 2 workers, I am getting unexpected latency on BulkIndexer.Add(). Somehow the workers are not consuming the queue within any reasonable timeframe; I'm seeing delays of over 20 s!

For example, in the last hour I've seen 53 cases of >1 s latency on just Add(), out of a total of 174 calls.

How can one reproduce the bug?
Run with 2 workers, adding items from different goroutines, against a relatively busy search cluster.

What is the expected behavior?
Sub-millisecond latencies, basically the time it takes to shove something into a channel.

What is your host/environment?

  • OS: Ubuntu 20.04
  • Version: 1.1.0 (but nothing has changed in the BulkIndexer since the fork from ES)

Do you have any screenshots?
[screenshot attached]

@dokterbob dokterbob added the bug Something isn't working label Jun 2, 2022
@dokterbob
Author

dokterbob commented Jun 2, 2022

Note: returning to the default of numCPU workers seems to alleviate the issue, but given the significant delays (seconds versus sub-millisecond) I would still strongly argue that there is an underlying issue here. At the very least, I would suggest documenting this unexpected behaviour.

Please see the difference below:
[screenshot attached]

The issue seems reduced but it is still occurring!
[screenshot attached]

CPU load on this server is around 10% and the load average is around 4. About 10% of Add() calls still take >1 s.

@dokterbob dokterbob changed the title [BUG] Race condition in BulkIndexer [BUG] Extreme latency in BulkIndexer Jun 2, 2022
@dokterbob
Author

@VijayanB @VachaShah Any ideas?

@dokterbob
Author

Poke!

@dblock
Member

dblock commented Jul 11, 2022

@dokterbob This looks visibly problematic, but it doesn't look like the folks here have gotten around to looking into it. Let's try to move this forward. First, what's the easiest way to reproduce this (maybe post code similar to the benchmarks in this project)? Second, are you able to bulk-load data into this instance a lot faster via other mechanisms (i.e. is this a client issue for sure)?

@APoolio

APoolio commented Aug 5, 2022

@dokterbob Do you mind posting some of the code you were using to help pinpoint this issue?

@dokterbob
Author

Sorry, I didn't see the messages. Code is https://github.com/ipfs-search/ipfs-search/ but of course you'll need a more detailed test case.

After increasing the workers it seems the problem has become less severe. Now that it's been picked up I'll see if I can get more concrete feedback over the next couple of weeks.

@wbeckler wbeckler added the good first issue Good for newcomers label Jan 18, 2023
@zethuman
Contributor

zethuman commented Apr 4, 2023

@dokterbob Hey, I'd like to work on solving this issue. Is it still relevant?
Are there any remaining problems, or did this turn out to be on the client side?

@dblock dblock added the performance Make it fast! label Dec 5, 2023
Projects
None yet
Development

No branches or pull requests

5 participants