Skip to content
This repository has been archived by the owner on Aug 2, 2022. It is now read-only.

Add missing upstream rca node for heap health decider #475

Merged
merged 1 commit into from
Oct 16, 2020

Conversation

ktkrg
Copy link
Contributor

@ktkrg ktkrg commented Oct 16, 2020

Fixes #:
#476
Description of changes:
Adds LargeHeapClusterRca - the cluster level RCA for detecting heap size increase opportunities - as an upstream to the HeapHealthDecider.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

@ktkrg ktkrg self-assigned this Oct 16, 2020
@ktkrg ktkrg requested review from sidheart and yojs October 16, 2020 19:28
@@ -200,7 +199,7 @@ public void construct() {
HeapHealthDecider heapHealthDecider = new HeapHealthDecider(RCA_PERIOD, highHeapUsageClusterRca,
largeHeapClusterRca);
heapHealthDecider.addTag(TAG_LOCUS, LOCUS_MASTER_NODE);
heapHealthDecider.addAllUpstreams(Collections.singletonList(highHeapUsageClusterRca));
heapHealthDecider.addAllUpstreams(Arrays.asList(highHeapUsageClusterRca, largeHeapClusterRca));
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Did we add decider integ tests? How did they pass without this?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No, we don't have decider integ tests yet.

How did they pass without this?

Not having it in the upstream list won't necessarily fail the integration test even if we had one. The scheduler can at worst get the scheduling wrong and have the decider run before the RCA(because they'll be executed in parallel based on the distance from the leaf nodes). This causes race conditions which may or may not fail in the integ test. This upstream dependency was present in the 128G PR: #439 though, so this wouldn't have been caught then. Maybe at/after merge?

@sidheart sidheart merged commit 4c78b48 into master Oct 16, 2020
@ktkrg ktkrg deleted the ktkrg-missing-upstream branch October 16, 2020 20:56
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants