MQE: improve runtime of mixed metrics tests #10678

charleskorn · 2025-02-18T00:52:47Z

What this PR does

This PR makes some improvements to the mixed metrics tests (aka the gauntlet), with the aim of preserving coverage while dramatically reducing how long they take to run.

On my machine, running all of the tests in engine_test.go took 3m35s before these changes, and now takes 45s.

I've tried to build up these changes incrementally, I suggest reviewing each commit separately.

Which issue(s) this PR fixes or relates to

(none)

Checklist

[n/a] Tests updated.
[n/a] Documentation added.
[n/a] CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX].
[n/a] about-versioning.md updated with experimental features.

…ime over multiple series at a time

…rs, and only run with two series rather than up to four

charleskorn · 2025-02-18T00:56:06Z

pkg/streamingpromql/engine_test.go

-	// Generate combinations of 2, 3, and 4 labels. (e.g., "a,b", "e,f", "c,d,e", "a,b,c,d", "c,d,e,f" etc)
+	// Generate combinations of 2 labels. (e.g., "a,b", "e,f" etc)
 	labelCombinations = append(labelCombinations, testutils.Combinations(labelsToUse, 2)...)
-	labelCombinations = append(labelCombinations, testutils.Combinations(labelsToUse, 3)...)
-	labelCombinations = append(labelCombinations, testutils.Combinations(labelsToUse, 4)...)



Rationale: these functions all operate a series at a time, and any issue working with multiple series is likely to be surfaced running with two series.

that's fair for non aggregations

charleskorn · 2025-02-18T00:57:05Z

pkg/streamingpromql/engine_test.go

@@ -2981,8 +2977,6 @@ func TestCompareVariousMixedMetricsVectorSelectors(t *testing.T) {
 		for _, function := range []string{"rate", "increase", "changes", "resets", "deriv", "irate", "idelta", "delta", "deriv", "stddev_over_time", "stdvar_over_time"} {
 			expressions = append(expressions, fmt.Sprintf(`%s(series{label=~"(%s)"}[45s])`, function, labelRegex))
 			expressions = append(expressions, fmt.Sprintf(`%s(series{label=~"(%s)"}[1m])`, function, labelRegex))
-			expressions = append(expressions, fmt.Sprintf(`sum(%s(series{label=~"(%s)"}[2m15s]))`, function, labelRegex))


Rationale: we already extensively test sum in TestCompareVariousMixedMetricsAggregations.

I would like to keep these. They test reusing pools that have been used by the range-vector function.

Do we need both sum test cases?

Don't forget we also have TestConcurrentQueries that should cover pooled slices being reused as well.

one is probably sufficient. I'd suggest keeping this one (2m15s).

Done, I've restored that in c4b2c57.

charleskorn · 2025-02-18T00:57:15Z

pkg/streamingpromql/engine_test.go

-	// Generate combinations of 2, 3, and 4 labels. (e.g., "a,b", "e,f", "c,d,e", "a,b,c,d", "c,d,e,f" etc)
+	// Generate combinations of 2 labels. (e.g., "a,b", "e,f" etc)
 	labelCombinations = append(labelCombinations, testutils.Combinations(labelsToUse, 2)...)
-	labelCombinations = append(labelCombinations, testutils.Combinations(labelsToUse, 3)...)
-	labelCombinations = append(labelCombinations, testutils.Combinations(labelsToUse, 4)...)


Rationale: these functions all operate a series at a time, and any issue working with multiple series is likely to be surfaced running with two series.

jhesketh

Thanks for that. I'm in support of most of these changes, but I would like to keep the range vector tests being passed through an aggregation personally.

jhesketh

Great, thank you :-)

* Make gauntlet tests ~3x faster by not using `t.Run()` * Don't bother running functions that operate over single series at a time over multiple series at a time * Don't run `sum` over functions that operate over range vector selectors, and only run with two series rather than up to four * Restore one `sum` over range vector function test case

charleskorn added 3 commits February 18, 2025 11:42

Make gauntlet tests ~3x faster by not using t.Run()

454c356

Don't bother running functions that operate over single series at a t…

06ebdbb

…ime over multiple series at a time

Don't run sum over functions that operate over range vector selecto…

b94d4fb

…rs, and only run with two series rather than up to four

charleskorn commented Feb 18, 2025

View reviewed changes

jhesketh reviewed Feb 18, 2025

View reviewed changes

charleskorn marked this pull request as ready for review February 19, 2025 02:10

charleskorn requested a review from a team as a code owner February 19, 2025 02:10

Restore one sum over range vector function test case

c4b2c57

jhesketh approved these changes Feb 19, 2025

View reviewed changes

charleskorn merged commit faec99e into main Feb 19, 2025
28 checks passed

charleskorn deleted the charleskorn/mqe-gauntlet branch February 19, 2025 04:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MQE: improve runtime of mixed metrics tests #10678

MQE: improve runtime of mixed metrics tests #10678

charleskorn commented Feb 18, 2025

charleskorn Feb 18, 2025

jhesketh Feb 18, 2025

charleskorn Feb 18, 2025

jhesketh Feb 18, 2025

charleskorn Feb 18, 2025

jhesketh Feb 19, 2025

charleskorn Feb 19, 2025

charleskorn Feb 18, 2025

jhesketh left a comment

jhesketh left a comment

MQE: improve runtime of mixed metrics tests #10678

MQE: improve runtime of mixed metrics tests #10678

Conversation

charleskorn commented Feb 18, 2025

What this PR does

Which issue(s) this PR fixes or relates to

Checklist

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jhesketh left a comment

Choose a reason for hiding this comment

jhesketh left a comment

Choose a reason for hiding this comment