planner: use ordered index with is null predicate | tidb-test=pr/2368 #54253

ari-e · 2024-06-26T20:43:57Z

What problem does this PR solve?

Issue Number: close #54188

Problem Summary: Properly classify columns with null predicates (e.g. WHERE a IS NULL) as constant so index selection can take advantage of that to satisfy a sort in

tidb/pkg/planner/core/find_best_task.go

Line 791 in 432bb79

if path.ConstCols == nil || i >= len(path.ConstCols) || !path.ConstCols[i] {

.

What changed and how does it work?

Added checks for is null predicate during planning and fill corresponding data structures marking that column as a constant.

Check List

Tests

Unit test
Integration test
Manual test (add detailed scripts or steps below)
No need to test
- I checked and no code files have been changed.

Side effects

Performance regression: Consumes more CPU
Performance regression: Consumes more Memory
Breaking backward compatibility

Documentation

Release note

Please refer to Release Notes Language Style Guide to write a quality release note.

None

ti-chi-bot · 2024-06-26T20:44:09Z

Welcome @ari-e!

It looks like this is your first PR to pingcap/tidb 🎉.

I'm the bot to help you request reviewers, add labels and more, See available commands.

We want to make sure your contribution gets all the attention it needs!

Thank you, and welcome to pingcap/tidb. 😃

ti-chi-bot · 2024-06-26T20:44:09Z

Hi @ari-e. Thanks for your PR.

I'm waiting for a pingcap member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

tiprow · 2024-06-26T20:44:18Z

Hi @ari-e. Thanks for your PR.

PRs from untrusted users cannot be marked as trusted with /ok-to-test in this repo meaning untrusted PR authors can never trigger tests themselves. Collaborators can still trigger tests on the PR using /test all.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

hawkingrei · 2024-06-27T00:00:44Z

/ok-to-test

codecov · 2024-06-27T00:20:40Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 55.9786%. Comparing base (c361587) to head (e71d8f2).

Additional details and impacted files

@@                Coverage Diff                @@
##             master     #54253         +/-   ##
=================================================
- Coverage   74.8413%   55.9786%   -18.8628%     
=================================================
  Files          1555       1676        +121     
  Lines        363239     609792     +246553     
=================================================
+ Hits         271853     341353      +69500     
- Misses        71804     245136     +173332     
- Partials      19582      23303       +3721

Flag	Coverage Δ
integration	`37.0598% <100.0000%> (?)`
unit	`71.7446% <100.0000%> (-2.0380%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Components	Coverage Δ
dumpling	`52.9656% <ø> (-2.2339%)`	⬇️
parser	`∅ <ø> (∅)`
br	`52.5929% <ø> (+4.8890%)`	⬆️

hawkingrei · 2024-06-27T03:28:15Z

pkg/planner/core/casetest/index/index_test.go

@@ -148,3 +148,15 @@ func TestRowFunctionMatchTheIndexRangeScan(t *testing.T) {
 		tk.MustQuery(tt).Sort().Check(testkit.Rows(output[i].Result...))


Please run the make bazel_prepare and upload the changement.

@hawkingrei thanks! unfortunately @ari-e is out for a bit and I don't have permissions to update this PR, so I've addressed the comment separately in #54290

mzhang77 · 2024-06-28T21:33:55Z

Please ignore this PR and review #54290 instead.

ari-e · 2024-07-12T00:33:42Z

/retest

ari-e · 2024-07-12T17:36:10Z

Picked this PR back up now that I'm back from vacation. So please consider this one and we'll abandon #54290 #54512.

I've fixed the unit test failures and bazel build issues. Now the 3 issues left are failed integration tests. I suspect though that these are failing because the plan output is legitimately different with my change. The outputs of those 3 tests are:

run test [util/ranger] err: sql:explain format='brief' select * from t where a = 1 and (b is null or b = 2) and c > 1;: failed to run query
"explain format='brief' select * from t where a = 1 and (b is null or b = 2) and c > 1;"
 around line 176,
we need(316):
explain format='brief' select * from t where a = 1 and (b is null or b = 2) and c > 1;
id      estRows task    access object   operator info
IndexReader     0.07    root            index:Selection
└─Selection     0.07    cop[tikv]               gt(util__ranger.t.c, 1)
  └─IndexRangeScan      0.20    cop[tikv]       table:t, index:a(a, b, c)       range:[1 NULL,1 NULL], [1
but got(316):
explain format='brief' select * from t where a = 1 and (b is null or b = 2) and c > 1;
id      estRows task    access object   operator info
IndexReader     0.67    root            index:IndexRangeScan
└─IndexRangeScan        0.67    cop[tikv]       table:t, index:a(a, b, c)       range:(1 NULL 1,1 NULL +inf], (1 2 1,1 2 +inf], keep order:false, stats:pseudo

run test [util/ranger] err: sql:explain format='brief' select * from t where a = 1 and (b is null or b = 2) and c > 1;: failed to run query
"explain format='brief' select * from t where a = 1 and (b is null or b = 2) and c > 1;"
 around line 176,
we need(316):
explain format='brief' select * from t where a = 1 and (b is null or b = 2) and c > 1;
id      estRows task    access object   operator info
IndexReader     0.07    root            index:Selection
└─Selection     0.07    cop[tikv]               gt(util__ranger.t.c, 1)
  └─IndexRangeScan      0.20    cop[tikv]       table:t, index:a(a, b, c)       range:[1 NULL,1 NULL], [1
but got(316):
explain format='brief' select * from t where a = 1 and (b is null or b = 2) and c > 1;
id      estRows task    access object   operator info
IndexReader     0.67    root            index:IndexRangeScan
└─IndexRangeScan        0.67    cop[tikv]       table:t, index:a(a, b, c)       range:(1 NULL 1,1 NULL +inf], (1 2 1,1 2 +inf], keep order:false, stats:pseudo

run test [func_group] err: sql:explain format="brief"
select max(a3) from t1 where a2 is null;: failed to run query
"explain format="brief"
select max(a3) from t1 where a2 is null;"
 around line 330,
we need(421):
explain format="brief"
select max(a3) from t1 where a2 is null;
id      estRows task    access object   operator info
StreamAgg       1.00    root            funcs:max(func_group.t1.a3)->Column#7
└─TopN  1.00    root            func_group.t1.a3:desc, offset:0, count:1
  └─IndexReader 1.00    root            index:TopN
    └─TopN      1.00    cop[tikv]               func_group.t1.a3:desc, offset:0, count:1
      └─IndexRangeScan  2.00    cop[tikv]       table:t1, index:k1(a2, a3)      range:[N
but got(421):
explain format="brief"
select max(a3) from t1 where a2 is null;
id      estRows task    access object   operator info
StreamAgg       1.00    root            funcs:max(func_group.t1.a3)->Column#7
└─Limit 1.00    root            offset:0, count:1
  └─IndexReader 1.00    root            index:Limit
    └─Limit     1.00    cop[tikv]               offset:0, count:1
      └─IndexRangeScan  1.00    cop[tikv]       table:t1, index:k1(a2, a3)      range:[NULL -inf,NULL +inf], keep order:true, desc

@hawkingrei can you provide any guidance for whether these seem like legitimate cases where the integration test needs to be updated, or whether my PR needs to change to not affect these query plans?

hawkingrei · 2024-07-17T12:47:08Z

@ari-e LGTM

But you should update the result in the tests/integrationtest.

for example

cd tests/integrationtest
./run-tests.sh -r executor/merge_join

ari-e · 2024-07-17T19:05:58Z

@hawkingrei I fixed the integration test in tests/integrationtest but it looks like there's some test external to this repo that is being run that is still failing called func_group which is part of ghpr_mysql_test. Do you know where that is defined? Jenkins log

pkg/util/ranger/detacher.go

ari-e · 2024-07-18T17:06:24Z

/retest

hawkingrei · 2024-07-19T16:50:46Z

/retest

hawkingrei · 2024-07-19T16:51:16Z

@ari-e Please sync code with master.

winoros · 2024-07-19T17:01:09Z

/retest

hawkingrei · 2024-07-19T17:09:26Z

/test all

ari-e · 2024-07-19T17:30:29Z

Rebased on master @hawkingrei @winoros

winoros · 2024-07-19T17:34:10Z

The idc-jenkins-ci-tidb/mysql-test passed. I think your pr should have passed all tests now. Other failures are not caused by your main code changes.

ti-chi-bot · 2024-07-19T18:41:50Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: hawkingrei, winoros

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [hawkingrei,winoros]
~~pkg/planner/OWNERS~~ [hawkingrei,winoros]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

ti-chi-bot · 2024-07-19T18:41:53Z

[LGTM Timeline notifier]

Timeline:

2024-07-17 12:42:49.389601575 +0000 UTC m=+444191.380543045: ☑️ agreed by hawkingrei.
2024-07-19 18:41:52.244257202 +0000 UTC m=+638534.235198671: ☑️ agreed by winoros.

ti-chi-bot · 2024-07-22T05:12:31Z

In response to a cherrypick label: new pull request created to branch release-7.5: #54788.

Signed-off-by: ti-chi-bot <[email protected]>

ti-chi-bot bot added ok-to-test Indicates a PR is ready to be tested. and removed needs-ok-to-test Indicates a PR created by contributors and need ORG member send '/ok-to-test' to start testing. labels Jun 27, 2024

hawkingrei self-requested a review June 27, 2024 00:00

hawkingrei reviewed Jun 27, 2024

View reviewed changes

michaelmdeng mentioned this pull request Jun 27, 2024

planner: use ordered index with is null predicate #54290

Closed

13 tasks

ti-chi-bot bot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jul 5, 2024

HaoW30 mentioned this pull request Jul 8, 2024

planner: use ordered index with is null predicate #54512

Closed

13 tasks

ari-e force-pushed the ari--use-ordered-index-with-is-null branch 2 times, most recently from 031c2c5 to fd9d15a Compare July 11, 2024 19:52

ti-chi-bot bot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jul 11, 2024

ari-e force-pushed the ari--use-ordered-index-with-is-null branch 2 times, most recently from 9653ec3 to 01a7e18 Compare July 12, 2024 00:15

hawkingrei approved these changes Jul 17, 2024

View reviewed changes

ti-chi-bot bot added approved needs-1-more-lgtm Indicates a PR needs 1 more LGTM. labels Jul 17, 2024

hawkingrei self-requested a review July 17, 2024 12:45

ari-e force-pushed the ari--use-ordered-index-with-is-null branch from 01a7e18 to d81df44 Compare July 17, 2024 16:42

winoros reviewed Jul 18, 2024

View reviewed changes

pkg/util/ranger/detacher.go Show resolved Hide resolved

ari-e force-pushed the ari--use-ordered-index-with-is-null branch from d81df44 to eb7d420 Compare July 18, 2024 16:55

ari-e force-pushed the ari--use-ordered-index-with-is-null branch from eb7d420 to 3543c79 Compare July 19, 2024 16:18

hawkingrei changed the title ~~planner: use ordered index with is null predicate~~ planner: use ordered index with is null predicate | tidb-test=pr/2368 Jul 19, 2024

Use ordered index with is null predicate

e71d8f2

ari-e force-pushed the ari--use-ordered-index-with-is-null branch from 3543c79 to e71d8f2 Compare July 19, 2024 17:30

winoros approved these changes Jul 19, 2024

View reviewed changes

ti-chi-bot bot added lgtm and removed needs-1-more-lgtm Indicates a PR needs 1 more LGTM. labels Jul 19, 2024

ti-chi-bot bot merged commit 41ed0e5 into pingcap:master Jul 19, 2024

ti-chi-bot added the needs-cherry-pick-release-7.5 Should cherry pick this PR to release-7.5 branch. label Jul 22, 2024

ti-chi-bot pushed a commit to ti-chi-bot/tidb that referenced this pull request Jul 22, 2024

This is an automated cherry-pick of pingcap#54253

b5827c5

Signed-off-by: ti-chi-bot <[email protected]>

ti-chi-bot mentioned this pull request Jul 22, 2024

planner: use ordered index with is null predicate | tidb-test=pr/2368 (#54253) #54788

Closed

13 tasks

ti-chi-bot bot removed the needs-cherry-pick-release-7.5 Should cherry pick this PR to release-7.5 branch. label Sep 25, 2024

		@@ -148,3 +148,15 @@ func TestRowFunctionMatchTheIndexRangeScan(t *testing.T) {
		tk.MustQuery(tt).Sort().Check(testkit.Rows(output[i].Result...))

planner: use ordered index with is null predicate | tidb-test=pr/2368 #54253

planner: use ordered index with is null predicate | tidb-test=pr/2368 #54253

Uh oh!

Conversation

ari-e commented Jun 26, 2024

What problem does this PR solve?

What changed and how does it work?

Check List

Release note

Uh oh!

ti-chi-bot bot commented Jun 26, 2024

Uh oh!

ti-chi-bot bot commented Jun 26, 2024

Uh oh!

tiprow bot commented Jun 26, 2024

Uh oh!

hawkingrei commented Jun 27, 2024

Uh oh!

codecov bot commented Jun 27, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

hawkingrei Jun 27, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

michaelmdeng Jun 27, 2024

Choose a reason for hiding this comment

Uh oh!

mzhang77 commented Jun 28, 2024

Uh oh!

ari-e commented Jul 12, 2024

Uh oh!

ari-e commented Jul 12, 2024

Uh oh!

hawkingrei commented Jul 17, 2024

Uh oh!

ari-e commented Jul 17, 2024

Uh oh!

Uh oh!

ari-e commented Jul 18, 2024

Uh oh!

hawkingrei commented Jul 19, 2024

Uh oh!

hawkingrei commented Jul 19, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

winoros commented Jul 19, 2024

Uh oh!

hawkingrei commented Jul 19, 2024

Uh oh!

ari-e commented Jul 19, 2024

Uh oh!

winoros commented Jul 19, 2024

Uh oh!

ti-chi-bot bot commented Jul 19, 2024

Uh oh!

ti-chi-bot bot commented Jul 19, 2024

[LGTM Timeline notifier]

Uh oh!

ti-chi-bot commented Jul 22, 2024

Uh oh!

Uh oh!

codecov bot commented Jun 27, 2024 •

edited

Loading

hawkingrei Jun 27, 2024 •

edited

Loading

hawkingrei commented Jul 19, 2024 •

edited

Loading