bpo-31324: Optimize support._match_test(). by serhiy-storchaka · Pull Request #4420 · python/cpython

serhiy-storchaka · 2017-11-16T14:35:24Z

https://bugs.python.org/issue31324

vstinner

Apart the wrong re.I flag, LGTM.

I tested manually your PR: it works as expected.
https://bugs.python.org/issue31324#msg306362

vstinner · 2017-11-16T14:41:00Z

        return True
+    if match_tests != _match_tests:
+        match_tests_re = '|'.join(map(fnmatch.translate, match_tests))
+        _match_test1 = re.compile(match_tests_re, re.I).match


Hum no, re.I must no be used: the match is expected to be case sensitive, not case insensitve.

vstinner · 2017-11-16T14:48:42Z

-    return False
+    if _match_test1(test_id):
+        return True
+    return any(map(_match_test1, test_id.split(".")))


If all match_tests patterns contain a dot ".", we could make this code even simpler:

return match_test(test_id)

But I'm unable to see a major performance difference when running:

./python -m test.bisect --fail-env-changed -o bisect test_asyncio -v

I'm checking how much time it takes before running the first test: so the time to load and filter tests.

vstinner · 2017-11-16T15:20:11Z

support._match_test() has at least a complexity of O(n) where n is the number of patterns in support.match_tests, just to check if support.match_tests was modified :-(

I proposed PR #4421 which replaces support.match_tests global variable with a new support.set_match_tests() function to cache the "matcher" object and avoid the need of checking if patterns changed. Moreover, I implemented a very simple matcher using set() for the test.bisect use case: all patterns are "full identifiers".

Using "./python -m test.bisect --fail-env-changed -o bisect test_asyncio -v" as a "benchmark": I see the first test running faster with my PR than using this PR.

I guess that the problem bottleneck of this PR is the minimum O(n) cost because by mutable support.mtach_tests, whereas my benchmark uses 768 patterns.

vstinner · 2017-11-21T23:34:57Z

I merged PR #4421 which is based on this PR but goes further in term of optimization and adds news tests.

bpo-31324: Optimize support._match_test().

0f49a44

serhiy-storchaka added needs backport to 2.7 skip news type-feature A feature request or enhancement tests Tests in the Lib/test dir labels Nov 16, 2017

serhiy-storchaka requested a review from vstinner November 16, 2017 14:35

the-knights-who-say-ni added the CLA signed label Nov 16, 2017

bedevere-bot added the awaiting merge label Nov 16, 2017

vstinner reviewed Nov 16, 2017

View reviewed changes

No re.I flag.

75990b1

vstinner reviewed Nov 16, 2017

View reviewed changes

vstinner closed this Nov 21, 2017

serhiy-storchaka deleted the faster-match_test branch November 25, 2017 15:56

serhiy-storchaka removed needs backport to 2.7 labels Apr 1, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

bpo-31324: Optimize support._match_test().#4420

bpo-31324: Optimize support._match_test().#4420
serhiy-storchaka wants to merge 2 commits intopython:masterfrom
serhiy-storchaka:faster-match_test

serhiy-storchaka commented Nov 16, 2017 •

edited by bedevere-bot

Loading

Uh oh!

vstinner left a comment

Uh oh!

vstinner Nov 16, 2017

Uh oh!

vstinner Nov 16, 2017

Uh oh!

vstinner commented Nov 16, 2017

Uh oh!

vstinner commented Nov 21, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

serhiy-storchaka commented Nov 16, 2017 • edited by bedevere-bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vstinner left a comment

Choose a reason for hiding this comment

Uh oh!

vstinner Nov 16, 2017

Choose a reason for hiding this comment

Uh oh!

vstinner Nov 16, 2017

Choose a reason for hiding this comment

Uh oh!

vstinner commented Nov 16, 2017

Uh oh!

vstinner commented Nov 21, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

serhiy-storchaka commented Nov 16, 2017 •

edited by bedevere-bot

Loading