
benchmark| Potential solution for performance regressions #3473

Conversation

@doublethefish (Contributor) commented Apr 3, 2020

Steps

  • (na) Add yourself to CONTRIBUTORS if you are a new contributor.
  • (na) Add a ChangeLog entry describing what your PR does.
  • (na) If it's a new feature or an important bug fix, add a What's New entry in doc/whatsnew/<current release.rst>.
  • Write a good description on what the PR does.

Description

These commits generate comparable performance data for a run of benchmarked code.

This is a potential first step towards establishing performance regression tests. Later work would be to actually perform the comparison against the target branch somehow.

We use pytest-benchmark here, rather than asv, because pytest-benchmark was easier to integrate into the existing tests.

Use-case

A use-case for this functionality is:

  1. Write a good benchmark test (a minimal sketch is shown after this list):
  • See tests/benchmark/test_baseline_benchmarks.py for an example.
  2. Run tox -e benchmark:
  • This generates benchmark .json files in the .tox directory.
  3. Apply a change that you suspect makes the code faster.
  4. Re-run tox -e benchmark:
  • This generates another benchmark .json file, probably 0002-something.json.
  5. Compare stats:
  • In .tox/, run py.test-benchmark compare 0001 0002. The numbers are used as a glob to find matching result files; use the numbers that refer to the tox -e benchmark runs you want to compare.
  6. Look for statistically significant differences. You will need to use your common sense here.
  7. Profit.
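
To make step 1 concrete, here is a minimal pytest-benchmark test in the same spirit as the benchmarks in this PR. It is an illustrative sketch only, not code from the PR: the test name, the run_pylint helper, the --disable=all argument, and the use of pytest's tmp_path fixture are all assumptions; the real benchmarks live in tests/benchmark/test_baseline_benchmarks.py.

```python
# Illustrative sketch only; the real benchmarks are in
# tests/benchmark/test_baseline_benchmarks.py.
import pytest

from pylint.lint import Run


def run_pylint(args):
    """Run pylint once, swallowing the SystemExit it raises by default."""
    try:
        Run(args)
    except SystemExit:
        pass


@pytest.mark.benchmark(group="baseline")
def test_baseline_many_empty_files(benchmark, tmp_path):
    """Time a full pylint run over a batch of empty modules."""
    files = [tmp_path / "empty_{}.py".format(i) for i in range(20)]
    for path in files:
        path.write_text("")

    # pytest-benchmark calls the target repeatedly and records the timings
    # that end up in the .json files read by `py.test-benchmark compare`.
    benchmark(run_pylint, ["--disable=all"] + [str(p) for p in files])
```

Running tox -e benchmark then picks such a test up along with the others and writes the numbered .json results file that the compare step in point 5 consumes.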

Baseline benchmark for pylint

The baseline benchmark in this PR attempts to benchmark basic runs (a rough sketch of the kind of checker used follows the list):

  • with no files
  • a single empty file
  • lots of empty files
  • a single empty checker
  • many empty checkers
  • a checker that mimics doing expensive work
  • each run both single-process and across multiple workers
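
For illustration, the "expensive work" case might use a checker along these lines. This is a rough sketch rather than the code in this PR (the class name, message id, and sleep duration are made up), and it assumes the pylint 2.x checker API that was current at the time (BaseChecker, IRawChecker, __implements__); the real implementations are in tests/benchmark/test_baseline_benchmarks.py.

```python
# Rough sketch of a checker that mimics expensive per-module work;
# see tests/benchmark/test_baseline_benchmarks.py for the real thing.
import time

from pylint.checkers import BaseChecker
from pylint.interfaces import IRawChecker


class SleepingChecker(BaseChecker):
    """Mimics an expensive checker by sleeping for every module processed."""

    __implements__ = (IRawChecker,)
    name = "sleeper"
    msgs = {
        "R9999": ("Benchmark message", "sleeper-message", "Used only by the benchmarks.")
    }

    def process_module(self, node):
        time.sleep(0.01)  # stand-in for expensive per-module work
```

A benchmark would then register the checker on the linter (for example with linter.register_checker(SleepingChecker(linter))) before timing a run, both with -j1 and with multiple workers.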

Type of Changes

  • 🔨 Refactoring (support of)
  • 📜 Docs (new performance metrics)

Related Issue

@coveralls commented Apr 3, 2020

Coverage remained the same at 90.449% when pulling ce02b98 on doublethefish:chore/1954/Add_performance_regressions into be5a61b on PyCQA:master.

@Pierre-Sassoulas (Member) left a comment
This is a much-needed addition to be able to improve performance. I know it's a draft, but I reviewed some things; let me know what you think.

Review threads (resolved): tests/benchmark/test_baseline_benchmarks.py; tests/test_functional.py (outdated); tests/unittest_checker_similar.py (outdated); tests/test_self.py (outdated)
@doublethefish (Contributor, Author) commented

PR #3498 is a slice of this PR.

@doublethefish doublethefish changed the title Chore/1954/add performance regressions benchmark| Potential solution for performance regressions Apr 21, 2020
@doublethefish doublethefish force-pushed the chore/1954/Add_performance_regressions branch 2 times, most recently from 8c57c1c to 2337556 on April 21, 2020 09:29
Here we establish baseline benchmarks for the system when used in a minimal way.

Here we just confirm that -j1 vs -jN gives some boost in performance in simple situations, establishing a baseline for other benchmarks.
@doublethefish doublethefish force-pushed the chore/1954/Add_performance_regressions branch from 2337556 to 5b23a24 on April 21, 2020 10:24
@doublethefish (Contributor, Author) commented

Having used this code in anger, this PR has been updated to include only the most useful change, and I think it should go in.

I have removed some of the noise. The files that have been removed muddied the waters:

  • The changes to tests/test_self.py and tests/test_functional.py were a "let's see what happens" commit, and it is very debatable whether those changes are useful.
    • This is because of how fast most of those tests are, and because we're not aiming for high-performance computing.
    • There are performance wins to be made, especially in test_functional.py, but those should be targeted with specific benchmarks.
  • Whilst the changes to tests/unittest_checker_similar.py are moderately useful, they can wait until we establish that this is the right way to do benchmarking.

@doublethefish doublethefish marked this pull request as ready for review April 21, 2020 11:20