ci: Add numpydoc pre-commit hook #1371

auguste-probabl · 2025-02-28T10:22:23Z

Only files in skore/src are checked
Some lints are ignored
- Even then, there remains a lot of work to make numpydoc happy
- I didn't ignore the "summary line should be on a new line" rule, but a summary on the same line as the opening quotes is actually PEP 257-compliant, so we could ignore that rule too

Replaced '(\s*)"""(.*)([^"])$' with '$1"""\n$1$2$3' using repgrep
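For readers without repgrep, here is a rough Python equivalent of that substitution. It is only a sketch: the function name `move_summary_to_new_line` is hypothetical, the pattern is applied line by line (as a grep-style tool would), and edge cases such as prefixed strings (`r"""`) are not handled.

```python
import re

# Same pattern as in the PR description: an opening triple quote followed by
# text on the same line, where the line does not end with a quote character
# (so one-line docstrings and bare closing quotes are left alone).
PATTERN = re.compile(r'(\s*)"""(.*)([^"])$')


def move_summary_to_new_line(source: str) -> str:
    """Move the text after an opening triple quote onto its own line."""
    rewritten = []
    for line in source.split("\n"):
        # \1 is the indentation, reused for the new summary line.
        rewritten.append(PATTERN.sub(r'\1"""\n\1\2\3', line, count=1))
    return "\n".join(rewritten)


if __name__ == "__main__":
    sample = 'def f():\n    """Do something.\n\n    More details.\n    """\n    return 1\n'
    print(move_summary_to_new_line(sample))
```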

thomass-dev · 2025-02-28T10:30:40Z

Surprised that it isn't already effective/covered by the pyproject.toml section

[tool.ruff.lint.pydocstyle]
convention = "numpy"

Do you know why?

github-actions · 2025-02-28T10:38:56Z

Documentation preview @ 96045fb

auguste-probabl · 2025-02-28T10:39:30Z

> Surprised that it isn't already effective/covered by the pyproject.toml section

It kind of is, but not completely; see the failed CI run for examples of things that aren't checked by ruff. There is an open issue in ruff to cover all of numpydoc: astral-sh/ruff#8425.

See also:

All docs-related ruff checks: https://docs.astral.sh/ruff/rules/#pydocstyle-d
What convention = "numpy" does: https://docs.astral.sh/ruff/faq/#does-ruff-support-numpy-or-google-style-docstrings

auguste-probabl · 2025-02-28T16:00:07Z

Personally I don't think that it's worth the trouble right now.

glemaitre · 2025-03-01T12:22:58Z

skore/pyproject.toml

+  "ES01",  # No "extended summary" section
+  "EX01",  # No "Examples" section
+  "SA01",  # No "See also" section
+]


We have those filters in scikit-learn: https://github.com/scikit-learn/scikit-learn/blob/fef620292973dd25ca206c8bbdff194771c857fc/sklearn/tests/test_docstrings.py#L45

glemaitre · 2025-03-01T12:23:27Z

skore/pyproject.toml

+  "ES01",  # No "extended summary" section
+  "EX01",  # No "Examples" section
+  "SA01",  # No "See also" section


Those ones are actually quite important for good documentation.

glemaitre · 2025-03-01T12:32:38Z

> Personally I don't think that it's worth the trouble right now.

Right now, apart from the extended summary, examples, and "see also" sections, which I find important to have, we only have 3 failures, related to the rendering of the help, which we could skip if we see that we cannot support them properly.

In terms of rules, there is a set that we skip in scikit-learn because numpydoc is a bit too picky about things that do not necessarily improve the documentation.

Regarding the "necessity", I think that we should have a user guide and proper docstrings pretty soon. Otherwise, it is already a project smell when you don't have clear documentation explaining in words why a project is useful (a getting_started is not enough). So I think that we should still fix those issues along the way rather than postpone them release after release.

auguste-probabl · 2025-03-03T11:26:50Z

Update with different settings. If we don't ignore "extended summary", "example" or "see also", but we do ignore "summary should be on a different line", we get ~2000 lints. Most of them are superfluous IMO.

Yeah the fine-grained approach that sklearn uses sounds like a good start because we can spread out the change over time, so that we don't have to keep a PR open while we fix 2000 errors.
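A minimal sketch of what such fine-grained filtering could look like, assuming errors arrive as (code, message) pairs. The function name, the two code sets, and the `is_method` flag are hypothetical choices for illustration, not scikit-learn's actual implementation nor a decision made in this PR:

```python
# Hypothetical rule sets, for illustration only.
ALWAYS_IGNORED = {"GL01"}  # summary should start on the line after the opening quotes
CLASS_ONLY = {"ES01", "EX01", "SA01"}  # extended summary, examples, see also


def filter_errors(errors, is_method):
    """Keep only the numpydoc errors we want to enforce.

    ``errors`` is an iterable of (code, message) pairs. Codes in
    ``CLASS_ONLY`` are enforced on class docstrings but skipped for methods.
    """
    kept = []
    for code, message in errors:
        if code in ALWAYS_IGNORED:
            continue
        if is_method and code in CLASS_ONLY:
            continue
        kept.append((code, message))
    return kept
```

Growing or shrinking the two sets over time would let the checks be tightened gradually, one rule at a time.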

glemaitre · 2025-03-03T11:42:50Z

> we get ~2000 lints. Most of them are superfluous IMO.

Is it that we get warnings on functions that are not part of the public API? The code in scikit-learn actually restricts these checks to the public code base, and I would argue that this is indeed enough.

auguste-probabl · 2025-03-03T12:45:50Z

> > we get ~2000 lints. Most of them are superfluous IMO.
>
> Is it that we get warnings on functions that are not part of the public API? The code in scikit-learn actually restricts these checks to the public code base, and I would argue that this is indeed enough.

Yes, that's right. I don't know what regex to give to numpydoc to filter things correctly (e.g. EstimatorReport is defined in _estimator/report.py, so _+ is not enough), so we'd need something like sklearn's custom pytest-based system to get something correct (whatever "correct" means for us).
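To illustrate why a path-based regex falls short, here is a sketch that collects public objects from a package's `__all__` instead of from file paths. Everything in it is an assumption for illustration: `iter_public_objects` is a hypothetical helper, and the `pkg` module is a stand-in built in memory, not skore itself.

```python
import inspect
import types


def iter_public_objects(package):
    """Yield (qualified_name, obj) for names exported in ``package.__all__``.

    For exported classes, public methods (no leading underscore) are yielded
    too, regardless of which private module the class is defined in.
    """
    for name in getattr(package, "__all__", []):
        obj = getattr(package, name)
        yield f"{package.__name__}.{name}", obj
        if inspect.isclass(obj):
            for attr, member in vars(obj).items():
                if not attr.startswith("_") and inspect.isfunction(member):
                    yield f"{package.__name__}.{name}.{attr}", member


# Hypothetical stand-in: a public class re-exported by the top-level package,
# as if it had been defined in a private module.
class EstimatorReport:
    """Stand-in report class."""

    def help(self):
        """Show help."""

    def _private_helper(self):
        """Not part of the public API."""


pkg = types.ModuleType("pkg")
pkg.EstimatorReport = EstimatorReport
pkg.__all__ = ["EstimatorReport"]

public_names = [name for name, _ in iter_public_objects(pkg)]
```

The resulting names could then be fed, one by one, to a docstring validator inside a parametrized test, which is roughly the shape of the pytest-based approach mentioned above.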

glemaitre · 2025-03-03T12:49:01Z

Yep, so it is indeed more involved than just turning the linter on. So we might delay that. We just need to ensure that we keep documenting as well as possible in the meantime ;)

auguste-probabl · 2025-03-03T13:14:52Z

What we could also do is start enforcing rules in CI only on changed files, so for a while every PR would include a bit of docs improvement
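As a sketch of the "changed files only" idea, the helper below narrows a list of changed paths down to the Python files under skore/src. The function name and the example paths are hypothetical. In CI the input could come from something like `git diff --name-only` against the target branch, and the selected files would then be handed to the linter.

```python
from pathlib import PurePosixPath


def select_files_to_lint(changed_paths, root="skore/src"):
    """Return the changed Python files located under ``root``.

    ``changed_paths`` is an iterable of repository-relative paths using
    forward slashes, one per changed file.
    """
    selected = []
    for raw in changed_paths:
        path = PurePosixPath(raw.strip())
        # Requires Python 3.9+ for PurePath.is_relative_to.
        if path.suffix == ".py" and path.is_relative_to(root):
            selected.append(str(path))
    return selected
```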

add numpydoc pre-commit hook

c3019b8

github-actions bot assigned auguste-probabl Feb 28, 2025

thomass-dev self-requested a review February 28, 2025 10:29

auguste-probabl added 6 commits February 28, 2025 11:30

add numpydoc in CI

4ce9eb2

All multi-line docstrings have a newline after the first """

b105b3b

Replaced '(\s*)"""(.*)([^"])$' with '$1"""\n$1$2$3' using repgrep

fix docstrings in _show_versions.py

5077563

refactor _get_sys_info

c3fcfef

fix docstrings in _progress_bar.py

7597bf8

fix docstrings in _patch.py

96045fb

auguste-probabl force-pushed the add-numpydoc branch from 67c5292 to 96045fb Compare February 28, 2025 10:30

auguste-probabl requested a review from glemaitre February 28, 2025 15:59

glemaitre reviewed Mar 1, 2025

auguste-probabl closed this Mar 3, 2025
