🧪 Unmeasure coverage in tests expected to fail #12531
Conversation
These tests are known to only be executed partially or not at all, so we always get incomplete, missing, and sometimes flaky coverage in the test functions that are expected to fail. This change updates the ``coverage.py`` config to prevent said tests from influencing the coverage level measurement.

FTR this is motivated by my attempt to look into improving the Codecov config. While checking it out, I noticed that there are uncovered lines in tests, meaning that some tests might never run in CI, and we'd never notice. It shouldn't be like that: https://nedbatchelder.com/blog/202008/you_should_include_your_tests_in_coverage.html. I hope this will help us start requiring 100% coverage on tests, following the best practices outlined in https://adamj.eu/tech/2019/04/30/getting-a-django-application-to-100-percent-coverage/. This won't affect non-testing code for now, but it seemed like a good place to start.
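For reference, a minimal sketch of what such an exclusion can look like in a coverage.py config (shown here as `pyproject.toml`). The exact pattern and file touched by this PR may differ; this only illustrates the approach of matching explicit xfail decorators so that recent coverage.py versions exclude the whole decorated test function:

```toml
# Illustrative sketch only -- the real regex in the PR may differ.
[tool.coverage.report]
exclude_also = [
    # Match explicit xfail decorators; recent coverage.py versions extend
    # an excluded decorator line to the entire decorated function.
    '^\s*@pytest\.mark\.xfail',
]
```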
I like the idea; however, I fear it might over-ignore some stuff. If we don't have too many xfails, I'd feel better with just manual `# pragma: no cover` comments.
@bluetech yeah, I had a feeling that there might not be enough buy-in. However, I believe this will not over-ignore stuff. Yes, there's some coverage on the first lines of some xfail tests, but it's not really useful. Over time, more or fewer lines may start failing there, and it'll influence PR-reported coverage changes here and there for no reason. It's even worse if the tests are very flaky.

It might make sense to measure coverage when working on fixing said xfail tests, but in that situation, one could just as well comment out or remove the mark while debugging. I've given this enough thought over the years to believe that there's no case where collecting coverage on an xfailing test is useful.

Do you have any example of where this would over-ignore tests? I intentionally wrote the regex in a way that works with explicit xfail mark decorators applied to all the parametrized variants, and not with cases where the mark is applied to only some parametrize factors; see the sketch below. This will also leave out module-level xfail marks.
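To make the distinction concrete, here is a hedged illustration (not code from pytest's own test suite): a function-level xfail decorator applies to every parametrized variant, so excluding the whole function loses nothing, whereas a mark attached to a single `pytest.param()` only affects that one case and the rest of the test still yields meaningful coverage:

```python
import pytest


# A decorator-line regex would match here and exclude the whole test,
# which is fine: every parametrized variant is expected to fail.
@pytest.mark.xfail(reason="known bug")
@pytest.mark.parametrize("value", [1, 2, 3])
def test_always_expected_to_fail(value):
    assert value < 0


# No decorator-line match here: only one variant is marked xfail via
# pytest.param(), so the test keeps producing useful coverage data.
@pytest.mark.parametrize(
    "value",
    [1, 2, pytest.param(3, marks=pytest.mark.xfail(reason="edge case"))],
)
def test_mostly_passing(value):
    assert value < 3
```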
For a project which is using pytest, I agree there is little risk. But here we are not only using pytest, we are also developing pytest and testing pytest, and my "over-ignoring" concern is for the latter two. Probably it's not a real issue but it feels like something that might start ignoring some unintended piece of code without us noticing.
That's the concern I'm trying to minimize here — with random unrelated coverage information changes, people are trained to ignore it altogether and never look into it. Just look into the effect of Codecov having a reputation of being flaky — it's been broken on […].

And I think that this is a low-effort improvement that would enhance the experience greatly. I just can't imagine a situation where we'd want to have xfailing tests non-ignored. Of course, for the lines that we know always get executed, it'd be nice to keep the coverage, but then we'd have to add a `# pragma: no cover` anyway.

Do you have any suggestions on handling things like https://app.codecov.io/gh/pytest-dev/pytest/blob/main/testing%2Ftest_debugging.py#L388?
@webknjaz OK, I trust your judgment on this. Thanks for improving the coverage situation!
Just a heads up from me on this -- while I think reaching 100% coverage is a laudable goal, I think that enforcing 100% is somewhat harmful. Sometimes you just want to do something without doing it perfectly...
I don't think that enforcing it is harmful, as long as you use `# pragma: no cover` for all the places you consciously want to skip from being taken into account. Especially for the tests.
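For what it's worth, "enforcing it" in coverage.py terms typically boils down to something like the following; this is a generic illustration, not necessarily the configuration pytest would adopt:

```toml
# Generic illustration -- not pytest's actual settings.
[tool.coverage.report]
fail_under = 100     # fail the coverage report below 100%
show_missing = true  # list uncovered line numbers in the terminal report
```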
@nicoddemus @RonnyPfannschmidt @The-Compiler do you want to post your opinions here before this is merged?
I like the configuration, thanks!
> I don't think that enforcing it is harmful, as long as you use `# pragma: no cover` for all the places you consciously want to skip from being taken into account. Especially for the tests.
I agree, I have a few projects where this is enforced and indeed it is beneficial, given you can add an explicit ignore when needed. 👍
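For context, such an explicit opt-out looks roughly like this; the function is made up for illustration and is not from pytest's code base:

```python
def _debug_repr(obj):  # pragma: no cover - only used while debugging locally
    """Illustrative helper deliberately excluded from coverage."""
    return f"<{type(obj).__name__} {vars(obj)!r}>"
```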
Backport to 8.2.x: 💚 backport PR created ✅
Backport PR branch: […]
Backported as #12551
🤖 @patchback
Thanks for the feedback, everyone!
(cherry picked from commit 1a8394e)
report. This has an effect of reducing the influence of flaky
tests on the resulting number.

-- by :user`webknjaz`
Syntax correction: #12560
It is easy to forget backticks in change note bylines. It's happened in pytest-dev#12531 already, requiring a hotfix in pytest-dev#12560. The pre-commit based check idea comes from the Tox project and has been battle-tested in aiohttp, CherryPy, and other ecosystems.
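A rough sketch of what such a pre-commit check could look like as a local `pygrep` hook; the hook id, name, pattern, and file glob below are made up for illustration and are not necessarily what got adopted:

```yaml
# Hypothetical local hook -- id, name, pattern and paths are illustrative.
- repo: local
  hooks:
    - id: changelog-byline-roles
      name: check RST roles in change note bylines
      language: pygrep
      # pygrep hooks fail when the regex matches; flag ":user`..." typos
      # where the role is missing its closing colon or backticks.
      entry: ':user`'
      files: '^changelog/.*\.rst$'
```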