The new version v2.2.4 generates false positive output #2786

pirat89 · 2023-03-10T10:06:53Z

The new codespell version released yesterday started to generate false positives when

a string is composed dynamically
when string contains escaped characters , so e.g. following strings are reported as mispelled:
"My repositor{suffix} ...".format(suffix=suffix)
'We couldn\'t do ....'

Additional info:

https://github.com/oamg/leapp-repository/actions/runs/4383259849/jobs/7673283029

Error: ./repos/system_upgrade/common/actors/localreposinhibit/actor.py:60: repositor ==> repository
Error: ./repos/system_upgrade/common/actors/localreposinhibit/actor.py:67: repositor ==> repository
Error: ./repos/system_upgrade/common/actors/checktargetiso/libraries/check_target_iso.py:149: repositor ==> repository
Error: ./repos/system_upgrade/common/actors/selinux/selinuxapplycustom/actor.py:154: couldn ==> could, couldn't
Error: ./repos/system_upgrade/common/actors/selinux/selinuxapplycustom/actor.py:162: couldn ==> could, couldn't

The text was updated successfully, but these errors were encountered:

DimitriPapadopoulos · 2023-03-10T11:02:36Z

That has always been the case. Are you certain this is a regression?

pirat89 · 2023-03-10T11:39:39Z

@DimitriPapadopoulos the mentioned code has been part of the upstream for a longer time and tests (including codespell) have been passing. e.g. like in this one PR oamg/leapp-repository#979 that introduced the chacktargetiso actor. It's visible that it passed originally and the same code is failing the test now. Also the previous run in the PR has been ok and it started to faile just today.

It's possible I missed something, but all clues I have seem to be suggesting it's regression.

DimitriPapadopoulos · 2023-03-10T12:21:12Z

OK, let me see:

$ cat > foo.bar.txt
My repositor{suffix}
We couldn\'t do
$

With codespell 2.2.2:

$ codespell foo.bar.txt
$

With codespell 2.2.3:

$ codespell foo.bar.txt
foo.bar.txt:1: repositor ==> repository
foo.bar.txt:2: couldn ==> could, couldn't
$

It's just that repositor and couldn have been added to the dictionary of typos:

repositor: a80c4d5 / Add many corrections to dictionary.txt #2608
couldn: 36c8c27 / Add some more misspellings found in Emacs #2660

DimitriPapadopoulos · 2023-03-10T12:32:31Z

This does raise the question of dictionary updates, especially in CI actions.

pirat89 · 2023-03-10T12:33:22Z

@DimitriPapadopoulos thanks for the info. It make sense. So I will add it to the ignorelist in our projects.

The new version of codespell contains additional "typos" for the detection in the dictionary, which produces FP fails in tests as typos are detected also in cases like: couldn\'t repositor{suffix} etc. For now, we will just update the ignorelist, but in future it would be ideal to not generate such cases. Doing differences between singular/plural is not providing big benefit in report. Escaping is not so problematic I would say, but in case of issues, we could just switch to longer form - like "could not". But there is no beenfit to update the existing code now, so let's focus in future on better texts and keep the existing strings as they are until they are reworded due to additional wanted changes (I mean, if there is any additional reason in future to change them). FYI: codespell-project/codespell#2786

pirat89 mentioned this issue Mar 10, 2023

Include leapp data files in the RPM & repository oamg/leapp-repository#1046

Merged

DimitriPapadopoulos added the question label Mar 10, 2023

DimitriPapadopoulos closed this as completed Mar 10, 2023

pirat89 mentioned this issue Mar 10, 2023

Update codespell ignorelist: couldn,repositor oamg/leapp-repository#1055

Merged

DimitriPapadopoulos mentioned this issue Mar 12, 2023

Skip new codespell false positives adrienverge/openfortivpn#1077

Merged

This comment was marked as off-topic.

Sign in to view

bdice mentioned this issue Dec 17, 2024

Fix codespell behavior. rapidsai/rmm#1769

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The new version v2.2.4 generates false positive output #2786

The new version v2.2.4 generates false positive output #2786

pirat89 commented Mar 10, 2023

DimitriPapadopoulos commented Mar 10, 2023

pirat89 commented Mar 10, 2023 •

edited

Loading

DimitriPapadopoulos commented Mar 10, 2023 •

edited

Loading

DimitriPapadopoulos commented Mar 10, 2023

pirat89 commented Mar 10, 2023 •

edited

Loading

This comment was marked as off-topic.

This comment was marked as off-topic.

The new version v2.2.4 generates false positive output #2786

The new version v2.2.4 generates false positive output #2786

Comments

pirat89 commented Mar 10, 2023

DimitriPapadopoulos commented Mar 10, 2023

pirat89 commented Mar 10, 2023 • edited Loading

DimitriPapadopoulos commented Mar 10, 2023 • edited Loading

DimitriPapadopoulos commented Mar 10, 2023

pirat89 commented Mar 10, 2023 • edited Loading

This comment was marked as off-topic.

This comment was marked as off-topic.

pirat89 commented Mar 10, 2023 •

edited

Loading

DimitriPapadopoulos commented Mar 10, 2023 •

edited

Loading

pirat89 commented Mar 10, 2023 •

edited

Loading