fix: Handle extra white-space in `MatchSpec` #3456

jjerphan · 2024-09-17T15:50:56Z

Fix #3453.

Signed-off-by: Julien Jerphanion <git@jjerphan.xyz>

Hind-M · 2024-09-18T10:42:32Z

micromamba/tests/test_env.py

+
+
+@pytest.mark.parametrize("shared_pkgs_dirs", [True], indirect=True)
+def test_env_create_whitespace(tmp_home, tmp_root_prefix, tmp_path):


I don't think we need this test; the tests in test_match_spec are enough to make sure the parsing is done correctly.
Adding this one is redundant and not relevant, in my opinion.

I agree with this, I just wanted to have a non-regression test for the reported issue.

libmamba/tests/src/specs/test_match_spec.cpp

Hind-M · 2024-09-18T10:54:49Z

libmamba/src/specs/match_spec.cpp

+        std::string raw_match_spec_str = std::string(str);
+        raw_match_spec_str = util::strip(raw_match_spec_str);
+
+        // Remove any white space after binary operators, such as:
+        //  - `openmpi-4.1.4-ha1ae619_102`'s improperly encoded `constrains`: "cudatoolkit >= 10.2"
+        //  - `pytorch-1.13.0-cpu_py310h02c325b_0.conda`'s improperly encoded
+        //  `constrains`: "pytorch-cpu = 1.13.0", "pytorch-gpu = 99999999"
+        //  - `fipy-3.4.2.1-py310hff52083_3.tar.bz2`'s improperly encoded `constrains` or
+        //  `dep`: ">=4.5.2"
+        //  - `infokonoha-4.6.3-pyhd8ed1ab_0.tar.bz2`'s `kytea >=0.1.4, 0.2.0` -> `kytea
+        //  >=0.1.4,0.2.0`
+        // TODO: this solution reallocates memory several times potentially, but the
+        //  number of operators is small and the strings are short, so it must be fine.
+        //  If needed it can be optimized so that the string is only copied once.
+        for (const std::string& op : { ">=", "<=", "==", ">", "<", "!=", "=", "," })
+        {
+            const std::string& bad_op = op + " ";
+            while (raw_match_spec_str.find(bad_op) != std::string::npos)
+            {
+                raw_match_spec_str = raw_match_spec_str.substr(0, raw_match_spec_str.find(bad_op)) + op
+                                     + raw_match_spec_str.substr(
+                                         raw_match_spec_str.find(bad_op) + bad_op.size()
+                                     );
+            }
+        }
+
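The TODO in the diff above notes that the string could be copied only once. A minimal sketch of that idea follows, assuming the loop is equivalent to dropping every space that directly follows one of `=`, `<`, `>` or `,` (each operator in the list ends with one of these characters, and each of them is also in the list on its own). The name `remove_spaces_after_operators` is hypothetical and not part of libmamba; the leading `util::strip` step is left out.

```cpp
#include <string>
#include <string_view>

// Hypothetical helper (not libmamba API): single-pass variant of the loop in the diff.
// Assumption: removing "<op> " for every listed operator amounts to dropping each
// space that directly follows one of the characters '=', '<', '>' or ','.
std::string remove_spaces_after_operators(std::string_view str)
{
    constexpr std::string_view op_end_chars = "=<>,";

    std::string out;
    out.reserve(str.size());  // single allocation; the result is never longer than the input

    for (const char c : str)
    {
        // True when the last character kept so far ends an operator.
        const bool after_op = !out.empty()
                              && op_end_chars.find(out.back()) != std::string_view::npos;
        if (c == ' ' && after_op)
        {
            continue;  // skip the space; out.back() is unchanged, so runs of spaces are skipped too
        }
        out.push_back(c);
    }
    return out;
}
```

With the examples from the comment, `"cudatoolkit >= 10.2"` becomes `"cudatoolkit >=10.2"` and `"kytea >=0.1.4, 0.2.0"` becomes `"kytea >=0.1.4,0.2.0"`. This still copies the input once, so it does not address the request to keep the `string_view`.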


Maybe we won't have a choice and will need to fix it this way, but I really think we should do this properly and stop postponing everything: deferring only increases the complexity and makes things harder to change afterwards, especially for MatchSpec.
IIRC this should rather be handled here, so we need to adapt the logic accordingly (and try to keep the string_view).

I agree with your proposal, although the low-level parser brings some complexity: if we want to handle all the current cases while excluding the PEP 508 environment markers, I am afraid the result will be far more complex than this horrible yet short and working solution.

In my opinion, the long-term robust solution (as discussed in the past) is to define a grammar for MatchSpec and use lexers to generate parsers in applications. But this goes far beyond the scope of this PR or the time we have at hand.

test: Add non-regression test for mamba-org#3453

b095102

Signed-off-by: Julien Jerphanion <git@jjerphan.xyz>

jjerphan mentioned this pull request Sep 17, 2024

test: Adapt test_env_update_pypi_with_conda_forge #3455

Closed

jjerphan added the 1.x Related to mamba 1.x branch/versions label Sep 18, 2024

JohanMabille added the release::bug_fixes For PRs fixing bugs label Sep 18, 2024

jjerphan removed the 1.x Related to mamba 1.x branch/versions label Sep 18, 2024

jjerphan added 2 commits September 18, 2024 09:45

Minimal suboptimal fix

2b5a30f

Signed-off-by: Julien Jerphanion <git@jjerphan.xyz>

Add edge cases to the env specification

60658f7

Signed-off-by: Julien Jerphanion <git@jjerphan.xyz>

jjerphan force-pushed the fix/handle-extra-white-space branch from caf12de to 60658f7 Compare September 18, 2024 08:05

jjerphan added 4 commits September 18, 2024 10:37

test: Add MatchSpec parsing subcases

90a8af8

Signed-off-by: Julien Jerphanion <git@jjerphan.xyz>

test: Complete test_env_create_whitespace

facfa9f

Signed-off-by: Julien Jerphanion <git@jjerphan.xyz>

Add kytea test case

6cb7a00

Signed-off-by: Julien Jerphanion <git@jjerphan.xyz>

Merge replacement of binary operators

6f6e041

Signed-off-by: Julien Jerphanion <git@jjerphan.xyz>

jjerphan marked this pull request as ready for review September 18, 2024 09:53

jjerphan mentioned this pull request Sep 18, 2024

test: MatchSpec edges cases #3458

Merged

Hind-M reviewed Sep 18, 2024

libmamba/tests/src/specs/test_match_spec.cpp Outdated

Hind-M reviewed Sep 18, 2024

jjerphan and others added 4 commits September 19, 2024 13:59

Merge branch 'main' into fix/handle-extra-white-space

fb05fe9

Lint with pre-commit

e65bc9b

Signed-off-by: Julien Jerphanion <git@jjerphan.xyz>

Adapt MatchSpec

7bed635

Signed-off-by: Julien Jerphanion <git@jjerphan.xyz> Co-authored-by: Hind Montassif <hind.montassif@gmail.com>

Remove redundant test

6944d93

Signed-off-by: Julien Jerphanion <git@jjerphan.xyz> Co-authored-by: Hind Montassif <hind.montassif@gmail.com>

Hind-M reviewed Sep 19, 2024

micromamba/tests/test_env.py Outdated

Hind-M reviewed Sep 19, 2024

libmamba/tests/src/specs/test_match_spec.cpp Outdated

jjerphan and others added 2 commits September 19, 2024 14:25

Rename subcase

589092d

Co-authored-by: Hind-M <70631848+Hind-M@users.noreply.github.com>

Adapt comparison on versions

2146c4f

Co-authored-by: Hind-M <70631848+Hind-M@users.noreply.github.com>

Hind-M reviewed Sep 19, 2024

libmamba/tests/src/specs/test_match_spec.cpp Outdated

jjerphan added 2 commits September 19, 2024 14:30

Adapt test case

bba3c3a

Signed-off-by: Julien Jerphanion <git@jjerphan.xyz>

Remove pytorch-cpu as it is not available on windows

01c5e68

Signed-off-by: Julien Jerphanion <git@jjerphan.xyz>

SylvainCorlay approved these changes Sep 19, 2024

SylvainCorlay merged commit 1c75567 into mamba-org:main Sep 19, 2024
32 checks passed

jjerphan deleted the fix/handle-extra-white-space branch September 19, 2024 14:55

jjerphan mentioned this pull request Sep 19, 2024

maint: Improve the parser of MatchSpec #3463

Open
