Fix the TODO test in ack-w.t #14

petdance · 2011-12-01T18:43:02Z

No description provided.

hoelzro · 2012-03-08T13:24:54Z

More specifically, this one:

local $TODO = q{I can't figure why the -w works from the command line, but not inside this test};

In my experience, this invocation doesn't work on the command line either, with either ack 1 or 2. The command line in question is this:

ack mu. -w -h t/text

The -w flag adds a word-boundary check (\b) on each side of the search term, but only if the corresponding side ends in an alphanumeric character. What we really need is a regular expression that matches a non-space character on one side, and a space character on the other, similar to \b.
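To make the behaviour described above concrete, here is a minimal sketch in Python rather than Perl (for these ASCII examples, `\b` behaves the same way in both). `conditional_wrap` is a hypothetical model of the "only if that side is alphanumeric" rule, not ack's actual code, and `whitespace_wrap` is only one possible reading of the "space on one side, non-space on the other" idea, using lookarounds. The sample strings are made up and are not taken from t/text.

```python
import re

def conditional_wrap(pattern: str) -> str:
    """Hypothetical model: add \\b on a side only when the pattern's
    literal first/last character is a word character."""
    left = r"\b" if re.match(r"\w", pattern) else ""
    right = r"\b" if re.search(r"\w$", pattern) else ""
    return left + pattern + right

def whitespace_wrap(pattern: str) -> str:
    """Assumed alternative: require whitespace (or a line edge) on both sides."""
    return r"(?<!\S)(?:" + pattern + r")(?!\S)"

# "mu." starts with a word character but ends with ".", so only the
# leading \b is added and the match can stop in the middle of a word.
print(conditional_wrap("mu."))                                   # \bmu.
print(bool(re.search(conditional_wrap("mu."), "a must-have")))   # True
print(bool(re.search(whitespace_wrap("mu."), "a must-have")))    # False
print(bool(re.search(whitespace_wrap("mu."), "say mu! now")))    # True
```

Under this model, `mu.` gets a boundary on the left only, which would explain why -w does not restrict it to whole words.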

gedge · 2013-12-07T02:30:55Z

The test does fail on the command-line, for me, too.

Why is \b put either side of the regexp only when it has \w each side? This breaks any (?:alternative|list) too.

Can't we just make \b...\b cuddling unconditional?

epa · 2014-04-09T15:57:00Z

The documentation says

       -w, --word-regexp
       Force PATTERN to match only whole words.  The PATTERN is wrapped with "\b" metacharacters.

So it does sound as though the code should unconditionally add \b on each side.

hoelzro · 2014-04-09T16:12:49Z

No matter what we do, the docs have to change. The two sentences you've pointed out are contradictory, as wrapping things with \b unconditionally will fail to match whole words only (at least, for liberal definitions of 'word' here). I'm personally in favor of removing the second sentence, as it's an implementation detail, subject to change, and not every ack user is familiar with Perl's regular expressions and \b.

petdance · 2014-04-09T16:13:40Z

The reason it says that is because this issue of "how should -w work" has been around since the dawn of time. I'm not saying we shouldn't change the \b-wrapping behavior, but we've been down the road of "This is what -w should do! No, this is what it should do!" before.

epa · 2014-04-09T16:17:16Z

I don't have strong views on the particular semantics of -w and I think the documented behaviour is as good as any. We just need to make sure the code and the test suite match the documentation!

gedge · 2014-04-12T17:16:28Z

ack has three interpretations of -w.

Two documentation sentences specify different cases:

"whole words" makes the assumption that the supplied pattern will match /^\w+$/ (great for simple documentation and simple use cases)
the second sentence has much wider scope (e.g. ack -w 'a|an' will end up as \ba|an\b and will match the a in after and also the an in bran - fail) - the simplistic first sentence no longer applies
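The precedence problem in the second case can be checked directly. This is a small sketch using Python's `re` module as a stand-in for Perl (alternation binds more loosely than `\b` in both); the test words are the ones from the comment.

```python
import re

# Without grouping, the pattern parses as (\ba) | (an\b):
# the first \b guards only "a", the second guards only "an".
naive = re.compile(r"\ba|an\b")

for word in ("after", "bran"):
    print(word, naive.findall(word))
# after ['a']
# bran ['an']
```

Neither `after` nor `bran` is the whole word `a` or `an`, yet both lines would be reported.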

The code (perl) is a third interpretation on the word theme, implementing sentence 2 conditionally and (worse, IMO) partially (we might get zero, one or two \b affixed to the PATTERN). The recent commit merely resolved one of the alternation issues.

The question remains: What is the most desired interpretation (and how do we document it so that users understand it)?

For me, the answer to that is to unconditionally apply \b(?:PATTERN)\b - with the following documentation:

Word match (the PATTERN matches if it is bounded by word breaks). PATTERN is wrapped in \b(?:....)\b

In short, I want ack -w 'a|an|th(?:e|at|is)' to match only the words a, an, the, that and this.
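As a rough check of that proposal, the sketch below wraps the pattern unconditionally in `\b(?:...)\b`. It uses Python's `re` instead of Perl, the helper name `word_wrap` is hypothetical, and the sample sentence is invented for illustration.

```python
import re

def word_wrap(pattern: str) -> str:
    """Wrap PATTERN unconditionally; the non-capturing group makes both
    \\b assertions apply to the whole alternation."""
    return r"\b(?:" + pattern + r")\b"

rx = re.compile(word_wrap("a|an|th(?:e|at|is)"))
text = "a bran muffin and an apple, then that thistle is the one after this"
print(rx.findall(text))
# ['a', 'an', 'that', 'the', 'this']
```

Words such as `bran`, `and`, `then`, `thistle` and `after` contain one of the alternatives but are not matched, because the boundary on one side or the other fails.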

epa · 2014-04-12T17:21:42Z

I agree 100% with gedge, and would add that peeking at the pattern (to see if it begins with a word character, or whatever) is generally a broken approach because it will never understand regexp syntax sufficiently.

epa · 2015-01-14T09:13:59Z

A possible fix for this is in issue #445.

petdance · 2017-03-14T04:49:38Z

This will get fixed in ack3.

petdance · 2017-03-29T18:26:18Z

This behavior is redone in ack3.

hoelzro mentioned this issue Aug 28, 2013

Fix TODO test in t/ack-w.t beyondgrep/ack1#132

Closed

hoelzro mentioned this issue Apr 9, 2014

-w does not interact well with metacharacters #445

Closed

epa mentioned this issue Jul 9, 2015

Fix -w behaviour and docs (<https://github.com/petdance/ack2/issues/445>) #558

Closed

petdance added the ack3 label Mar 14, 2017

petdance removed this from the Sooner milestone Mar 18, 2017

petdance closed this as completed Mar 29, 2017

petdance added the fixed in ack3 label Mar 29, 2017

ljedrz mentioned this issue Feb 14, 2018

ack closed the issues linked in the readme BurntSushi/ripgrep#803

Closed

matkoniecz mentioned this issue Mar 3, 2020

"(Yes, ack has a bug.)" appears to be outdated BurntSushi/ripgrep#1506

Closed
