Add regex support #13

rasa · 2024-05-10T02:11:05Z

Fixes #12.

I tried to write logic that converts regexes such as (abc|defg) to defg, but I gave up, as it's non-trivial. The cost/benefit is not worth it :). Instead, I just note the issue in the readme.

axllent · 2024-05-10T10:50:19Z

I'm sorry (and not to be rude...), but I'm really struggling to see the point of this functionality. Can you please explain the use-case for searching a regular expression?

Let me rephrase that: Can you please give me an example (or examples) of what someone may use a regular expression for?

rasa · 2024-05-10T13:55:44Z

Can you please give me an example (or examples) of what someone may use a regular expression for?

Certainly. Here are some examples:

.*word.* - find word anywhere in the key (word.* and .*word also work)
^.{0,10}word - find word anywhere in the first 10 letters of the key (how wireguard-vanity-address currently works)
word1.*word2 - find two words, but anywhere in the key. The first word may be the hostname, the second word could be the OS, its location, whether it's a server, or just a peer, etc.
(word1|word2).*(word1|word2) - find two words, but in any order, anywhere in the key (word1.*word1 will also match)
^word[/+A-Z0-9] - find lowercase word at the beginning of the key, delimited by a non-lowercase character so that word stands out more clearly. See also warner/wireguard-vanity-address#22 (Tip: Getting a totally "vain" address)
^[s5][o0][l1]ar - find 'solar' or the visually similar 's01ar', per warner/wireguard-vanity-address#25 (Match visually-ambiguous characters for more matches & longer strings)
^[s5][i1][z2][a4][b86][l7][e3].* - find 'sizable' in leetspeak
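
To make the patterns above concrete, here is a minimal Go sketch that runs a few of them against a made-up key. The key string, the words home and linux, and the matches helper are all assumptions for illustration, as is the (?i) prefix for case-insensitive matching; this is not the code in the PR.

```go
package main

import (
	"fmt"
	"regexp"
)

// matches is a hypothetical helper: it reports whether pattern matches key,
// ignoring case by prepending (?i) to the pattern.
func matches(pattern, key string) bool {
	return regexp.MustCompile("(?i)" + pattern).MatchString(key)
}

func main() {
	// A made-up 44-character string shaped like a base64 public key.
	key := "s01ar/9xQm3bHomeLinuxZt7Kp2VwYc8Rd4Uf6Gj1No="

	patterns := []string{
		`.*home.*`,                   // word anywhere in the key
		`^.{0,10}home`,               // word must start within the first 10 characters
		`home.*linux`,                // two words, in this order
		`(home|linux).*(home|linux)`, // two words, in either order
		`^[s5][o0][l1]ar`,            // 'solar' with look-alike characters
	}
	for _, p := range patterns {
		fmt.Printf("%-30s %v\n", p, matches(p, key))
	}
}
```

Only the second pattern fails here, because "Home" starts after the twelfth character of this sample key.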

Since adding each letter to the search term increases the search time exponentially, we want to give the user maximum flexibility in finding the term, or terms, they are looking for.
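
As a rough illustration of that growth, the sketch below assumes a case-sensitive, fixed prefix and treats every position as uniform over the 64-character base64 alphabet, so each extra character multiplies the expected number of attempts by 64. The expectedAttempts helper is hypothetical and is not the project's CalculateProbability.

```go
package main

import "fmt"

// expectedAttempts estimates how many random keys must be generated, on
// average, before one starts with a fixed prefix of n base64 characters.
// It assumes a case-sensitive match and is only valid for n <= 10,
// beyond which the uint64 result would overflow.
func expectedAttempts(n int) uint64 {
	attempts := uint64(1)
	for i := 0; i < n; i++ {
		attempts *= 64
	}
	return attempts
}

func main() {
	for n := 1; n <= 6; n++ {
		fmt.Printf("%d characters: about 1 in %d\n", n, expectedAttempts(n))
	}
}
```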

See also warner/wireguard-vanity-address#23

rasa · 2024-05-10T14:07:57Z

I changed this to draft, as I think we should add some of the above to the readme. If you've questioned its usefulness, I'm sure others will too.

Also, I think it's important to stop users from creating regex patterns that would never match. So, for example: aa$ will never match, as it's missing the = character.

axllent

I definitely would not do any if len(..) check for either slice, as this has to count both slices for every calculation, nor would I use a separate mutex for the string & regex wordmaps.

I would do something like:

	// Allow only one routine at a time to avoid
	// "concurrent map iteration and map write"
	c.mapMutex.Lock()
	defer c.mapMutex.Unlock()
	for w, count := range c.WordMap {
		if count == 0 {
			continue
		}
		completed = false
		if strings.HasPrefix(matchKey, w) {
			c.WordMap[w] = count - 1
			cb(Pair{Private: k.String(), Public: pub})
		}
	}

	for w, count := range c.RegexpMap {
		if count == 0 {
			continue
		}
		completed = false
		if w.MatchString(matchKey) {
			c.RegexpMap[w] = count - 1
			cb(Pair{Private: k.String(), Public: pub})
		}
	}

axllent · 2024-05-11T03:44:18Z

Also in main.go, I would bypass the estimation for regex entirely, as it cannot be calculated, and validate the regex with Compile() rather than MustCompile():

		if stripped != sword {
			regex, err := regexp.Compile(sword)
			if err != nil {
				fmt.Printf("Invalid regular expression: %s\n", sword)
				os.Exit(2)
			}
			c.RegexpMap[regex] = options.LimitResults
			fmt.Printf("Cannot calculate probability for a regular expression: %s\n", sword)
		} else {
			c.WordMap[sword] = options.LimitResults
			probability := keygen.CalculateProbability(stripped, options.CaseSensitive)
			estimate64 := int64(speed) * probability
			estimate := time.Duration(estimate64)

			fmt.Printf("Probability for \"%s\": 1 in %s (approx %s per match)\n",
				word, keygen.NumberFormat(probability), keygen.HumanizeDuration(estimate))
		}
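
The validation step can also be sketched on its own. The compileTerm helper below is hypothetical: it trims the term, adds (?i) only when matching is case-insensitive and the prefix is not already present, and returns the error from regexp.Compile instead of panicking as MustCompile would.

```go
package main

import (
	"fmt"
	"os"
	"regexp"
	"strings"
)

// compileTerm is a hypothetical helper that prepares and validates a
// user-supplied search term as a regular expression.
func compileTerm(term string, caseSensitive bool) (*regexp.Regexp, error) {
	pattern := strings.TrimSpace(term)
	if !caseSensitive && !strings.HasPrefix(pattern, "(?i)") {
		pattern = "(?i)" + pattern
	}
	return regexp.Compile(pattern)
}

func main() {
	for _, term := range []string{" ^so[l1]ar ", "(unclosed"} {
		re, err := compileTerm(term, false)
		if err != nil {
			// Report the term as the user typed it, on stderr.
			fmt.Fprintf(os.Stderr, "Invalid regular expression: %s\n", strings.TrimSpace(term))
			continue
		}
		fmt.Println(re.String())
	}
}
```

Returning the error lets the caller decide how to report it and which exit code to use, rather than crashing with a stack trace.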

rasa · 2024-05-11T17:49:02Z

@axllent It's ready for review. Let me know your thoughts. Happy to make any changes you deem worthy.

axllent · 2024-05-12T03:30:50Z

That's awesome, thanks @rasa - I did some testing and it works as I'd expect 👍

axllent · 2024-05-12T04:28:50Z

This has been merged and released in 0.0.9. I also just did a manual change to the README which resolves your other PR.

Thanks again for your hard work @rasa!

rasa added 3 commits May 9, 2024 19:07

Add regex support

f462fb2

feat: strip out {n}s

993549b

fix: ignore binary on Windows

3aa5ac7

rasa marked this pull request as draft May 10, 2024 14:00

fix: Unlock after loop, not function end

c33ad12

As defer defers to the end of the function, not the end of the block. See https://blog.learngoprogramming.com/gotchas-of-defer-in-go-1-8d070894cb01

rasa force-pushed the feat-add-regexs branch from 4443999 to c33ad12 May 10, 2024 17:16

fix: add (?i) prefix for case-insensitive regexes

9de5de6

axllent reviewed May 11, 2024

rasa added 6 commits May 10, 2024 22:28

Skip calc for regexs, simplying loop logic

5bd50ae

go fmt

4106e59

Replace MustCompile with Compile

52aefee

Don't include added (?i) in error message

a034694

Add regex section to readme, tweak regex error message

03a9bdf

Reject regexes that have no chance of finding a match

87032ea

Also:
Output error messages to stderr
Only add (?i) to regex if not already there
Move regex logic to end of utils.go
Trim spaces from search term

rasa marked this pull request as ready for review May 11, 2024 17:48

docs: Add link to regex tester, add escaping +s

d109f78

axllent merged commit 6ff2d42 into axllent:develop May 12, 2024

rasa deleted the feat-add-regexs branch May 12, 2024 19:48
