feat: UPS tracking numbers #228

P403n1x87 · 2021-11-04T11:51:02Z

⚠ Pull Requests not made with this template will be automatically closed 🔥

Prerequisites

Have you read the documentation on contributing? https://github.com/bee-san/pyWhat/wiki/Adding-your-own-Regex

Why do we need this pull request?

This PR adds the regex for UPS tracking numbers

What GitHub issues does this fix?

N. A.

Copy / paste of output

❯ pywhat 1Z123CAB9912345678    
Matched on: 1Z123CAB9912345678
Name: UPS Tracking Number
Link:  https://www.ups.com/track?tracknum=1Z123CAB9912345678

tests/test_regex_identifier.py

ghost · 2021-11-04T11:54:43Z

pywhat/Data/regex.json

+      "Regex": "^(1Z[0-9A-Z]{6}[0-9]{2}[0-9]{8})$",
+      "plural_name": false,
+      "Description": null,
+      "Rarity": 1,


I would say that rarity should be lowered.

Any suggestions? 🙂

Something around 0.3

Why 0.3? I'd say higher like 0.5 or 0.6 because:

The string has to start with 1Z

It needs 7 chars 0-9A-Z

It has exactly 2 numbers

It has 8 numbers

Also, can we make it:

- ^(1Z[0-9A-Z]{6}[0-9]{2}[0-9]{8})$ - + ^(1Z[0-9A-Z]{6}[0-9]{10})$

?

I think 0.4 or 0.5. And yes, regex should be changed.

The idea of the 2+8 split is because the first 2 digits in this group represent a service indicator code and perhaps it could be captured and handled in the future.

Aside: I wonder if the "rarity" could be estimated more reliably through some entropy-based measure 🤔

service indicator code

We have precedence for this called sub-categories. See the Mastercard / Phone Numbers regex. I am not sure it'll work on data in the middle of the regex, we may need to change the code for that :)

Aside: I wonder if the "rarity" could be estimated more reliably through some entropy-based measure 🤔

Probably! Currently I am estimating it based on what I see when people post this:

And also whether we have any false positives.

@P403n1x87 You can use subcategories with regex method for that.

pywhat/Data/regex.json

codecov-commenter · 2021-11-07T11:13:07Z

Codecov Report

Merging #228 (62b3df8) into main (a5a4a3b) will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##             main     #228   +/-   ##
=======================================
  Coverage   92.60%   92.60%           
=======================================
  Files          15       15           
  Lines        1217     1217           
=======================================
  Hits         1127     1127           
  Misses         90       90

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update a5a4a3b...62b3df8. Read the comment docs.

pywhat/Data/regex.json

Co-authored-by: piatrashkakanstantinass <74979584+piatrashkakanstantinass@users.noreply.github.com>

P403n1x87 force-pushed the feat/ups-tracking branch from c287929 to 2516179 Compare November 4, 2021 11:52

ghost suggested changes Nov 4, 2021

View reviewed changes

bee-san reviewed Nov 4, 2021

View reviewed changes

pywhat/Data/regex.json Outdated Show resolved Hide resolved

bee-san reviewed Nov 4, 2021

View reviewed changes

pywhat/Data/regex.json Outdated Show resolved Hide resolved

P403n1x87 force-pushed the feat/ups-tracking branch 3 times, most recently from 14b9fcc to 45d37fe Compare November 7, 2021 11:02

P403n1x87 requested review from bee-san and a user November 7, 2021 11:03

feat: UPS tracking numbers

e3880c0

P403n1x87 force-pushed the feat/ups-tracking branch from 45d37fe to e3880c0 Compare November 7, 2021 11:09

Merge branch 'main' into feat/ups-tracking

27cdcf2

ghost suggested changes Nov 7, 2021

View reviewed changes

pywhat/Data/regex.json Outdated Show resolved Hide resolved

Update pywhat/Data/regex.json

62b3df8

Co-authored-by: piatrashkakanstantinass <74979584+piatrashkakanstantinass@users.noreply.github.com>

P403n1x87 requested a review from a user November 7, 2021 11:27

ghost approved these changes Nov 7, 2021

View reviewed changes

bee-san merged commit 559a89c into bee-san:main Nov 7, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: UPS tracking numbers #228

feat: UPS tracking numbers #228

P403n1x87 commented Nov 4, 2021 •

edited by ghost

Loading

ghost Nov 4, 2021

P403n1x87 Nov 4, 2021

ghost Nov 4, 2021

bee-san Nov 4, 2021

ghost Nov 4, 2021

P403n1x87 Nov 4, 2021

P403n1x87 Nov 4, 2021

bee-san Nov 4, 2021

ghost Nov 4, 2021

codecov-commenter commented Nov 7, 2021 •

edited

Loading

feat: UPS tracking numbers #228

feat: UPS tracking numbers #228

Conversation

P403n1x87 commented Nov 4, 2021 • edited by ghost Loading

Prerequisites

Why do we need this pull request?

What GitHub issues does this fix?

Copy / paste of output

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

codecov-commenter commented Nov 7, 2021 • edited Loading

Codecov Report

P403n1x87 commented Nov 4, 2021 •

edited by ghost

Loading

codecov-commenter commented Nov 7, 2021 •

edited

Loading