[DealabsBridge - HotUKDealsBridge - MydealsBridge] Update Groups #2083
The bridges have been updated with the newest "groups" available on each website.
'Huile moteur' => 'huile-moteur',
'Hygiène corporelle' => 'hygiene-corporelle',
Links with hygiene-corporelle become invalid.
'Hygiène de la maison' => 'hygiene-de-la-maison',
'Hygiène des bébés' => 'hygiene-des-bebes',
'Image, son & vidéo' => 'image-son-video',
Links with image-son-video become invalid.
'Magasins d\'usine' => 'magasins-usine',
'Magazines' => 'magazines',
'Maillots de bain' => 'maillots-de-bain',
'Maillots de football' => 'maillots-de-football',
'Maison & Jardin' => 'maison-et-jardin',
Same
'Microsoft Office' => 'microsoft-office',
'Microsoft Surface' => 'microsoft-surface',
Same. I didn't check the other values that were removed.
I started checking whether old feed links are still valid for the values that were removed. I found some that became invalid.
Some of the existing links (like 'image-son-video') are redirected to another URL on the website, and some (like 'microsoft-surface') don't exist anymore. I'll check whether some of them are still visible in the "Similar Groups" box.
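A check like the one described above could be scripted roughly as follows. This is only a sketch, not the script actually used for this pull request: `BASE_URL`, `classify_response` and `check_slug` are hypothetical names, and the group URL prefix is an assumption that would need adjusting for each website.

```python
import urllib.error
import urllib.request

# Assumption: group pages live under this prefix; adjust it per website.
BASE_URL = "https://www.dealabs.com/groupe/"


def classify_response(requested_url, final_url, status):
    """Classify a fetch result as 'ok', 'redirected', or 'missing'."""
    if status >= 400:
        return "missing"
    # Ignore a trailing slash when comparing the requested and final URLs.
    if final_url.rstrip("/") != requested_url.rstrip("/"):
        return "redirected"
    return "ok"


def check_slug(slug, base_url=BASE_URL, timeout=10):
    """Fetch a group page and classify it (performs a network request)."""
    url = base_url + slug
    request = urllib.request.Request(url, headers={"User-Agent": "Mozilla/5.0"})
    try:
        # urlopen follows redirects; geturl() returns the final URL.
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return classify_response(url, response.geturl(), response.status)
    except urllib.error.HTTPError as error:
        # 4xx/5xx responses are raised as HTTPError and carry the status code.
        return classify_response(url, url, error.code)
```

Calling `check_slug` on each removed value would separate the slugs that still resolve, the ones that redirect elsewhere, and the ones that are gone. Network failures other than HTTP errors are not handled here and would propagate to the caller.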
…uble check
Groups were extracted from:
- Website menu and the links to group categories
- Group categories: "Popular Groups" and "More similar groups" links
- Group page breadcrumb
- Group page similar groups
Hello! I double-checked: I extended my crawling script to use more "sources". Some of the existing groups (like 'maison-et-jardin') still work, but they are no longer listed on the website. I think those are abandoned groups: the last deal in this group is from 2019. The group order changed a bit because I fixed a duplicate-entries issue in my crawling script and no longer need the sort command to remove them. IMHO, the feeds this pull request will break are already broken, or have not returned anything new for a while.
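For illustration, removing duplicates without sorting could look roughly like the sketch below. It is not the actual crawling script: `dedupe_groups` and `to_php_lines` are hypothetical helpers, and the choice to treat two entries as duplicates when they share a slug is an assumption.

```python
def dedupe_groups(pairs):
    """Drop entries whose slug was already seen, keeping first-seen order."""
    seen = set()
    unique = []
    for name, slug in pairs:
        if slug not in seen:
            seen.add(slug)
            unique.append((name, slug))
    return unique


def to_php_lines(pairs):
    """Render (name, slug) pairs as PHP array entries with single quotes."""
    lines = []
    for name, slug in pairs:
        # Escape backslashes and single quotes for a PHP single-quoted string.
        escaped = name.replace("\\", "\\\\").replace("'", "\\'")
        lines.append(f"'{escaped}' => '{slug}',")
    return lines
```

Because the first occurrence wins, the output follows the crawl order rather than alphabetical order, which would explain a changed group order compared with a sort-based approach.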
Did I miss something? :)
Hello! Am I missing some changes?
gj!
Btw, I just pushed 877707f, so you can add your crawling scripts to