Toronto Star - wrong CSS element scraped? #54

Open
SkelAlex opened this issue May 16, 2023 · 1 comment

@SkelAlex
Collaborator

Is it possible that the Toronto Star scraper picks up the headline and frontpage of the GTA section instead of the general headline and frontpage? When I visit the Toronto Star's webpage, the headline/frontpage shown there are not the same as the ones in Radar+. The first article I currently see on the Toronto Star's webpage is "STAR INVESTIGATION: Ontario’s top pathologist was accused of abusing his power. Now, judges say the bitter dispute never should have made it to their court". Radar+, on the other hand, currently has "Man wanted after woman sexually assaulted on Toronto walking trail" as its headline/frontpage. On the Toronto Star's webpage, that article only appears further down, as the main element of the "GTA" section.

Radar+ headline: https://clhub.clessn.cloud/admin/core/lake/57273/change/
Radar+ frontpage: https://clhub.clessn.cloud/admin/core/lake/57272/change/

Screenshots:
Screenshot, 2023-05-16 at 12 36 38
Screenshot, 2023-05-16 at 12 36 49
Screenshot, 2023-05-16 at 12 38 14
Screenshot, 2023-05-16 at 12 38 21

SkelAlex added the "invalid" (This doesn't seem right) label on May 16, 2023
@SkelAlex
Collaborator Author

Same problem today. The GTA article is scraped instead of the real headline/frontpage.

Screenshots:
Screenshot, 2023-05-19 at 10 35 08
Screenshot, 2023-05-19 at 10 35 16
Screenshot, 2023-05-19 at 10 36 03
Screenshot, 2023-05-19 at 10 36 10
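
One way to confirm this kind of mismatch is to list the page's headlines in document order together with the section each one sits in, then look up where the scraped title comes from. The following is only a minimal sketch using Python's standard `html.parser`. The markup in `SAMPLE_HTML`, the `data-section` attribute, the shortened headline texts, and the names `HeadlineCollector`, `collect_headlines`, and `section_of` are all assumptions made for illustration. The real Toronto Star page structure and the selector used by the Radar+ scraper are not known here.

```python
from html.parser import HTMLParser

# Hypothetical markup: NOT the real Toronto Star page structure.
SAMPLE_HTML = """
<main>
  <section data-section="top-stories">
    <h2><a href="/a">STAR INVESTIGATION: top story</a></h2>
  </section>
  <section data-section="gta">
    <h2><a href="/b">Man wanted after assault on walking trail</a></h2>
  </section>
</main>
"""


class HeadlineCollector(HTMLParser):
    """Collect (section label, headline text) pairs in document order."""

    def __init__(self):
        super().__init__()
        self.section_stack = []  # labels of the enclosing <section> elements
        self.in_heading = False
        self.buffer = []
        self.headlines = []      # list of (section_label, text)

    def handle_starttag(self, tag, attrs):
        if tag == "section":
            # "data-section" is an assumed attribute, used only in this sketch.
            self.section_stack.append(dict(attrs).get("data-section"))
        elif tag == "h2":
            self.in_heading = True
            self.buffer = []

    def handle_endtag(self, tag):
        if tag == "section" and self.section_stack:
            self.section_stack.pop()
        elif tag == "h2" and self.in_heading:
            self.in_heading = False
            section = self.section_stack[-1] if self.section_stack else None
            self.headlines.append((section, "".join(self.buffer).strip()))

    def handle_data(self, data):
        # Only keep text that appears inside an <h2> heading.
        if self.in_heading:
            self.buffer.append(data)


def collect_headlines(html):
    """Return all (section, headline) pairs found in the given HTML."""
    parser = HeadlineCollector()
    parser.feed(html)
    return parser.headlines


def section_of(html, scraped_title):
    """Return the section label of the heading matching scraped_title.

    Returns None if no heading matches (or if the heading is outside any section).
    """
    for section, text in collect_headlines(html):
        if text == scraped_title:
            return section
    return None


if __name__ == "__main__":
    print(collect_headlines(SAMPLE_HTML)[0])
    print(section_of(SAMPLE_HTML, "Man wanted after assault on walking trail"))
```

If the section reported for the scraped title is not the one holding the first headline in document order, the selector is probably matching an element inside a sub-section rather than the page's lead story.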
