incomplete content on multiple pages #739

Grienauer · 2023-04-17T21:41:58Z

Currently on following pages the parser seems to be lost.
I don't see any markup problems.
maybe the newspapers detect and block the scraper?

thx for info. happy to help.

Overwatching · 2023-04-20T21:06:18Z

There are multiple mentions in the issues section about header content being removed erroneously.
I think this falls into the same problem.

I came here to report the same thing happening on Hackaday.com/blog

ctipper · 2023-07-30T20:10:47Z

And https://www.thetimes.co.uk/ multiple articles, it clips the first one or two paragraphs on every page I'v tried. Kind of useeless in this state.

Provide feedback