-
-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
crawler does not skip links of the sharebuttons #49
Comments
That should be the case, yes. Did you check the HTML output? |
Sorry, I am an absolute beginner in these matters. What does that mean: "...check the html output"? Where can I check it and what should I see then? |
In your browser (right click - view source). The links that should not be checked by the broken link checker should have the |
Hello Fritz, grashalm |
Does the broken link checker actually check those links or just count them? |
That's a good question! I will check that tonight. Tomorrow, when the check is finished, I will download the logfiles and see, whether there are sharebutton links amongst the broken links. From former scans I can remember, that there were a lot of sharebutton links among the broken links. I will give you a current report tomorrow. |
so, the scan is through ... "2021-03-03 22: 25: 27.860060", "Contao \ CoreBundle \ Crawl \ Escargot \ Subscriber \ BrokenLinkCheckerSubscriber", http: //www.linkedin.com/shareArticle? Mini = true & url = https% 3A% 2F% 2Fdie -schreibmaus.de% 2Ftexte% 2Fdu-hast-geweint.html & title = You% 20hast% 20gewein, https: //die-schreibmaus.de/texte/du-hast-geweint.html,3, "skip-broken-link- checker, rel-nofollow, disallowed-robots-txt "," Did not check because it was marked to be skipped using the data-skip-broken-link-checker attribute. " "2021-03-03 22: 25: 27.860250", "Contao \ CoreBundle \ Crawl \ Escargot \ Subscriber \ BrokenLinkCheckerSubscriber", https: //www.xing.com/social_plugins/share/new? Sc_p = xing-share & h = 1 & url = https% 3A% 2F% 2Fdie-schreibmaus.de% 2Ftexte% 2Fdu-hast-geweint.html, https: //die-schreibmaus.de/texte/du-hast-geweint.html,3, "skip-broken- link-checker, rel-nofollow "," Did not check because it was marked to be skipped using the data-skip-broken-link-checker attribute. " I now conclude from this that it counts the sharebutton links, but does not check them, as if I were to deactivate them in the layout. |
I don't think there is anything I can or should do about this. If links with |
Okay, thank you very much for your assessment! |
hello fritz, you recently processed my pull request that I had sent to the core team regarding the long running times of the crawler on the share buttons. now i followed yanick's suggestion and added the "data-escargot-ignore" attribute to all links in the template. and now it works: the crawler no longer adds these links to the queue. Thanks a lot for this! grashalm |
Why do you want it removed? If anything it should be changed to |
That is what I meant. I just wanted to ask, if the code snippet "rel =" noopener noreferrer nofollow "data-skip-broken-link-checker" is still necessary, when the code "data-escargot-ignore" seems to work, or if the new snippet just could replace the old one. |
The templates now use |
That sounds great. Thank you very much! |
hello fritz,
Here is grashalm again. if I have correctly understood your revision to version 2.1.4, then the links of the sharebuttons from the crawler in contao should actually be skipped when crawling for broken links. Is that correct?
Since the sharebuttons appear as a module on every page of my site, my crawler still finds around 65,000 links on my site with every search. If I switch off the module in the layout beforehand, there are only around 8,700 links. is that still a bug, or have I misinterpreted your revision to version 2.1.4?
The text was updated successfully, but these errors were encountered: