-
Notifications
You must be signed in to change notification settings - Fork 291
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Lots of broken links, invalid HTML and so on (checklink) #561
Comments
Are you sure you ran the program correctly? In the first item,
Yet, no scheme I checked a couple of other errors, and they similarly are false positives. |
There're a bazillion of markup errors though: https://validator.w3.org/nu/?doc=http%3A%2F%2Fdoc.perl6.org%2Ftype%2FX%3A%3ATypeCheck%3A%3ASplice |
I’ll take a look at the 404’s, that should keep me occupied for a while. I suspect some of them will not be easily fixed, e.g. #155 From: Zoffix Znet [mailto:notifications@github.com] There's a bazillion markup errors though: https://validator.w3.org/nu/?doc=http%3A%2F%2Fdoc.perl6.org%2Ftype%2FX%3A%3ATypeCheck%3A%3ASplice — |
Saying that these are false positives is very optimistic. Sometimes the error message is LTA, other times the tool got confused because of markup errors. I'm not sure what's the issue in this particular case, but if you think that it is working incorrectly (that is, if it gets confused with correct HTML) then please file a bug report here. |
I have fixed quite a few broken links. Unless I have missed some, which I probably have, the ones that remain point to the following, undocumented, entities: |
I've added a graph so that we can track the progress. However, until someone fixes html problems we are not going to get any meaningful data there. |
If it helps anyone, I tried to debug the double-links on headers and gotten only as far as the three passes the torch |
Wikipedia is now redirecting all http requests to https. This means that any current link causes an unnecessary (permanent) redirect when the user clicks on it. The link checker also complains about it, so it is going to help with #561 a bit.
This is going to help with #561 a bit. There are probably better ways to fix these links, but that's good enough.
Actually, I think that most problems are now either fixed or have a corresponding issue filed. We need #584 to be fixed. |
@AlexDaniel - looks like #584 might be fixed; can you re-run this scan? |
@coke Just did. |
Mostly there are issues caused by "broken" method names. The whole thing is like this:
Without a type name there will be duplication of ids with routine/method/sub pages. The priority task is to find a way to avoid duplication and to avoid breaking fragments. About 3 months ago, there was a talk with gfldex++ about how one can use number suffixes to avoid duplication in Pod::To::Html, but this part is hard. There also a few easier issues such as links to items removed from Glossary, etc. They can be just removed/fixed and it will be okay, I think. |
Some further investigation notes:
|
Still looks pretty bad. |
Updated the graph again. |
FWIW this is the command I'm using: checklink -b -D 25 -q doc.perl6.org | tee "$(date '+%F')" |
What about listing broken links by file and go at them, one by one? Some links might be errors, some of them might actually have disappeared due to bitrot... |
@JJ well, you can just search the output for “404 Not Found” error and you'll find all broken links. Everything else is probably broken html. The goal here is to get the output to 0 lines (we were almost there in 2016). P.S. Updated the graph and the output again. |
@abraxxa thanks a lot for the report, but there are so many, that it's difficult to know where to start... |
Most 404 are now done, but now heading titles have drifted from their links, like in #2146 . Whenever you're working on a page, please check outgoing anchors for correctness. |
@AlexDaniel can you run this again? |
@JJ thanks for pinging, updated. |
So it got worse... |
Maybe we should check this again... |
@JJ updated. But there's not much of use updating it unless somebody wants to actively work on fixing the issues. |
A lot of broken links seem to be caused by a different directory structure: Things like:
It doesn't mind the redirect (although we should avoid it anyway), but the link should have |
Where have you found that link? |
@antoniogamiz you can see it in the output yourself. In this case it's the 🖉 link (Edit this page) from https://docs.raku.org/type/X::Proc::Async::AlreadyStarted. |
Thanks, Aleks. In many cases, broken links are the result of errors in HTML
that might or might not have been fixed; I was curious about that. And, of
course, I don't think there's been any check after the Big Change (TM).
I'm also going to start a round of bugfixes, so I just wanted to know the
state of this. Again, thanks, I appreciate it.
|
In reality, output is a little bit smaller, because if you take a look at:
You that is reporting
But these links work perfectly. I do not know what those appear as broken fragments. |
Because there are HTML errors and these are converted for some reason into
missing links.
|
Hum, that's a weird behavior. Anyway, I'm using this command
And errors in edit links will be solved in the next |
Yeah, I'm not sure about these particular ones. The link to webchat can be changed to have |
Will track in Raku/doc-website#72 on new site. |
I've used checklink utility on doc.perl6.org. Here is the output. I believe that all of these issues have to be fixed.
Some of the errors are repeated many times, that's why the output is so large.
Current status
The text was updated successfully, but these errors were encountered: