-
Notifications
You must be signed in to change notification settings - Fork 14
Update robots.txt for every stable release #17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
"Disallow" considered harmful. :)FYI, Rust docs SEO is probably being hurt by the use of "disallow" in the current
It's also the cause of the "No information is available for this page" (or similar) message in top search results on Google . How Godot handled poor doc SEO issueUntil recently Godot (as did many other RTD-based projects) had a similar SEO problem that was exacerbated by multiple language support. If I recall correctly, the proper way to handle older versions of docs and ensure their "Google juice" isn't lost while also causing current docs appear highest is via use of <link rel="canonical" href="https://docs.godotengine.org/en/stable/" /> or for a specific page: <link rel="canonical" href="https://docs.godotengine.org/en/stable/getting_started/step_by_step/" /> I did a lot of research into the issue for Godot and this issue has some additional details that may also apply to the Rust docs: godotengine/godot-docs#3262 (Especially the "Cause: robots.txt related." section.) Tool for checking
|
cc @jsha, this seems like something you'd be interested in |
Thanks for the tag @jyn514! I'd be curious what the original problems were with SEO for doc.rust-lang.org. From context I'm guessing it was a problem of Google choosing the wrong canonical URL, like we're trying to solve for docs.rs at rust-lang/docs.rs#1438? Perhaps in 2018 it was the case that each release was published at a versioned URL and there was no "current" URL like Also just to confirm, the current contents of https://doc.rust-lang.org/robots.txt are the same as shown in @follower's comment. It looks like the current robots.txt actually does block the versioned URL for each stable release, by blocking I have access to the Google Search Console for Also, qualitatively, my experience has been that search results for pages on doc.rust-lang.org usually point to the right page. So I propose to close this issue unless there are specific queries that someone can cite as giving wrong results. |
amazing, thanks @jsha! going to close this unless someone runs into trouble again :) |
I opened it 5 years ago but I think the problem was that when you looked for some concept on rust, google would link to a random version's rustdoc, like say 1.20.0. Which would then surprise users when not everything was available. Ultimately, no idea how or when this got resolved. I opened this back then to organize discussion on the issue which wasn't successful, but at least it got fixed. Shrug, closing is okay I guess. |
I'd heard that @imperio and @QuietMisdreavus from the docs team wanted to improve SEO for official rust documentation. So I dug a little and found that doc.rust-lang.org has a robots.txt. If we could list all stable releases in robots.txt, we could achieve the SEO improvements that the docs team wanted. Some things I found out:
It would be awesome if:
The text was updated successfully, but these errors were encountered: