-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Canonical links breaking spiders #181
Comments
Yep I agree |
The best way to think of a
readthedocs.io uses canonical links in a similar way to us (view the source of https://phpunit.readthedocs.io/en/7.0/installation.html). (edit: This paragraph is still true, but not too relevant to what Sam has raised) Now, I'd suggest that Swiftype shouldn't be acting as a normal spider, it should be indexing all content for search and allowing the customer to say which pages should or shouldn't be used for search results (perhaps a "respect canonical tags" flag is in order). I would not consider it unusual that a website would have it's own search index that doesn't respect canonical tags because they are intended to only be for public search engines. Of course we can't expect them to change their product overnight, but this is a point that should be put to them. |
Dan's RTD example:
That looks like Sam's second option. If RTD has decided that's a good option for them, I think it should be our default choice as well. If there was any issues with frequently changing the canonical tags, they would've picked that up by now. I don't see any relevant open issues about it. As a sanity check, Laravel Docs do this as well. |
Ah, sorry - I think I've got the wrong end of the stick here. I didn't realise our canonical tags pointed to a URL which itself 301-ed to somewhere (and occasionally the same page that had a canonical URL that had just redirected us back). Yes; that's super bad and needs to stop. I'd go for the 2nd option; as @chillu points out that seems fairly standard and I think it's more helpful for users to see the version number in the URL. It will still be interesting to see if Swiftype will index docs pages that do have canonical tags (eg: 3.x docs pages) as it seems they are respecting them a bit too eagerly. Clearly Google can handle canonical tags with this kind of redirect loop. |
Google are... big. |
This has been resolved with the Gatsby rewrite |
Following up on #156 and #180.
According to Swiftype support staff, having a canonical URL link that point to a redirect back to your current page is a bad thing to do for all search engines. It is breaking Swiftype's ability to index any content and they've claimed that it's a confusing thing to do for all spiders, including Google's.
If we want to use Swiftype, we'd need to work a way of reducing duplicate search results in google without creating infinite loops in our canonical links:
Comments below suggest that we fix this in the following way:
This solution was rejected:
The text was updated successfully, but these errors were encountered: