Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Port sitemap work #9

Closed
7 tasks done
mbarnett opened this issue Jun 20, 2017 · 0 comments
Closed
7 tasks done

Port sitemap work #9

mbarnett opened this issue Jun 20, 2017 · 0 comments
Assignees
Labels

Comments

@mbarnett
Copy link
Contributor

mbarnett commented Jun 20, 2017

Caching currently lasts 24 hours from the first request to /sitemap.xml, although we can raise or lower this as desired. My back-of-the envelope math says it will take around 45 seconds to respond to the first request assuming about 45k items in the repository. A nightly cron task to curl the URL would eliminate any lag for most users.

  • Get the modified date indexed into Solr
  • Theses.
  • Item visibility (filter out private items)
  • Add a static robots.txt file?
  • Other requirements from Google Scholar requirements: Google Scholar optimization #234
  • Change SitemapPolicy to simply inherit from ApplicationPolicy.
  • Add an application specific property on fileset to include hash, file size, and mimetype information.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants