Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add custom extension to robots.txt cache #135

Closed
p37307 opened this issue Jun 28, 2024 · 1 comment
Closed

Add custom extension to robots.txt cache #135

p37307 opened this issue Jun 28, 2024 · 1 comment

Comments

@p37307
Copy link

p37307 commented Jun 28, 2024

Because of the domain names extension, my OS is associating filetypes to the cached robots files in artifacts.

For instance, it associates domains ending in .au as audio files, .one as MS Office OneNote files, .cat as Windows security cat files, .com as MS_DOS applications, etc.

It may not be a big deal unless my search indexer tries to read its data, expecting to get data associated with those filetypes and mucks up.

Could you add a unique extension when it saves the robots file that is universally recognized as a text file.

image

image

image

@nanos
Copy link
Owner

nanos commented Jun 28, 2024

Oops. I do realise that this is probably not the smartest anyway. I’m sure bad stuff will happen if someone has utf-8 glyphs in their domain name. I think I’ll just do a hash of the domain name and use that…

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants