-
Notifications
You must be signed in to change notification settings - Fork 339
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature Request] Add ArchiveBox as an archival endpoint #380
Comments
Integrating this might be an option, but I'd prefer using a REST API rather than including a CLI tool into the Docker image, and then trying to drive that from the Django app. Apart from that the expectations should be clarified. Creating the snapshot seems pretty clear. What do you expect to happen after the snapshot was created?
An alternative could be to expose the
|
@sissbruecker It would be great if the snapshot link would replace the internet archive link. It would be enough if we can manually trigger it for the bookmarks that haven't been archived from the admin panel. |
REST API is arrived: ArchiveBox/ArchiveBox#1397 (comment) |
How would this be different from having ArchiveBox pull an RSS feed from LinkDing? |
I have no ideia, how do that? |
That sounds like a good idea to me. From what I can tell linkding provides RSS feeds under settings -> integrations. And Archivebox can pull rss feeds. I haven't tried it yet, but I imagine the big difference with that solution is that it's not as integrated. It would be nice to browse the bookmark within linkding and be able to select the at-home-archival link, without having to switch over to archivebox and then go hunting for it. |
I've did my first Archivebox backup of my Linkding feed yesterday. It was very easy: I grabbed the RSS link and added it to Archivebox UI with depth=1 and my archive methods of choice (wget + mercury + title + favicon). I understand that Archivebox also comes with a scheduler that can refresh the backups regularly, so once my Linkding RSS is registered, I expect that it will keep my archive updated. (This is good enough for me.) Maybe the crux of this request is for Linkding to do its own orchestration of backups? E.g. whenever there's a new bookmark created, Linkding would enqueue a Huey task to archive it. Opinion: If we'd like to go this way, my preferred solution would be for Linkding's background task to actually perform the backup using the tool of choice (e.g. wget), so that archivebox wouldn't be required in the first place. Archivebox itself is just an orchestrator of backups, so it seems redundant for one orchestrator to require another. |
I see the value of Archivebox as the multiple options it provides to get easy coverage of tools for archival. No single tool that I've tried has "just worked"(tm) on all webpages. For some pages only screenshots will actually save the content in the manner it was displayed.
Edit: OK, I pulled the latest-plus image, and the snapshot feature does much better on the acid test than I anticipated. I'm still leery of the pitfalls of using a single tool to archive, but given this feature is available in linkding and works quite well I think I'd be a mistake to ask for yet another feature just to integrate Archivebox. For page-doesn't-archive edge cases I think the RSS feed to Archivebox will suffice just fine. I don't want to rugpull any further discussion, but as for my original ask I think this feature request can be closed. |
ArchiveBox is a self-hosted web archival tool that stores pages in a variety of formats. While it appears there isn't a stable REST API yet, there is a pretty simple CLI that should be workable with linkding.
Would a pull request for adding Archivebox support be given consideration?
The text was updated successfully, but these errors were encountered: