Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow indexing and replay from live Web contents #211

Open
machawk1 opened this issue Jun 29, 2017 · 4 comments
Open

Allow indexing and replay from live Web contents #211

machawk1 opened this issue Jun 29, 2017 · 4 comments

Comments

@machawk1
Copy link
Member

https://github.com/VictorBjelkholm/ipfscrape is a command-line tool to allow a user to enter a URI, wgets the content from the live Web, pushed it to IPFS, then serves it from localhost.

https://github.com/webrecorder/warcio is a python modules that provides the ability to create a WARC file from a live Web page.

ipwb currently has the ability to go from warc->ipfs->ipwb replay

Integrate with warcio to (write warc via warcio)->warc->ipfs->ipwb replay

@machawk1 machawk1 changed the title Push to IPFS from live Web Allow indexing and replay from live Web contents Jun 29, 2017
@machawk1
Copy link
Member Author

Q: for testing, can we instruct the CI system to kill the network connection mid-way through the test? This would allow us to test the feature in this issue post-push at pull/replay-time.

@machawk1
Copy link
Member Author

@machawk1
Copy link
Member Author

machawk1 commented Jun 29, 2017

@machawk1
Copy link
Member Author

@b5 from @datatogether stated that this week he is hoping to get a proof of concept to exhibit the following procedure:

  1. Start with a user-generated collection of URLs. Allow users to fire off a "task" that will...
  2. Generate a WARC of that collection using https://github.com/datatogether/warc
  3. Generate an IPWB-Compatible CDXJ file.
  4. Put all of that on IPFS
  5. Demo the WARC in IPWB.

Stand by and keep an eye on these efforts.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant