I was chatting with @lelandrichardson about some of our plans for supporting multiple rendering targets (#73) and generating a review HTML page (#176), and we thought it might not be a bad idea to change the way we upload images.
Currently, we upload an HTML file and a set of images as a sort of bundle to a single "directory". Over many runs, this is likely to produce duplicate images. What if, instead, we put the images in a more general location where the filenames are hashes of the file contents? That way, if we have multiple images that are the same, we only upload them once.
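As a rough sketch of what the content-addressed naming could look like (using Node's built-in crypto module; the extension handling here is just illustrative):

```js
const crypto = require('crypto');
const fs = require('fs');
const path = require('path');

// Derive a content-addressed name for an image: identical contents
// always produce the same key, so duplicates collapse to one upload.
function contentHashKey(filePath) {
  const contents = fs.readFileSync(filePath);
  const hash = crypto.createHash('sha256').update(contents).digest('hex');
  return hash + path.extname(filePath); // e.g. "<sha256 hex>.png"
}
```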
This would have the advantage of allowing us to make all examples browsable on the review page with minimal overhead.
If we do this, the files should be stored in sub-directories so that the directory doesn't end up growing to contain millions of files over time.
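One common sharding layout (purely illustrative, including the `images/` prefix) is to use the first couple of hex characters of the hash as the sub-directory:

```js
// Shard by the first two hex characters of the hash, giving 256
// sub-directories; with an even hash distribution, a million files
// works out to roughly 4,000 per directory instead of one flat listing.
function shardedKey(hashName) {
  return `images/${hashName.slice(0, 2)}/${hashName}`;
}

// shardedKey('ab12cd34ef.png') => 'images/ab/ab12cd34ef.png'
```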
We could probably also apply the same technique to the JavaScript files that we need for the review page, to cut down on duplication, but that seems less valuable than doing it for the images.
This is a good idea. Do you have an idea for how it could be implemented as well? It could be as simple as using the uploader configuration option that I'm adding/thinking of adding as part of supporting #176.
It would be nice to avoid re-uploading assets that already exist, so we might want to build that check into the upload process. The imageDirectory option sounds okay to me, but I think we could probably just set it to something fixed and not make it configurable, right?
Ah, that's probably right, since people mostly just have to specify an S3 bucket name anyway. We can choose to make a special folder for images and check before upload.
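A minimal sketch of the check-before-upload, assuming the AWS SDK for JavaScript (v2); the bucket and key names are placeholders and error handling is simplified:

```js
const AWS = require('aws-sdk');
const s3 = new AWS.S3();

// Skip the upload if an object with this key already exists.
// Because keys are content hashes, "exists" implies "identical".
async function uploadIfMissing(bucket, key, body) {
  try {
    await s3.headObject({ Bucket: bucket, Key: key }).promise();
    return false; // already uploaded on a previous run
  } catch (err) {
    if (err.code !== 'NotFound') throw err;
  }
  await s3.putObject({ Bucket: bucket, Key: key, Body: body }).promise();
  return true;
}
```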
We might also try a potential optimization where we upload a JSON file for every sha that maps examples to their images. ~~Then we can potentially download all of the images for the base commit if they already exist. Not sure if that will be faster, but it might be, depending on how long the build takes.~~ Then we can compare the hash of the current snapshot to the hash in the JSON file, which should be a lot faster.
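The manifest shape and the lookup are just a sketch here (the example names and file layout are assumptions, not anything decided), but the comparison itself would be a simple diff of two maps:

```js
// Hypothetical per-commit manifest, e.g. one JSON file per sha,
// shaped like: { "<example name>": "<content hash of its image>" }.
function findChangedExamples(baseManifest, currentManifest) {
  // An example changed if its current hash differs from the hash
  // recorded for the base commit (or it didn't exist there at all).
  return Object.keys(currentManifest).filter(
    (example) => baseManifest[example] !== currentManifest[example]
  );
}
```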