Access files remotely, with caching.
devtools::install_github("richfitz/remotefile")
This is designed for use with the enron corpus and I'll bundle the usage into a package at some point. But this should work for now:
path <- "https://gitlab.com/rsheets/enron_corpus/raw/master/sheets"
dest <- tempfile()
dir.create(dest)
x <- remotefile_init(dest, path)
head(remotefile_list(x))
head(remotefile_list(x, local=TRUE))
res <- remotefile_fetch(remotefile_list(x)[[555]], x)
file.exists(res)