Skip to content

Commit

Permalink
Merge pull request #8752 from GlobalDataverseCommunityConsortium/GDCC…
Browse files Browse the repository at this point in the history
…/8750-DRS_Archiver

GDCC/8750 DRS archiver
  • Loading branch information
kcondon authored Aug 15, 2022
2 parents e1f862e + 1ddbe38 commit 6bd3ec6
Show file tree
Hide file tree
Showing 3 changed files with 398 additions and 5 deletions.
4 changes: 2 additions & 2 deletions doc/sphinx-guides/source/installation/config.rst
Original file line number Diff line number Diff line change
Expand Up @@ -1081,7 +1081,7 @@ These archival Bags include all of the files and metadata in a given dataset ver

The Dataverse Software offers an internal archive workflow which may be configured as a PostPublication workflow via an admin API call to manually submit previously published Datasets and prior versions to a configured archive such as Chronopolis. The workflow creates a `JSON-LD <http://www.openarchives.org/ore/0.9/jsonld>`_ serialized `OAI-ORE <https://www.openarchives.org/ore/>`_ map file, which is also available as a metadata export format in the Dataverse Software web interface.

At present, archiving classes include the DuraCloudSubmitToArchiveCommand, LocalSubmitToArchiveCommand, GoogleCloudSubmitToArchive, and S3SubmitToArchiveCommand , which all extend the AbstractSubmitToArchiveCommand and use the configurable mechanisms discussed below.
At present, archiving classes include the DuraCloudSubmitToArchiveCommand, LocalSubmitToArchiveCommand, GoogleCloudSubmitToArchive, and S3SubmitToArchiveCommand , which all extend the AbstractSubmitToArchiveCommand and use the configurable mechanisms discussed below. (A DRSSubmitToArchiveCommand, which works with Harvard's DRS also exists and, while specific to DRS, is a useful example of how Archivers can support single-version-only semantics and support archiving only from specified collections (with collection specific parameters)).

All current options support the archival status APIs and the same status is available in the dataset page version table (for contributors/those who could view the unpublished dataset, with more detail available to superusers).

Expand Down Expand Up @@ -1185,7 +1185,7 @@ The S3 Archiver defines one custom setting, a required :S3ArchiverConfig. It can

The credentials for your S3 account, can be stored in a profile in a standard credentials file (e.g. ~/.aws/credentials) referenced via "profile" key in the :S3ArchiverConfig setting (will default to the default entry), or can via MicroProfile settings as described for S3 stores (dataverse.s3archiver.access-key and dataverse.s3archiver.secret-key)

The :S3ArchiverConfig setting is a json object that must include an "s3_bucket_name" and may include additional S3-related parameters as described for S3 Stores, including "profile", "connection-pool-size","custom-endpoint-url", "custom-endpoint-region", "path-style-access", "payload-signing", and "chunked-encoding".
The :S3ArchiverConfig setting is a JSON object that must include an "s3_bucket_name" and may include additional S3-related parameters as described for S3 Stores, including "profile", "connection-pool-size","custom-endpoint-url", "custom-endpoint-region", "path-style-access", "payload-signing", and "chunked-encoding".

\:S3ArchiverConfig - minimally includes the name of the bucket to use. For example:

Expand Down
18 changes: 15 additions & 3 deletions pom.xml
Original file line number Diff line number Diff line change
Expand Up @@ -52,7 +52,7 @@
-->
<!-- TODO: Housekeeping is utterly needed. -->
<dependencies>

<!-- This dependency ensures the lib is on the classpath. That way, anything coming from
SLF4J should flow through our usual ways of using java.util.logging.
(Which makes it also configurable using those mechanisms.) No direct use intended. -->
Expand Down Expand Up @@ -357,7 +357,7 @@
<dependency>
<groupId>commons-codec</groupId>
<artifactId>commons-codec</artifactId>
<version>1.9</version>
<version>1.15</version>
</dependency>
<!-- JavaSwift/JOSS: for accessing OpenStack cloud storage -->
<dependency>
Expand Down Expand Up @@ -516,7 +516,19 @@
<artifactId>google-cloud-storage</artifactId>
<!-- no version here as managed in <dependencyManagement> of parent for convergence! -->
</dependency>

<!-- JWT support, currently in DRSSubmitToArchiveCommand-->
<!-- https://mvnrepository.com/artifact/com.auth0/java-jwt -->
<dependency>
<groupId>com.auth0</groupId>
<artifactId>java-jwt</artifactId>
<version>3.19.1</version>
</dependency>
<!-- https://mvnrepository.com/artifact/io.github.erdtman/java-json-canonicalization -->
<dependency>
<groupId>io.github.erdtman</groupId>
<artifactId>java-json-canonicalization</artifactId>
<version>1.1</version>
</dependency>

<!-- TESTING DEPENDENCIES -->
<dependency>
Expand Down
Loading

0 comments on commit 6bd3ec6

Please sign in to comment.