This repository has been archived by the owner on Mar 14, 2022. It is now read-only.
Robots & Workflows Replacement Analysis
Christina Harlow edited this page Apr 30, 2018 · 1 revision
Reviewed as a first pass at mapping the required processing steps to our TACO Processing Framework.
Assembly (Items Only): assemblyWF (robot suite | workflow definition | config.yaml)
- start-assembly (Initiate assembly of the object)
  - Just starts the assembly workflow
- content-metadata-create (method) (Create content-metadata from stub content metadata if it exists)
  - Checks that the resource is an Item (Object)
  - Raises an issue if both stub content metadata and full content metadata exist
  - Checks whether full content metadata exists (so as not to overwrite it)
  - If stub content metadata exists: iterates through the list of Files and adds assertions to the Item for each file contained, its order, and its filepath
  - If no content metadata exists: queries the current filepath / directory for all files, iterates through that list, and adds the files to the Item metadata
- jp2-create (method) (Create JP2 derivatives for any images in object)
  - For each supported image type that is part of specific resource types, generates a JP2 derivative and modifies the content metadata XML to reflect the new file
  - For an Item, grabs all files matching a supported type
  - Creates a new Image object for the JP2
  - Fails / stops the job if a JP2 for the starting / original File already exists
  - Creates the JP2
  - Renames the JP2 based on the starting / original File’s name
  - Associates it with the Image object and the Item object
- checksum-compute (method) (Compute and compare checksums for any files referenced in contentMetadata)
  - For each file in the Item (Object):
    - Computes a new checksum (md5 and sha1)
    - If the File metadata contains a checksum, compares and raises an error if the comparison fails
    - Otherwise, asserts the checksum in the File metadata
- exif-collect (method) (Calculate and add exif, mimetype, file size and other attributes to each file node in contentMetadata)
  - For each file in the Item (Object):
    - Adds the following regardless of Item type:
      - mimetype (unless it already exists)
      - filesize (unless it already exists)
    - Adds preservation and release information based on mimetype, unless that information is already provided
      - NEEDS MORE INFO
      - preserve?
      - publish?
      - shelve?
    - If the file is an image, adds image info (height + width) metadata
    - Otherwise, just ensures the type in metadata is ‘File’
- accessioning-initiate (method) (Initiate workspace and start common accessioning)
  - Creates a druid-compliant workspace via dor-services
  - Posts the resource to that workspace via POST to dor-services
  - Kicks off accessioning / accessionWF via dor-workflow-services
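To make the checksum-compute step above concrete, here is a minimal Python sketch of its logic. It is an illustration only, not the robot's actual implementation: the `verify_or_assert_checksums` function, the `ChecksumMismatch` exception, and the plain-dict shape of the File metadata are all assumptions made for this example. Only the behaviour (compute md5 and sha1, compare against a recorded value if there is one, otherwise record the computed value) comes from the list above.

```python
import hashlib
from pathlib import Path


class ChecksumMismatch(Exception):
    """Raised when a recorded checksum disagrees with the computed one."""


def compute_checksums(path, chunk_size=1 << 20):
    """Return {'md5': ..., 'sha1': ...} for the file at `path`, read in chunks."""
    md5, sha1 = hashlib.md5(), hashlib.sha1()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            md5.update(chunk)
            sha1.update(chunk)
    return {"md5": md5.hexdigest(), "sha1": sha1.hexdigest()}


def verify_or_assert_checksums(file_metadata, base_dir):
    """Compare recorded checksums with computed ones, or record them if absent.

    `file_metadata` is a hypothetical dict such as
    {"filepath": "page1.tif", "checksums": {"md5": "..."}}.
    """
    computed = compute_checksums(Path(base_dir) / file_metadata["filepath"])
    recorded = file_metadata.setdefault("checksums", {})
    for algorithm, value in computed.items():
        if algorithm in recorded:
            # A checksum is already asserted: it must match what we computed.
            if recorded[algorithm].lower() != value:
                raise ChecksumMismatch(
                    f"{file_metadata['filepath']}: {algorithm} mismatch"
                )
        else:
            # Nothing recorded yet: assert the computed checksum.
            recorded[algorithm] = value
    return file_metadata
```

Reading in chunks keeps memory use flat for large image files. Raising on the first mismatch mirrors the "raise error if fails" behaviour described for the step.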
Accession: accessionWF (robot suite | workflow definition)
- start-accession (Start Accessioning)
  - Just starts the common accessioning / accession workflow
- descriptive-metadata (method) (default XML) (Descriptive Metadata)
  - Checks that a newer descriptive metadata datastream doesn’t exist
  - Builds a descriptive metadata datastream in Fedora based on a file in the local workspace
- rights-metadata (method) (default XML) (Rights Metadata)
  - Checks that a newer rights metadata datastream doesn’t exist
  - Builds a rights metadata datastream in Fedora based on a file in the local workspace
- content-metadata (method) (default XML) (Content Metadata)
  - Checks that a newer content metadata datastream doesn’t exist
  - Builds a content metadata datastream in Fedora based on a file in the local workspace
- technical-metadata (method) (Technical Metadata)
  - Checks that a newer technical metadata datastream doesn’t exist
  - Builds a technical metadata datastream in Fedora based on a file in the local workspace
- shelve (method) (Shelve content in Digital Stacks)
  - Determines whether Files have changed between the object save and what is in persistence
  - Determines the location of the Files in persistence
  - Determines the workspace location of the Files to be added / removed
  - Either:
    - Removes the file from Stacks
    - Renames the file in Stacks
    - Or moves the file into Stacks
- publish (method) (Publish Metadata)
  - Checks rights > access > discover for world
    - If so, copies public_xml over to Purl’s cache
    - Otherwise, prunes from the current document cache
- provenance-metadata (method) (Provenance Metadata)
  - Adds repository, object DRUID, who / workflow process, and event text to the provenance streams
- sdr-ingest-transfer (method) (Initiate Ingest into Preservation)
  - Transfers an object to the SDR Ingest Service. Appears to have once required an Agreement for this, but the robot passes in “”.
- sdr-ingest-received (method) (Signal from SDR that object has been received)
  - ??
  - Receives notice that the ingest was received (where?)
- reset-workspace (method) (Reset workspace by renaming the druid-tree to a versioned directory)
  - Cleans up the workspace based on DRUID and version
- end-accession (method) (Clean up any diff caches and set disseminationWF:cleanup to waiting)
  - Searches for a special additional dissemination workflow from the object’s APO
  - Moves on to dissemination
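The shelve step above amounts to diffing two file listings and turning the difference into Stacks operations. The following Python sketch shows one way that planning could look. It is hedged throughout: the `plan_shelve_actions` name, the `{relative_path: checksum}` dict shape, the rule that a removed file whose checksum reappears under a new path counts as a rename, and the rule that changed content is moved in again are assumptions for illustration, not details taken from the shelve robot.

```python
def plan_shelve_actions(persisted, workspace):
    """Plan the Stacks operations needed to match the saved object.

    persisted: {relative_path: checksum} for files currently in persistence
    workspace: {relative_path: checksum} for files in the saved object
    Returns a list of ("remove", path), ("rename", old, new) or ("move", path).
    """
    actions = []
    removed = {p: c for p, c in persisted.items() if p not in workspace}
    added = {p: c for p, c in workspace.items() if p not in persisted}

    # Index newly added paths by checksum so renames can be detected.
    added_by_checksum = {}
    for path, checksum in sorted(added.items()):
        added_by_checksum.setdefault(checksum, []).append(path)

    for old_path, checksum in sorted(removed.items()):
        candidates = added_by_checksum.get(checksum)
        if candidates:
            # Assumption: same content under a new path is a rename.
            new_path = candidates.pop(0)
            del added[new_path]
            actions.append(("rename", old_path, new_path))
        else:
            actions.append(("remove", old_path))

    # Whatever is still in `added` is genuinely new content.
    for path in sorted(added):
        actions.append(("move", path))

    # Assumption: a path present on both sides with new content is moved in again.
    for path in sorted(set(persisted) & set(workspace)):
        if persisted[path] != workspace[path]:
            actions.append(("move", path))
    return actions
```

Keeping the planning separate from the file operations makes the diff easy to inspect or log before anything in Stacks is touched.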
Dissemination: disseminationWF (robot suite [??] | workflow definition)
- Clean up workspace?
gisAssemblyWF
- start-gis-assembly-workflow
- register-druid
- author-metadata
- approve-metadata
- extract-thumbnail
- extract-iso19139
- generate-geo-metadata
- generate-mods
- assign-placenames
- finish-metadata
- wrangle-data
- approve-data
- package-data
- normalize-data
- extract-boundingbox
- finish-data
- generate-content-metadata
- load-geo-metadata
- finish-gis-assembly-workflow
- start-assembly-workflow
- start-delivery-workflow
eemsAccessionWF
- register-object
- submit-tech-services
- eems-transfer
- submit-marc
- check-marc
- catalog-status
- other-metadata
- start-accession
hydrusAssemblyWF
- start-deposit
- submit
- approve
- start-assembly
etdSubmitWF
- register-object
- submit
- reader-approval
- registrar-approval
- submit-marc
- check-marc
- catalog-status
- other-metadata
- start-accession
- binder-transfer
wasSeedPreassemblyWF
- start
- build-was-seed-druid-tree
- desc-metadata-generator
- thumbnail-generator
- content-metadata-generator
- end-was-seed-preassembly
wasCrawlPreassemblyWF
- start
- build-was-crawl-druid-tree
- metadata-extractor
- content-metadata-generator
- technical-metadata-generator
- desc-metadata-generator
- end-was-crawl-preassembly
registrationWF
- register
- Digitization
digitizationWF
- initiate
- digitize
- start-accession
dpgImageWF
- initiate
- tracking_db
- scan
- completeness
- postprocessing
- imageqc
- import_files
- md5_gen
- copy_to_assembly
- md5_verify_assembly
- delete_scratch
- Digitized
goobiWF
- start
- goobi-notify
versioningWF
- start-version
- submit-version
- start-accession
gisDeliveryWF
- start-gis-delivery-workflow
- load-vector
- load-raster
- load-geoserver
- load-geowebcache
- seed-geowebcache
- finish-gis-delivery-workflow
- start-gis-discovery-workflow
releaseWF
- start
- release-members
- release-publish
- update-marc
wasCrawlDisseminationWF
- start
- cdx-generator
- cdx-merge-sort-publish
- path-indexer
wasDisseminationWF
- start
- start-special-dissemination
wasSeedDisseminationWF
- start
- update-thumbnail-generator
gisDiscoveryWF
- start-gis-discovery-workflow
- generate-geoblacklight
- load-geoblacklight
- export-opengeometadata
- finish-gis-discovery-workflow
swIndexWF
- indexed_to_localhost
- indexed_to_sw-solr-test
preservationAuditWF
- moab-valid
- preservation-audit
sdrAuditWF
- audit-verify
sdrIngestWF
- start-ingest
- register-sdr
- transfer-object
- validate-bag
- verify-agreement
- complete-deposit
- update-catalog
- create-replica
- ingest-cleanup
sdrMigrationWF
- migration-complete
- migration-metadata
- migration-register
- migration-start
- migration-transfer
googleScannedBookWF
- register-object
- descriptive-metadata
- google-convert
- google-download
- process-content
- sdr-ingest-transfer
- sdr-ingest-deposit
- shelve
- cleanup
- sdr-ingest-archive