Project: MOC Proof-of-Concept #105

Closed
13 tasks done
cmbz opened this issue Dec 18, 2023 · 6 comments
Assignees
Labels

  • Dataverse Project: Issues related to Dataverse Project software
  • Project: MOC: Deploy containerized Dataverse on MOC infrastructure
  • Proof of Concept: Issue relates to a proof-of-concept deliverable

Comments

@cmbz
Contributor

cmbz commented Dec 18, 2023

Purpose

  • Deploy containerized Dataverse on Mass Open Cloud to support large data and computing
  • Present demo of proof-of-concept at the Mass Open Cloud Alliance Conference on 2024/02/28

Background

  • Mass Open Cloud Alliance is a "partnership between higher education, medical research centers, government, and industry provides structure and resources that enable a close collaboration between research, development, and operations in a series of interconnected projects."
  • The MOC team wants to test a containerized Dataverse installation on the MOC infrastructure to support large data and computing.

Participants

Timeline

  • Start: 2024/01/08
    • Milestone: Deliver containerized Dataverse installation (2024/01/15)
    • Milestone: Binder equivalent functionality implemented (2024/01/31)
    • Upload presentation (2024/02/23)
    • Milestone: Presentation (2024/02/24)
    • Close out tasks
  • End: 2024/04/01

Tasks

Issues

Related

Technical Resources

  • NESE tape resources
  • NESE disk resources

Project Resources

cmbz added the "Epic" label (Issue is a project epic) Dec 18, 2023
cmbz self-assigned this Dec 18, 2023
cmbz added the "Project: MOC" label (Deploy containerized Dataverse on MOC infrastructure) Dec 18, 2023
cmbz changed the title from "Epic: MOC Proof-of-Concept" to "Project: MOC Proof-of-Concept" Dec 19, 2023
cmbz added the "Proof of Concept" label (Issue relates to a proof-of-concept deliverable) and removed the "Epic" label (Issue is a project epic) Dec 19, 2023
cmbz added the "Dataverse Project" label (Issues related to Dataverse Project software) Jan 18, 2024
@cmbz
Contributor Author

cmbz commented Jan 29, 2024

2024/01/29: Updated with new task: #172

@landreev

landreev commented Feb 6, 2024

A possible dependency: IQSS/dataverse#10302.
I can make a PR very quickly if/when needed.

@landreev

landreev commented Feb 12, 2024

Just a quick status checklist on the NERC deployment side:

  • Basic Dataverse instance created (a Docker deployment in an OpenStack VM)
  • Container Store configured (2TB)
  • Access to the S3 bucket from within or outside the cluster
    • specifically, Dataverse instance configured with S3 access credentials
    • upload into the bucket via Dataverse tested
      • DvUploader works like a charm (tested with the GEOS-Chem data files used by the presentation notebook; see below)
      • uploads via the UI need to be re-tested
  • The Computation node setup remains the major component that's still missing, so this is the main focus of the current effort
    • We got the Python notebook from GEOS-Chem. It's nowhere near ready to be deployable in a standalone OpenShift Jupyter pod as has been the plan so far. I will be working with the author. Project: GEOS-Chem Proof-of-Concept #154 will be used to track the effort.
    • Once we get the notebook deployment figured out, the "external tool" mechanism will need to be added to it to perform the basic initial interaction with Dataverse for obtaining the storage locations of the files.
  • NESE tape storage volume needs to be set up and configured on the Dataverse instance via Globus, as secondary storage for the demo
    • Created a dedicated Globus ID and had roles set up by NESE giving it access to the separate tape and disk pools, so I can configure the demo instance with these Globus credentials to facilitate data uploads to tape
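
For illustration only, here is a minimal Python sketch of the "basic initial interaction" mentioned above: asking Dataverse for a dataset's file list and reading each file's storage location. It is not the deployed external tool. It assumes the native API file listing returns a `storageIdentifier` of the form `<store>://<bucket>:<object id>` for S3-backed files; the host name, DOI, and bucket name used with it are hypothetical placeholders.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen


def files_url(site_url, dataset_pid):
    """Build the native API URL that lists files in the latest dataset version."""
    query = urlencode({"persistentId": dataset_pid})
    base = site_url.rstrip("/")
    return f"{base}/api/datasets/:persistentId/versions/:latest/files?{query}"


def parse_storage_identifier(storage_identifier):
    """Split '<store>://<bucket>:<object id>' into (store, bucket, object_id).

    Assumption: stores without a bucket (for example 'file://<id>') carry no
    ':' after the scheme, in which case bucket is returned as None.
    """
    store, sep, rest = storage_identifier.partition("://")
    if not sep:
        raise ValueError(f"unrecognized storage identifier: {storage_identifier!r}")
    bucket, sep, object_id = rest.partition(":")
    if not sep:
        return store, None, rest
    return store, bucket, object_id


def storage_locations(listing):
    """Map each file label in an API listing to its parsed storage location."""
    return {
        entry["label"]: parse_storage_identifier(entry["dataFile"]["storageIdentifier"])
        for entry in listing.get("data", [])
    }


def fetch_listing(site_url, dataset_pid, api_token):
    """Fetch the file listing; needs network access and a valid API token."""
    request = Request(
        files_url(site_url, dataset_pid),
        headers={"X-Dataverse-key": api_token},
    )
    with urlopen(request) as response:
        return json.load(response)
```

A notebook launched through the external tool mechanism could call `storage_locations(fetch_listing(site_url, dataset_pid, api_token))` with the values Dataverse passes to it, then read the objects straight from the bucket rather than downloading them through Dataverse.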

@landreev

edited the checklist to add the NESE tape volume.

@landreev

Marked the "NESE tape" item as complete on the checklist - under the assumption that it is complete "for the purposes of the MOC PoC presentation slides". I'm opening a new issue dedicated to figuring out a working setup, with a functioning Borealis app hosted on the server, etc., that can be used by real users, that we can later replicate in IQSS prod.

@cmbz
Contributor Author

cmbz commented Mar 1, 2024

2024/03/01
