- Argonne Training Program on Extreme-Scale Computing Application Deadline Extended to Today!
- IDEAS-ECP Webinar on "Software Design Patterns in Research Software with Examples from OpenFOAM" on Wednesday
- Perlmutter Machine Status
- Reduction in Perlmutter Node Availability during Cooling System Physical Maintenance
- (NEW) Perlmutter Maintenances, Explained
- IDEAS-ECP Webinar on "Evaluating Performance Portability of HPC Applications & Bencharks across Diverse HPC Architectures" April 13
- Learn to Use Spin to Build Science Gateways at NERSC: Next SpinUp Workshop Starts April 20!
March 2022
Su Mo Tu We Th Fr Sa
1 2 3 4 5
6 *7* 8 *9**10* 11 12 7 Mar ATPESC Applications Due [1]
9 Mar IDEAS-ECP Monthly Webinar [2]
10 Mar Perlmutter Maintenance [3]
13 14 15 *16* 17 18 19 16 Mar Cori Monthly Maintenance [4]
20 21 22 23 *24* 25 26 24 Mar Perlmutter Rolling Update [5]
27 28 29 30 31
April 2022
Su Mo Tu We Th Fr Sa
1 2
3 4 5 6 *7* 8 9 7 Apr Perlmutter Maintenance [3]
10 11 12 *13* 14 15 16 13 Apr IDEAS-ECP Monthly Webinar [6]
17 18 19 *20**21* 22 23 20 Apr Cori Monthly Maintenance [7]
20 Apr SpinUp Workshop [8]
21 Apr Perlmutter Rolling Update [5]
24 25 26 27 28 29 30
May 2022
Su Mo Tu We Th Fr Sa
1 2 3 4 5 6 7
8 9 10 11 12 13 14
15 16 17 *18* 19 20 21 18 May Cori Monthly Maintenance [7]
22 23 24 25 26 27 28
29 *30* 31 30 May Memorial Day Holiday [9]
- March 7, 2022: ATPESC Applications Due
- March 9, 2022: IDEAS-ECP Monthly Webinar
- March 10, & April 7, 2022: Perlmutter Maintenance
- March 16, 2022: Cori OS & Prog Env Upgrade
- March 24, & April 21, 2022: Perlmutter Rolling Update
- April 13, 2022: IDEAS-ECP Monthly Webinar
- April 20 & May 18, 2022: Cori Monthly Maintenance
- April 20, 2022: SpinUp Workshop
- May 30, 2022: Memorial Day Holiday (No Consulting or Account Support)
- All times are in the Pacific time zone
Upcoming Planned Outage Dates (see Outages section for more details)
- Wednesday: HPSS Regent (Backup)
- Thursday: Perlmutter
Other Significant Dates
- June 15, July 20, & August 17, 2022: Cori Monthly Maintenance Window
- June 20, 2022: Juneteenth Holiday (No Consulting or Account Support)
- June 22, August 10, October 5, & November 30, 2022: SpinUp Workshops
- July 4, 2022: Independence Day Holiday (No Consulting or Account Support)
- September 5, 2022: Labor Day Holiday (No Consulting or Account Support)
- November 24-25, 2022: Thanksgiving Holiday (No Consulting or Account Support)
- December 23, 2022-January 2, 2023: Winter Shutdown (Limited Consulting and Account Support)
Berkeley Lab, where NERSC is located, is operating under public health restrictions. NERSC remains open while following site-specific protection plans, with the majority of NERSC staff working remotely and staff essential to operations onsite. We do not expect significant changes to our operations in the next few months.
You can continue to expect regular online consulting and account support as well as schedulable online appointments. Trainings continue to be held online. Regular maintenances on the systems continue to be performed while minimizing onsite staff presence, which could result in longer downtimes than would occur under normal circumstances.
Because onsite staffing is so minimal, we request that you continue to refrain from calling NERSC Operations except to report urgent system issues.
For current NERSC systems status, please see the online MOTD and current known issues webpages.
Are you a doctoral student, postdoc, or computational scientist looking for advanced training on the key skills, approaches, and tools to design, implement, and execute computational science and engineering applications on current high-end computing systems and the leadership-class computing systems of the future? If so, consider applying for the Argonne Training Program on Extreme-Scale Computing (ATPESC).
The core of the two-week program focuses on programming methodologies that are effective across a variety of supercomputers and applicable to exascale systems. Additional topics to be covered include computer architectures, mathematical models and numerical algorithms, approaches to building community codes for HPC systems, and methodologies and tools relevant for Big Data applications. This year's program will be held July 31-August 12 in the Chicago area. There is no cost to attend. Domestic airfare, meals, and lodging are provided.
For more information and to apply, please see https://extremecomputingtraining.anl.gov/. The application deadline has been extended to today, March 7, 2022.
IDEAS-ECP Webinar on "Software Design Patterns in Research Software with Examples from OpenFOAM" on Wednesday
The March webinar in the Best Practices for HPC Software Developers series is entitled "Software Design Patterns in Research Software with Examples from OpenFOAM", and will take place this Wednesday, March 9, at 10:00 am Pacific time.
In this webinar, Tomislav Marić (TU Darmstadt) will discuss beneficial software design patterns that provide a solid basis for developing numerical methods in a modular way, drawing concrete examples from OpenFOAM, a highly modular open-source software package for computational fluid dynamics.
There is no cost to attend, but registration is required. Please register here.
The initial phase of the Perlmutter supercomputer is in the NERSC machine room, running user jobs.
We have added many early users onto the machine. We hope to add even more users soon. Anyone interested in using Perlmutter may apply using the Perlmutter Access Request Form.
The second phase of the machine, consisting of CPU-only nodes, will begin arriving next month. After all the new nodes arrive, all of Perlmutter will be taken out of service and integrated over a period that we anticipate could take up to 8 weeks. We are developing an integration plan that will reduce the amount of time the entire system is down, and we will let you know when it is finalized.
This newsletter item will be updated each week with the latest Perlmutter status.
The Perlmutter Phase 1 system, which is currently in its early user pre-production stage, requires physical maintenance of the cooling system that will take up to 6 weeks to complete. Rather than shut down the entire machine, NERSC will perform the maintenance in a rolling fashion with the aim of keeping 500 or more nodes available to users. Occasionally, some jobs may see decreased GPU performance during this time. We will try to keep as much of the system available as possible, but please understand that Perlmutter is not yet a production resource with any uptime guarantees.
Perlmutter currently follows a fortnightly maintenance schedule. You may have noticed that while some maintenances require a full system downtime, others are rolling updates where the only disruption to users is a brief disconnection from login nodes and the possibility of a longer job startup time.
New system management technology allows us to perform all but the most invasive of operations on a rolling basis. We will use this technology to minimize (but likely not completely eliminate) Perlmutter downtime.
In order to remain in compliance with minimum requirements for support from HPE/Cray, Cori will undergo an operating system (OS) upgrade during the scheduled maintenance next Wednesday, March 16, 2022.
At that time, we will also update the default user programming environment on Cori for the first time since January 2020. The default Cray Developer Toolkit (CDT) version will change from 19.11 to 22.02, and the Intel compiler default will change from 19.0.3.199 to 19.1.2.254. A detailed list of software changes (including cray-mpich, cray-libsci, cray-netcdf, cray-hdf5, gcc, cce, intel, perftools, etc.) can be found here. NERSC-supported software will be updated to be compatible with the new OS and CDT. Users will need to relink all statically compiled codes, and we highly recommend rebuilding all your applications.
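As a rough illustration of the post-upgrade rebuild workflow (the path and build commands below are placeholders; adapt them to your own application and build system):

    # On Cori, after the March 16 maintenance:
    module list          # confirm the new default CDT and compiler versions
    cd $HOME/my_app      # placeholder path to your application source
    make clean           # discard objects built against the old environment
    make                 # recompile and relink against the new defaults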
Due to the upgrade of the operating system on Cori next week, the two earliest E4S versions on Cori, 20.10 and 21.02, will be deprecated at that time. The module files for these versions have been updated to inform you of this change. We encourage you to start using newer versions of E4S at this time.
We are pleased to announce that the E4S/21.11 software stack has been rebuilt for Perlmutter using GCC version 9.3.0 and NVIDIA version 21.9. We have deployed a subset of the most commonly used elements of the software stack. It is accessible via "module load e4s/21.11-tcl" or "module load e4s/21.11-lmod". Both point to the same Spack instance but employ two different types of module trees.
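As a quick sketch of how you might access a package from the stack (the package name below is only an example; run "spack find" after loading the module to see what is actually installed):

    # On Perlmutter:
    module load e4s/21.11-tcl    # or e4s/21.11-lmod
    spack find                   # list the packages in this Spack instance
    spack load hdf5              # example package; substitute one you need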
In addition, we have released instructions on using a containerized deployment of E4S via Shifter. The container, provided by the E4S team, includes the full E4S software stack built on Ubuntu 20.04.
For more information, please see the E4S documentation at https://docs.nersc.gov/applications/e4s/perlmutter/21.11/.
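For those who prefer the containerized route, a hypothetical Shifter session might look like the following; the image name is a placeholder, so consult the documentation linked above for the actual image to pull:

    # Pull the containerized E4S image and start a shell inside it
    # (the image name below is a placeholder; see the docs for the real one):
    shifterimg pull docker:ecpe4s/e4s-image-placeholder:21.11
    shifter --image=docker:ecpe4s/e4s-image-placeholder:21.11 /bin/bash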
Are you an undergraduate or graduate student looking for a summer internship opportunity? Consider applying for a summer internship at NERSC! NERSC hosts a number of paid internships on a variety of topics every year.
Please check out the growing list of internship projects on our website. If you're interested in a project, reach out to the appropriate point of contact directly with your CV/resume.
IDEAS-ECP Webinar on "Evaluating Performance Portability of HPC Applications & Bencharks across Diverse HPC Architectures" April 13
The next webinar in the Best Practices for HPC Software Developers series is entitled "Evaluating Performance Portability of HPC Applications and Benchmarks across Diverse HPC Architectures," and will take place Wednesday, April 13, at 10:00 am Pacific time.
In this webinar, JaeHyuk Kwack (Argonne National Laboratory) will discuss the progress being made on achieving performance portability by a subset of ECP applications or their related mini-apps, and approaches to achieving performance portability across diverse HPC architectures, including AMD, Intel, and NVIDIA GPUs.
There is no cost to attend, but registration is required. Please register here.
Spin is a service platform at NERSC based on Docker container technology. It can be used to deploy science gateways, workflow managers, databases, and all sorts of other services that can access NERSC systems and storage on the back end. New large-memory nodes have been added, expanding the platform's capacity for memory-constrained applications. To learn more about how Spin works and what it can do, please listen to the NERSC User News podcast on Spin: https://anchor.fm/nersc-news/episodes/Spin--Interview-with-Cory-Snavely-and-Val-Hendrix-e1pa7p.
Attend an upcoming SpinUp workshop to learn to use Spin for your own science gateway projects! Applications for sessions that begin on Wednesday, April 20 are now open. SpinUp is hands-on and interactive, so space is limited.
Participants will attend an instructional session and a hack-a-thon to learn about the platform, create running services, and learn maintenance and troubleshooting techniques. Local and remote participants are welcome.
If you can't make these upcoming sessions, never fear! The next session begins June 22, and more are planned for August, October, and November.
See a video of Spin in action at the Spin documentation page.
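To give a flavor of the container-based workflow, here is a minimal, hypothetical example of building and test-running a service image locally before bringing it to Spin (the image name and port are invented for illustration; SpinUp covers the actual deployment steps):

    # Build and test a simple web-service image on a local workstation:
    docker build -t myuser/my-gateway:v1 .     # hypothetical image and tag
    docker run --rm -p 8080:8080 myuser/my-gateway:v1
    # Once the image works locally, it can be deployed as a service in Spin.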
NERSC currently has several openings for postdocs, system administrators, and more! If you are looking for new opportunities, please consider the following openings:
- NEW Linux Systems Administrator & DevOps Engineer: Help to build and manage container and virtual machine platforms and high-performance storage that complement the supercomputing environment.
- NEW Cyber Security Engineer: Join the team to help protect NERSC resources from malicious and unauthorized activity.
- Data & Analytics Team Group Leader: Provide vision and guidance and lead a team that provides data management, analytics and AI software, support, and expertise to NERSC users.
- NESAP for Data Postdoctoral Fellow: Collaborate with computational and domain scientists to enable extreme-scale scientific data analysis on NERSC's Perlmutter supercomputer.
- HPC Architecture and Performance Engineer: Contribute to the effort to develop a complete understanding of the issues leading to improved application and computer-system performance on extreme-scale advanced architectures.
- Machine Learning Postdoctoral Fellow: Collaborate with computational and domain scientists to enable machine learning at scale on NERSC's Perlmutter supercomputer.
- Scientific Data Architect: Collaborate with scientists to meet their Data, AI, and Analytics needs on NERSC supercomputers.
- Exascale Computing Postdoctoral Fellow: Collaborate with ECP math library and scientific application teams to enable the solution of deep, meaningful problems targeted by the ECP program and other DOE/Office of Science program areas.
- Machine Learning Engineer: Apply machine learning and AI to NERSC systems to improve their ability to deliver productive science output.
- HPC Performance Engineer: Join a multidisciplinary team of computational and domain scientists to speed up scientific codes on cutting-edge computing architectures.
(Note: You can browse all our job openings on the NERSC Careers page, and all Berkeley Lab jobs at https://jobs.lbl.gov.)
We know that NERSC users can make great NERSC employees! We look forward to seeing your application.
- Cori
- 03/16/22 07:00-20:00 PDT, Scheduled Maintenance
- 04/20/22 07:00-20:00 PDT, Scheduled Maintenance
- 05/18/22 07:00-20:00 PDT, Scheduled Maintenance
- 06/15/22 07:00-20:00 PDT, Scheduled Maintenance
- Perlmutter
- 03/10/22 08:00-17:00 PST, Scheduled Maintenance
- The system will be unavailable.
- 03/24/22 08:00-17:00 PDT, Scheduled Maintenance
- Rolling updates may result in brief disconnections from login nodes and longer job startup times.
- 04/07/22 08:00-17:00 PDT, Scheduled Maintenance
- The system will be unavailable.
- 04/21/22 08:00-17:00 PDT, Scheduled Maintenance
- Rolling updates may result in brief disconnections from login nodes and longer job startup times.
- Authentication Services
- 03/08/22 12:00-13:00 PST, Scheduled Maintenance
- Web-based logins will be briefly unavailable
- HPSS Regent (Backup)
- 03/09/22 09:00-13:00 PST, Scheduled Maintenance
- System available, retrievals may be delayed due to tape library firmware updates
- 03/23/22 09:00-17:00 PDT, Scheduled Maintenance
- System available, retrievals may be delayed due to tape library preventative maintenance
- DNA
- 03/16/22 11:00-14:00 PDT, Scheduled Maintenance
- Users may see degraded performance while we perform maintenance on DNA.
Visit http://my.nersc.gov/ for the latest status and outage information.
You are receiving this email because you are the owner of an active account at NERSC. This mailing list is automatically populated with the email addresses associated with active NERSC accounts. In order to remove yourself from this mailing list, you must close your account, which can be done by emailing accounts@nersc.gov with your request.