Upcoming Maintenance - July 27-28, 2021

From Research Computing Center Wiki
Revision as of 13:38, 18 June 2021 by Shtsai (talk | contribs)


Summary of this message:

   Sapelo2 will be shut down for maintenance on July 27, 2021 at 6 a.m. and will remain unavailable until 6 p.m. on July 28, 2021 (tentative end-of-maintenance date).
   All jobs running on Sapelo2 at 6 a.m. on July 27 will be deleted. 
   Sapelo2 and the xfer nodes will be unavailable during this time.
   The teaching cluster will not be affected.


Per GACRC's standing policy on regular maintenance, our next scheduled window is on July 27 and 28, 2021. We plan to service Lustre hardware, upgrade the Lustre and ZFS storage software, perform maintenance on the InfiniBand network, upgrade the Slurm queueing system and make minor changes to its configuration, and reinstall the compute nodes with an upgraded system image (the OS version will remain the same).


All Sapelo2 jobs still running when the maintenance begins at 6:00 a.m. on July 27 will be terminated. Because of this, we have placed a "reservation" on the queueing system, so that a job will start only if its requested walltime would permit it to finish before 2 a.m. on July 27.
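
As a rough illustration of how the reservation affects a job submission, the sketch below computes the longest walltime that could still be requested at a given moment and formats it in Slurm's days-hours:minutes:seconds notation. The function names are hypothetical, the cutoff is the one quoted above, and times are assumed to be local cluster time; this is not an official GACRC tool.

```python
from datetime import datetime, timedelta

# Cutoff quoted in the announcement (assumed to be local cluster time).
RESERVATION_START = datetime(2021, 7, 27, 2, 0)


def max_walltime(now, cutoff=RESERVATION_START):
    """Return the longest walltime that still ends by the cutoff, or None."""
    remaining = cutoff - now
    if remaining <= timedelta(0):
        return None  # the reservation has already begun
    return remaining


def format_slurm_time(delta):
    """Format a timedelta as D-HH:MM:SS, a time format Slurm accepts."""
    total = int(delta.total_seconds())
    days, rem = divmod(total, 86400)
    hours, rem = divmod(rem, 3600)
    minutes, seconds = divmod(rem, 60)
    return f"{days}-{hours:02d}:{minutes:02d}:{seconds:02d}"


if __name__ == "__main__":
    limit = max_walltime(datetime.now())
    if limit is None:
        print("The reservation has started; new jobs will wait until after maintenance.")
    else:
        print("Longest walltime that can still start:", format_slurm_time(limit))
```

A job that requests more than this value (for example with the --time option of sbatch) would be expected to stay pending until the maintenance is over.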


The teaching cluster will not be affected, and it will remain available during the maintenance.

If you have questions, please let us know via the web form:

 https://uga.teamdynamix.com/TDClient/Requests/ServiceCatalog?CategoryID=11593