Rocky 8 Transition Guide

From Research Computing Center Wiki
Jump to navigation Jump to search

Introduction

As part of our August 29-31,2023 maintenance window, GACRC will be upgrading the Sapelo2 cluster operating system from CentOS 7 to Rocky 8.

Why is a major Operating System (OS) update necessary?

  • Existing OS is End of Life - There are no more full version updates being released for the existing operating system and newer versions of some software applications are not supported by the current OS version.
  • Bringing New Nodes Online - As development within the existing OS has stopped, some of the latest generation of compute node hardware cannot use it, needing driver types newer than what this OS has. New hardware and architecture that we will be bringing online soon requires this OS update.
  • Security Improvements - In order to keep our cluster as up to date as possible, these kinds of big OS updates need to happen.
  • Why Rocky 8? - A good portion of the HPC centers is adopting it, which means there is a good amount of community support.

What does this mean to you and your workflows?

Overview

  • We are not changing anything from the data storage standpoint. All existing /home, /scratch, /work, /project spaces will retain the existing data.
  • The compiler toolchains and many software packages will be updated to newer versions.
  • Because this is a major OS update, we need to recompile all the applications and ensure that they work with the new version of OS.
  • We will have as comprehensive a software suite available on the new OS as possible, but some less widely used applications and older version software will not be immediately available. If a specific older version of software is required, please let us know ahead of time, by entering the software name and version into the Google doc below, so we can add that to our priority list:

https://docs.google.com/document/d/1wAw6ox54xsvMWP3NVP0wdFyBVQgKgLeUa3bRFo33NSM/edit?usp=sharing

  • As software modules will be reinstalled and updated, all pending jobs will be canceled during the maintenance window, to prevent job failure due to changes in the module names post maintenance.

Storage

There will be no changes to the storage system at this maintenance window. All existing /home, /scratch, /work, /project, /db spaces will be available after the maintenance and they will retain the existing data.

Software

Warning

Because this is a major change in the operating system, most user software built on CentOS 7 will not work and will need to be rebuilt. Even if the programs run without being rebuilt the change in the underlying libraries may impact code execution and results. Therefore, users should test and verify that their codes are producing the expected results on the new operating system.

Compiler toolchains

Centrally installed modules

Conda environments

Python packages

R packages

Singularity containers

Queueing System