Difference between revisions of "Georgia Advanced Computing Resource Center"

From Research Computing Center Wiki
Jump to navigation Jump to search
 
(290 intermediate revisions by 11 users not shown)
Line 1: Line 1:
==Aug 22 outage==
+
__NOTOC__
"W" means "don't know When this task will be done"
+
Welcome to the Georgia Advanced Computing Resource Center wiki. The information provided here is a supplement to the GACRC webpage. The GACRC online information resources include:
 
Soon: /etc/motd on pcluster, zcluster
 
 
Sometime Tuesday: message to users
 
 
Wednesday:
 
2 PM: VM snapshots
 
 
3 PM:
 
  disable logins pcluster
 
  disable logins zcluster (except for jkissing, students)
 
 
  kick users off pcluster
 
  kick users off zcluster?
 
 
  drain all nodes or queues, pcluster
 
  disable all queues except somedevq, zcluster
 
 
  kill all jobs pcluster
 
  kill all jobs zcluster
 
 
  do GE jobs testing MPI and storage I/O throughput
 
    CC to use fsr15 for iozone
 
    ST to use fsr12 for MPI/NAMD
 
    PB to use fsr7 for dumb I/O
 
 
  stop execd on all nodes, zcluster
 
  shut down racks 8,9,10,11
 
 
4 PM:
 
  CC reconfigs NICs/LAGs on storage units
 
  PB modify PanFS blksize on remaining nodes
 
  PB shut down VMs
 
  shut down 3070s
 
  shut down Panasas
 
 
  connect ESX IPMI cat5
 
      W: "final rsyncs" of /db, /usr/local
 
              W: Curtis reconfig storage unit NICs/LAGs
 
              W: PanFS 16K blksize on remaining nodes and zcluster.rcc
 
 
 
by 5PM:
 
we tell NEG "go ahead"
 
CC recable storage unit cat5, and work with Brian M on switch port configs
 
 
probably between 9 and 12PM, NEG finishes
 
 
****  AFTER ******
 
"Midnight" (when NEG is done):
 
power up 3070s
 
power up Panasas
 
power up ESX servers and VMs
 
start final rsyncs
 
power up racks 8,9,10,11
 
 
"Morning" (starting by 8AM)=======================================================
 
 
panasas OS upgrade
 
switch and test /db, /usr/local mounts on the zcluster
 
 
  do GE jobs testing MPI and storage I/O throughput
 
 
enable Panasas jumbo frames and reboot Panasas
 
 
VMWare updates
 
 
    W: GE upgrade
 
    W: yum updates of nodes
 
    W: yum update of zhead
 
    W: update FW on Dells
 
    W: move some rack15 nodes to rack 16?
 
    W: reinstall rack11?
 
    W: thumper upgrades
 
    W: rccstor upgrades
 
 
  morning            W: VMware updates
 
  
 +
*[http://gacrc.uga.edu/ Web Site] – General overview
 +
*[https://wiki.gacrc.uga.edu/ Wiki] – Software docs and how-to’s - "You Are Here"
 +
*[https://kaltura.uga.edu/channel/GACRC/176125031 Kaltura] – Linux and HPC training videos
 +
<!-- *[https://blog.gacrc.uga.edu/ Blog] – announcements -->
 +
<!-- *[https://forums.gacrc.uga.edu/ Forums] – user discussion area -->
  
switch and test NFS mounts on pcluster
+
<!--Comments on color for the below -->
+
<!-- green background = #00CC33 -->
    W: upgrade PGI compiler
+
<!-- light orange background = #FF9F40 -->
+
<!-- red background = red -->
reenable queues on zcluster
+
<!-- default text, at end of line, is: Online -->
resume queues on pcluster
 
 
enable logins pcluster
 
enable logins zcluster
 
 
email users that outage is over
 
contact Lab Storage users about their mounts
 
  
==Clusters==
+
<div style="width=100%; margin:0; background:#00CC33; font-size:120%; font-weight:bold; border:1px solid #00CC33; text-align:left; color:white; padding:0.2em 0.4em;"> Current Status: <span style="color:black"> Online </span></div>
===Overview===
 
===[[rCluster]]===
 
===[[zCluster]]===
 
====[[ToDo List]]====
 
===[[sCluster]]===
 
====[[scluster todo List]]====
 
===[[VMWare]]===
 
====[[Virtual Machines]]====
 
  
==Storage==
+
<!--
===Overview===
+
<div style="width=100%; margin:0; background:#FF9F40; font-size:120%; font-weight:bold; border:1px solid #FF9F40; text-align:left; color:white; padding:0.2em 0.4em;"> Current Status: <span style="color:black"> Sapelo2 under maintenance, teaching cluster available </span></div>
===[[NAS]]===
+
-->
===[[SAN]]===
+
<!--
==Networking==
+
<div style="width=100%; margin:0; background:#FF9F40; font-size:120%; font-weight:bold; border:1px solid #FF9F40; text-align:left; color:white; padding:0.2em 0.4em;"> Current Status: <span style="color:black"> Teaching cluster inaccessible while the scheduled UGA network maintenance is on-going</span></div>
===Overview===
+
-->
===[[VLANs]]===
+
 
===[[IP Networks]]===
+
<!--
==Physical Hosts==
+
<div style="width=100%; margin:0; background:#00CC33; font-size:120%; font-weight:bold; border:1px solid #00CC33; text-align:left; color:white; padding:0.2em 0.4em;"> Current Status: <span style="color:black"> Sapelo2 Cluster Online </span></div>
 +
 
 +
<div style="width=100%; margin:0; background:#FF9F40; font-size:120%; font-weight:bold; border:1px solid #FF9F40; text-align:left; color:white; padding:0.2em 0.4em;"> Current Status: <span style="color:black"> Sapelo decommissioned</span></div>
 +
-->
 +
 
 +
<div style="width=100%; margin:0; background:#333333; font-size:120%; font-weight:bold; border:1px solid #f9f9f9; text-align:left; color:#eeeeee; padding:0.2em 0.4em;"> IMPORTANT NEWS </div>
 +
The following is an important notice for all of our current users:
 +
 
 +
* GACRC offering in-person drop-in '''[[Office Hours]]'''.
 +
 
 +
<blockquote style="background-color: lightyellow; border: solid thin grey;">
 +
'''May Office Hours:'''
 +
*'''Wednesday May 8th, 3:00-4:30 pm''' at the McBay Science library, Main floor
 +
*'''Monday May 13th, 10:00 am-12:00 pm''' at the Physics Building in room 315
 +
*'''Wednesday May 22nd, 3:00-4:30 pm''' at the McBay Science library, Main floor
 +
</blockquote>
 +
 
 +
<div style="width=100%; margin:0; background:#333333; font-size:120%; font-weight:bold; border:1px solid #f9f9f9; text-align:left; color:#eeeeee; padding:0.2em 0.4em;"> Getting Started </div>
 +
Welcome to the Georgia Advanced Computing Resource Center at the University of Georgia. If you're new to the GACRC, start with these links to get acquainted with our resources.
 +
*[[User Accounts]]
 +
*[[Instructional Accounts]]
 +
*[[Connecting]]
 +
*[[Transferring Files]]
 +
*[[Password | Changing your Password]]
 +
*[[Frequently Asked Questions | FAQ]]
 +
*[https://wiki.gacrc.uga.edu/wiki/Quick_Reference_Guide Command List]
 +
*[[Getting Help]]
 +
*[[Policies]]
 +
*[[Consulting]]
 +
*[[Training]]
 +
 
 +
 
 +
<div style="width=100%; margin:0; background:#333333; font-size:120%; font-weight:bold; border:1px solid #f9f9f9; text-align:left; color:#eeeeee; padding:0.2em 0.4em;"> System Information </div>
 +
Hardware information and operational procedures are described below.
 +
*[[Systems]]
 +
*[[Disk Storage]]
 +
<!-- * [[Sapelo2 and Sapelo2 (old) comparison]] -->
 +
 
 +
 
 +
<div style="width=100%; margin:0; background:#333333; font-size:120%; font-weight:bold; border:1px solid #f9f9f9; text-align:left; color:#eeeeee; padding:0.2em 0.4em;"> Job and Data Management </div>
 +
Information on how to run jobs and data management.
 +
*[[Running Jobs]]
 +
*[[Monitoring Jobs]]
 +
*[[Job Submission Partitions]]
 +
*[[Sample Scripts | Sample Job Submission Scripts]]
 +
*[[Migrating from Torque to Slurm]]
 +
*[[Troubleshooting on Sapelo2]]
 +
*[[Best Practices]]
 +
*[[Globus]]
 +
*[[OnDemand | Open OnDemand]]
 +
 
 +
 
 +
<div style="width=100%; margin:0; background:#333333; font-size:120%; font-weight:bold; border:1px solid #f9f9f9; text-align:left; color:#eeeeee; padding:0.2em 0.4em;"> Software and Libraries </div>
 +
Documentation for software applications, programming tools, and usage.
 +
*[[Software]]
 +
*[[Available Toolchains and Toolchain Compatibility]]
 +
*[[Bioinformatics Databases]]
 +
*[[OpenMP]]
 +
*[[MPI | Message Passing Interface (MPI)]]
 +
*[[Compilers]]
 +
*[[GPU|GPU and CUDA Programming]]
 +
*[[Installing Applications]]
 +
 
 +
 
 +
<!--
 +
* [[Galaxy]]
 +
* [[Zaney]]
 +
 
 +
<div style="width=100%; margin:0; background:#eeeeee; font-size:120%; font-weight:bold; border:1px solid #f9f9f9; text-align:left; color:#eeeeee padding:0.2em 0.4em;">
 +
[[GACRC Knowledge Base]]</div>
 +
<br />
 +
<div style="width=100%; margin:0; background:#eeeeee; font-size:120%; font-weight:bold; border:1px solid #f9f9f9; text-align:left; color:#eeeeee padding:0.2em 0.4em;">
 +
[[GACRC Advisory Committee]]</div>
 +
-->

Latest revision as of 15:18, 24 April 2024

Welcome to the Georgia Advanced Computing Resource Center wiki. The information provided here is a supplement to the GACRC webpage. The GACRC online information resources include:

  • Web Site – General overview
  • Wiki – Software docs and how-to’s - "You Are Here"
  • Kaltura – Linux and HPC training videos


Current Status: Online


IMPORTANT NEWS

The following is an important notice for all of our current users:

May Office Hours:

  • Wednesday May 8th, 3:00-4:30 pm at the McBay Science library, Main floor
  • Monday May 13th, 10:00 am-12:00 pm at the Physics Building in room 315
  • Wednesday May 22nd, 3:00-4:30 pm at the McBay Science library, Main floor
Getting Started

Welcome to the Georgia Advanced Computing Resource Center at the University of Georgia. If you're new to the GACRC, start with these links to get acquainted with our resources.


System Information

Hardware information and operational procedures are described below.


Job and Data Management

Information on how to run jobs and data management.


Software and Libraries

Documentation for software applications, programming tools, and usage.