Difference between revisions of "Xcluster Go-Live"

From Research Computing Center Wiki
Jump to navigation Jump to search
Line 59: Line 59:
 
===PB notes===
 
===PB notes===
  
* note: don't want perlwrapper this time
+
* '''note''': don't want perlwrapper this time
  
* research: will scratch icebreakers run over Ethernet also, or only IB, for storage?
+
* '''research''': will scratch icebreakers run over Ethernet also, or only IB, for storage?
  
* research: try out POD
+
* '''research''': try out POD
  
* find out: what is difference between building on Intel vs AMD?
+
* '''research''': what is difference between building on Intel vs AMD?
  
* decision: which compiler to use by default?  I think we think gcc.
+
* '''decision''': which compiler to use by default?  I think we think gcc.
  
* find out/decision: will we do "qlogin"?  an interactive queue?  what kind of enforceable resource limits?
+
* '''research/decision''': will we do "qlogin"?  an interactive queue?  what kind of enforceable resource limits?
 
** http://www.nics.tennessee.edu/~troy/pbstools/ for a qlogin
 
** http://www.nics.tennessee.edu/~troy/pbstools/ for a qlogin
 
** dunno about resource limits yet
 
** dunno about resource limits yet
  
* decision: paths for /usr/local apps?
+
* '''decision''': paths for /usr/local apps?
  
* find out: can scyldev see zhead license server?
+
* '''research''': can scyldev see zhead license server?
 
** will know after testing the now-installed PGI
 
** will know after testing the now-installed PGI
  
* decision: RPMs vs not for /usr/local
+
* '''decision''': RPMs vs not for /usr/local
  
 
===Documentation===
 
===Documentation===
 
*cloud, login nodes, etc.
 
*cloud, login nodes, etc.
 
*queuing system
 
*queuing system

Revision as of 13:07, 20 May 2014

Name the cluster!!

  • Win a free hot-apple pie


(Scyld) Development Environment

  • Paul's VM plus physical nodes
  • Intel vs. AMD software builds?


Software

Intel Compiler - do we need it?

Apps to install on xcluster:

  • MPI, multi-core, big-memroy, serial?
  • Popular apps (regardless of type; how to determine?):
    • time stamps on app dirs??
    • access time of executable??

New queueing system:

  • how/what to configure (e.g. fairshare, queues, "complexes")

Use module or not?

Scyld

  • Location of /usr/local/apps, libraries, and languages e.g., Perl
  • A bunch more stuff needed here


Cloud Manager

Lab Group Registration:

  • Lab VM login nodes

User Accounts:

  • New request from PI's
  • CAS/LDAP authentication?
  • affiliate accounts?


Storage

  • HPC IceBreaker (3 x 48TB chains)
    • one chain for /home, /db, and /usr/local?
    • two chains for scratch?
  • Archive IceBreaker (2 x 320TB chains)
    • /oflow and /home, /usr/local backups
  • Lustre ClusterStor
    • scratch for zcluster via 10GBE
    • scratch for xcluster via 10GBE, maybe IB later


PB notes

  • note: don't want perlwrapper this time
  • research: will scratch icebreakers run over Ethernet also, or only IB, for storage?
  • research: try out POD
  • research: what is difference between building on Intel vs AMD?
  • decision: which compiler to use by default? I think we think gcc.
  • decision: paths for /usr/local apps?
  • research: can scyldev see zhead license server?
    • will know after testing the now-installed PGI
  • decision: RPMs vs not for /usr/local

Documentation

  • cloud, login nodes, etc.
  • queuing system