Globus
Introduction
The GACRC, on behalf of UGA, recently procured an institutional standard subscription to Globus for secure, reliable management of UGA's research data. Globus is a high-performance data-transfer platform that allows you to perform and/or automate:
- Data transfers between servers in your group.
- Data transfers between a server and your laptop.
- Sharing data with researchers at other institutions.
- Sharing data with the world.
Data transfers happen unattended and are faster than SCP/SFTP, data verification is on by default, and automatic restarts or continuation of transfers happen after a disruption.
Getting Started
If you are a first time user of Globus, you will need to create an Identity Account. At a minimum you will need to setup your identity using the University of Georgia organizational login in order to access UGA systems.
- Go to https://www.globus.org and choose Login in the upper right corner.
- Search for University of Georgia in the “Use your existing organizational login" box.
- Choose continue and you will be forwarded to a UGA Single Sign-On (SSO) login page. You will also need to authenticate with Duo (two-factor authentication).
When you login for the first time using an existing organization login associated to UGA:
- Globus will ask if you would like to link to an existing account. If you have already used another account with Globus in the past, you can choose "Link to an existing account". Otherwise, click "Continue" to proceed.
- You will need to accept Globus Terms of Service and Privacy Policy and click Continue to proceed.
- You will need to give Globus permission to use your identity to access information and perform actions (like file transfers) on your behalf.
These 3 steps will not be prompted after your first login.
Detailed information with screenshots are provided by Globus at their Getting Started page.
Access GACRC Storage
GACRC maintains a UGA GACRC Collection that can be used to access the Sapelo2 /home, /scratch, /work, and /project file systems.
After you have logged in to Globus, click the File Manager link at the top-left of the window.
In the Collection search box, enter GACRC and you should see UGA GACRC Collection in the list.
Select the UGA GACRC Collection and authenticate with SSO to access your files on Sapelo2. By default, you will open your home directory on Sapelo2. For example:
To access your /scratch or /project areas, enter the full path in the Path field under the Collection name (for example, enter /project/abclab) and then press the Enter or Return key on your keyboard for the change in the path to take effect.
Transfer Data Between GACRC Storage and Desktops/Laptops
There are two ways to use Globus to transfer files between a GACRC storage and desktops or laptops. The method that is best suited depends on the number and size of the files to be transferred. In both cases you would use a browser to Login into https://www.globus.org, as described above, and open the UGA GACRC Collection in the File Manager panel.
Small number of files or small sizes downloaded or uploaded
Uploads - Once you open the UGA GACRC Collection in the File Manager panel, you can select "Upload" button to upload a file from your local machine to the UGA GACRC Collection, to the path you select (e.g. your Sapelo2 /scratch dir).
Downloads - Once you open the UGA GACRC Collection in the File Manager panel, you can navigate the directory where the files are located and select the files you want to download. Then select the "Download" button to download the file(s) to your local machine.
Here is a sample screenshot to download a file called analysis.txt from the user's home directory on Sapelo2 to the local machine:
Many files or large files
Install Globus Connect Personal (GCP) and create a Globus endpoint on your local machine. Globus Connect Personal allows faster and more reliable file transfers. Information on how to install GCP on your local machine and create an endpoint are available on the Globus Connect Personal page.
Once you have installed GCP and created an endpoint on your local machine (e.g. called shtsai-imac in the sample below), navigate to the UGA GACRC Collection in the File Manager panel, select the files or directories you wish to transfer, and click on the double panel icon (top right), as illustrated here:
On the right panel, enter the name of the Collection on your endpoint and select the path where to transfer the data to (or from). By default, file integrity will be checked after the transfer is completed. You can enable other options (e.g. file encryption on transfer) by opening the Transfer & Sync Options menu and selecting the options you wish to use. Once all the settings are chosen, and you are ready to perform the file transfer, click on the Start button. You can check the status of the transfer by going to the Activity panel.
Transferring files between Sapelo2 file systems
To transfer files between two files systems on Sapelo2, navigate to the File Manager two-panel view, and select UGA GACRC Collection on both panels. Enter the path to the location of the files on one panel (e.g. your /project directory) and the location where you want to transfer the file to on the other panel (e.g. your /scratch directory). Select the files to transfer and click Start. You can check a report of the transfer by going to the Activity panel.
Example:
Access Storage Not Hosted by GACRC
Globus can be used to access, share, transfer, or manage data stored on devices outside of the GACRC. In order to do that, the first step is to install Globus Connect and create an endpoint on the system where the data are (or will be) stored.
Two cases:
1. If the data are stored (or will be stored) on a laptop, a personal desktop system, or a laboratory server, you will need to install Globus Connect Personal and become a Globus Plus member.
2. If the data are stored (or will be stored) on a multi-user system, you can install Globus Connect Server.