Copy data from Google Drive to PetaLibrary

Using Globus

Step 1: Log into the Globus Web App

Visit https://app.globus.org and log in with your Colorado.edu credentials

[Image: Globus Login]

Step 2: Open your Google Drive (left pane)

In the Collection search box (left pane), enter: Google Drive CU Boulder. This should show your Google Drive contents.

  • If you’d like to transfer from Team Drives, click the “Up One Directory” button and then select Team Drives.

[Image: Globus Google Drive]

Step 3: Open CURC (right pane)

In the Collection search box (right pane), enter: CU Boulder Research Computing

  • You’ll be asked to log in using your CURC credentials. After you click Authenticate, you should receive a Duo push on your Duo device, which you will need to confirm.

[Image: Globus CURC Authentication]

  • Once authenticated, you should see your CURC home directory. To access PetaLibrary, click “Up One Directory” and then select /pl.

[Image: Globus CURC Collection]

Step 4: Select files/folders from Google Drive to be transferred and initiate the transfer

  • In the left pane, select the file or folder you’d like to transfer and click Start. This will initiate a Globus Transfer job.

[Image: Globus Initiate Transfer]

  • View progress of your transfer under the Activity tab in the Globus App

[Image: Globus Activity]

Using Rclone

Step 1: Download and install rclone and make sure you can run it

  • Download the latest version from https://rclone.org/downloads/ and install per the developer’s documentation.
  • To test, run rclone --version at your prompt; if the software is installed a version number will be reported back to you, similar to:
$ rclone --version
rclone v1.54.1-DEV
- os/arch: freebsd/amd64
- go version: go1.16.2
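
The check above can also be wrapped in a small shell function, for example at the top of a transfer script. This is only a sketch: check_rclone is a hypothetical helper name, not part of rclone, and it assumes a POSIX-compatible shell.

```shell
# check_rclone: print the first line of `rclone --version` if rclone is on PATH.
# (check_rclone is a hypothetical helper name, not an rclone command.)
check_rclone() {
    if command -v rclone >/dev/null 2>&1; then
        rclone --version | head -n 1
    else
        echo "rclone was not found on your PATH" >&2
        return 1
    fi
}

check_rclone || echo "Install rclone, then rerun this check."
```

If rclone is installed, the function prints the version line; otherwise it reports that the command is missing and returns a non-zero status.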

Step 2: Configure the Google Drive remote

Note: the steps below are also outlined in the Rclone Documentation for Google Drive. We only list the critical steps. If a step is left out, assume you can select the default setting.

  • Type rclone config to create a new profile for transferring files between Google Drive and PetaLibrary

  • When prompted for whether to configure a “new” or “existing” profile, type n for “new remote”

  • When prompted to name the new profile provide any descriptive name you like (e.g., gdrive_johndoe_cu)

  • When prompted for the type of storage to configure, select the number of the option for “Google Drive” (e.g., the number is “15” for rclone v1.54.1)

  • When prompted for the scope that rclone should use when requesting access from Drive, enter ‘1’ for full drive access

  • When prompted for “Use auto config?”, choose “Y” for yes (default). Rclone will then give you a URL to authenticate against and may open it in your browser automatically. If it does not and you are configuring on a local machine, paste the URL into your browser. If you are working on a remote system (e.g., you are logged into your lab server from home), open a terminal on your own computer and ssh to the system where you are configuring rclone, forwarding the port that appears in the URL:

    $ port=53682
    $ ssh -L ${port}:localhost:${port} <machine where rclone is running>
    

    With this ssh session open, entering the URL in your local browser should work.

  • Once you are in your browser, you may be asked to authenticate to your Google account, and then you will be asked to allow Rclone to access the files in your Google Drive. Complete this step to grant access. If successful, you’ll receive a “success” message.

  • When prompted for whether you want to configure this as a Shared Drive (Team Drive), choose the appropriate answer; if this is your personal Drive account, choose no (default)

  • Finally, you will be prompted to review the configuration and confirm whether it is okay. If everything looks okay, choose yes (default)

  • Now type q to quit the configuration.

Now test your Google Drive remote!

On your local machine type:

$ rclone ls gdrive_johndoe_cu:

If this step fails, your Google Drive remote is not configured properly. Try reconfiguring the remote, or contact rc-help@colorado.edu for assistance.
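
When troubleshooting, it can help to first confirm that the profile exists before listing files. The sketch below assumes the example profile name gdrive_johndoe_cu from Step 2; remote_configured is a hypothetical helper name. It relies on rclone listremotes, which prints each configured remote followed by a colon, and on rclone lsd, which lists only top-level directories and is usually quicker than a full recursive ls on a large Drive.

```shell
# remote_configured NAME: succeed if NAME appears in the output of `rclone listremotes`.
# (remote_configured is a hypothetical helper name, not an rclone command.)
remote_configured() {
    rclone listremotes 2>/dev/null | grep -qxF -- "$1:"
}

REMOTE="gdrive_johndoe_cu"   # example profile name from Step 2; use your own

if remote_configured "$REMOTE"; then
    # Remote paths are written as NAME: (the trailing colon is required)
    rclone lsd "${REMOTE}:"
else
    echo "Remote '${REMOTE}' not found; rerun 'rclone config'."
fi
```

If the remote is listed but lsd still fails, the stored Google authorization is the more likely cause, and reconfiguring the remote as described above is the next step.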

Step 3: Create your PetaLibrary remote configuration

See instructions for creating an Rclone remote configuration for PetaLibrary
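
Once both remotes exist, a transfer might look like the sketch below. Every name in it is a placeholder: pl_johndoe_cu stands for whatever you called your PetaLibrary remote, project_data for a folder in your Drive, /pl/path/to/allocation for your own PetaLibrary path, and copy_gdrive_to_pl is a hypothetical helper. It uses rclone copy with --dry-run to preview the transfer first, which reports what would be copied without moving any data.

```shell
# Placeholders: replace with your own remote names and paths.
SRC="gdrive_johndoe_cu:project_data"          # folder in Google Drive (placeholder)
DEST="pl_johndoe_cu:/pl/path/to/allocation"   # PetaLibrary remote and path (placeholders)

# copy_gdrive_to_pl [preview|run]: preview (default) uses --dry-run; run performs the copy.
# (copy_gdrive_to_pl is a hypothetical helper name, not an rclone command.)
copy_gdrive_to_pl() {
    if [ "${1:-preview}" = "run" ]; then
        # --progress shows live transfer statistics
        rclone copy "$SRC" "$DEST" --progress
    else
        # --dry-run lists what would be copied without transferring anything
        rclone copy "$SRC" "$DEST" --dry-run
    fi
}

# Preview only if rclone is installed; nothing is transferred at this point.
if command -v rclone >/dev/null 2>&1; then
    copy_gdrive_to_pl preview || echo "Preview failed; check both remotes."
fi
```

After reviewing the preview, running copy_gdrive_to_pl run would start the actual copy. rclone copy does not delete anything at the destination, which makes it a cautious default for a first transfer.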
