The PetaLibrary is a University of Colorado Boulder Research Computing service that supports the storage, archival, and sharing of research data. It is available at a subsidized cost to any researcher affiliated with the University of Colorado System (Boulder, Anschutz, Denver, Colorado Springs). It is available at an unsubsidized cost to researhers from other institutions.
Minimum project size: 1 TB/year
2 classes of storage: active and archive
- See our website for pricing information
New customers are initially limited to a maximum allocation size of:
200 TB* in Active Storage
100 TB* in Archive Storage
PetaLibrary access is subject to the PetaLibrary Terms of Service.
Accessing the PetaLibrary¶
PetaLibrary storage is presented as a file system directory under either:
Access to a PetaLibrary allocation is granted using an access group. This group may be an existing group in the Research Computing environment or a new group created specifically for the purpose of managing access to the allocation. Allocation users are made members of this access group by requesting that the allocation owner or delegate contact to the RC help desk to request their RC account be added to the group.
Note: Each person who accesses the PetaLibrary is required to have a Research Computing account and Duo two-factor authentication.
Request a PetaLibrary allocation¶
Request PetaLibrary storage by filling out the application form at the RC PetaLibrary page, under the “Request a new PetaLibrary allocation” link.
Note: Each PetaLibrary allocation must define an allocation owner, read more about PetaLibrary owners and contacts and their individual roles/responsibilities.
When a new allocation is created the path to it is defined and provisioned based on a name selected by you. For example, Jane Doe might name her lab’s allocation
- To access active storage: Log in to a Research Computing via login.rc.colorado.edu
and navigate to:
- To access archive storage: Archive storage is located at:
Note: Access via the login nodes is not recommended for frequent or large read/writes of archived data.
Two primary classes of storage are available:¶
- Appropriate for data that is frequently written or read
- Stores data in a parity-protected RAID array or similar
- Directly accessible (read+write) from Research Computing compute resources
- Appropriate for data that is infrequently accessed
- Stores data on tapes in a robotic tape library, with all data written to at least two tapes
PetaLibrary is a shared infrastructure and the instantaneous performance will vary depending on each individual workload and competing workloads from other clients.
The PetaLibrary service is designed for file storage and retrieval, and is not an ideal backend for highly transactional workloads (e.g., relational databases).