Data Release
When your project has finished sequencing, you will receive a data release email with download instructions. Our standard process for internal clients is delivery to the Minnesota Supercomputing Institute’s (MSI) high-performance file system. For external clients, we make data available for download from a secure website or through the Globus platform. If you encounter any problems downloading your data, please report the issue in a reply to the data release email.
Options
1. MSI storage
Internal clients have their data released to MSI's Shared User Resource Facility Storage (SURFS). Delivered data is located in the "data_delivery" folder inside your group's folder on MSI's primary filesystem (/projects/standard/GROUP/data_delivery/umgc). MSI does not charge for SURFS storage, but files expire and are removed one year after delivery. Copy files to another MSI storage location, such as Tier2 or your group's "shared" folder, before they expire. MSI provides documentation on how to download files from the MSI filesystem using WinSCP, FileZilla, and Globus.
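As an illustration of copying delivered files before they expire, here is a minimal Python sketch that copies a delivery folder to another location and compares checksums of each source file and its copy. It is not an official UMGC or MSI tool. The destination path, the function names (sha256sum, copy_tree_verified), and the choice of SHA-256 are assumptions made for this example; replace GROUP with your own group name and confirm the location of your "shared" folder with MSI.

```python
import hashlib
import shutil
from pathlib import Path


def sha256sum(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def copy_tree_verified(src: Path, dst: Path) -> list[Path]:
    """Copy every file under src into dst, preserving the folder layout.

    Returns the source files whose copy does not match by checksum
    (an empty list means every copy matched).
    """
    mismatched = []
    for source_file in sorted(p for p in src.rglob("*") if p.is_file()):
        target_file = dst / source_file.relative_to(src)
        target_file.parent.mkdir(parents=True, exist_ok=True)
        shutil.copy2(source_file, target_file)  # copies data and timestamps
        if sha256sum(source_file) != sha256sum(target_file):
            mismatched.append(source_file)
    return mismatched


if __name__ == "__main__":
    # Source path follows the pattern given above; GROUP is a placeholder.
    delivered = Path("/projects/standard/GROUP/data_delivery/umgc")
    # Hypothetical destination: the real location of "shared" may differ.
    destination = Path("/projects/standard/GROUP/shared/umgc_copy")

    if delivered.is_dir():
        failures = copy_tree_verified(delivered, destination)
        print(f"{len(failures)} file(s) failed verification")
    else:
        print(f"Delivery folder not found: {delivered}")
```

For very large deliveries, a dedicated transfer tool such as Globus may be a better fit than a script like this; the sketch only shows the copy-then-verify idea.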
2. Web download
External clients can download their data from a secure website using either a web browser or a command-line download tool. Complete instructions are provided in an email from the UMGC. Data remains available for download for 30 days, after which it is removed from the download website and the client is responsible for storing it.
3. Globus
Globus is the recommended method for external clients to download large datasets. Globus enables robust, high-speed transfer of data directly to your institution's high-performance filesystem (if it has a Globus endpoint configured) or to a laptop or desktop running the free Globus Connect Personal software. Send us the email address linked to your Globus account, and we'll make your data available for download through the Globus platform.
4. Additional Options
Let us know if our standard data delivery options don't work for you. We may be able to deliver data directly to your AWS S3 bucket. As a last resort, clients can choose to have data shipped on a hard drive purchased by the UMGC and invoiced to the client at a cost of $250 per hard drive.
Data Recovery
The UMGC archives most customer data for a year, and some datasets are retained for 5 years or more. If you need a dataset re-delivered, email a request to [email protected] to initiate data recovery. The UMGC does not guarantee that data can be successfully recovered from the archive.