What is GPFS-WAN, and how do I access it on Big Red at IU?
Global Parallel File System-Wide Area Network (GPFS-WAN) is a service provided by the San Diego Supercomputer Center (SDSC), coordinated with multiple TeraGrid sites, including Indiana University. SDSC makes its 700TB file system available for mounting on other TeraGrid resources, such as Big Red at IU. Combined with a UID mapping mechanism, this allows a user with TeraGrid credentials to access data on the single file system from several disparate compute resources. For a list of TeraGrid resources where GPFS is mounted, see the GPFS-WAN page.
The GPFS-WAN storage system is recommended for long- or short-term data storage of high-volume multi-site runs, as well as for large TeraGrid-based data collections. GPFS-WAN has three distinct purposes, each with its own policy for access, allocation, and data preservation:
-
Long-term Collections Area: This 150TB partition
is for data collections that need the unique functionality of a global
file system. Use of this space is granted via the same peer-review
process used to request CPU cycles. Request use of this storage area
by submitting a Data Allocation Proposal at the Partnerships Online Proposal
System (POPS).
-
Project Area: This 475TB partition is intended as
a temporary unpurged space for multi-site analysis. Any active
TeraGrid-allocated project that can benefit from the unique
capabilities of the global file system can request space by submitting
the GPFS-WAN
Projects Space Request Form. Quotas are enforced and determined
based upon this request. The length of time the space is available is
based on the duration of the TeraGrid project specified in the
request. Data will be removed from GPFS-WAN at the end of the project.
- Scratch Area: This 75TB partition is accessible by all TeraGrid users; you don't need to submit a request. To use this partition for short-term data analysis before moving your data to archival storage, simply access the partition from any of the resources mounting GPFS-WAN, create directories, and store your data. Use is unlimited within 75TB and is shared among all active users. Inactive files will be purged regularly in this partition as they age beyond two weeks.
Accessing GPFS-WAN on Big Red
On Big Red, GPFS-WAN is mounted at /N/gpfswan/. You can
use GPFS-WAN on Big Red only if you have a TeraGrid allocation. To
apply, see How do I apply for a new TeraGrid allocation? If your advisor or someone you work with
already has a TeraGrid allocation, he or she can add you to the
allocation by completing the Add/Remove User Form.
Data in GPFS-WAN are not backed up.
For information about allocable data storage resources on the TeraGrid, see the Allocable Data Resources page in the TeraGrid User Support documentation.
For information about selecting the appropriate storage for efficient management of your job output, see the Data Storage page in the TeraGrid User Support documentation.
This document was developed with support from the National Science Foundation (NSF) under Grant No. 0503697 to the University of Chicago and subcontracted to Indiana University. Additional support was provided by IU through its participation in the TeraGrid, which is supported by the NSF under Grants No. 0833618, SCI451237, SCI535258, and SCI504075. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the NSF.
Last modified on June 04, 2009.







