Coming Events
November 21, 2008 - Supercomputing '08 Panel: Exa and Yotta Scale Data - Are We Ready?
Soon after Teraflops, HPC facilities were handling Peta-Byte data. The challenges of Exa-byte and Yotta-byte data will be a significant, possibly dominate, limiter on productivity of HPC users. This panel will address the question "Is the HPC community ready for Exa Byte data?" and will discuss challenges of Yotta-bytes.
November 17, 2008 - Petascale Data Storage Workshop at SC '08 - SUBMISSIONS NOW OPEN
Petascale computing infrastructures make petascale demands on information storage capacity, performance, concurrency, reliability, availability, and manageability. This one-day workshop focuses on the data storage problems and emerging solutions found in petascale scientific computing environments, with special attention to issues in which community collaboration can be crucial, problem identification, workload capture, solution interoperability, standards with community buy-in, and shared tools.
This workshop seeks contributions on relevant topics, including but not limited to: performance and benchmarking results and tools, failure tolerance problems and solutions, APIs for high performance features, parallel file systems, high bandwidth storage architectures, wide area file systems, metadata intensive workloads, autonomics for HPC storage, virtualization for storage systems, archival storage advances, resource management innovations, etc.
Download the Call for Papers.
August 3-6, 2008 - HEC FSIO Workshop, hosted by Los Alamos National Laboratory
HEC FSIO 2008 Presentations now available.
To collect a broader set of research needs in this area, government agencies, top universities in the I/O area, and commercial entities that fund file systems and I/O research were invited to help the HECIWG determine the most needed research topics within this area. The information gathered at this workshop will be used to facilitate better coordinated government funded research in this important area in the coming years. An advisory group will be formed to continue the file systems and I/O research coordination advisory effort for the coming years for the HECIWG.
Code and Data Releases
- NEW! Sandia releases 2 sets of application I/O trace outputs taken during runs of the Sandia code Alegra.
- LANL releases workstation file system statistics gathered with fsstats tool (08/07/08)
- Sandia's IOR software, for benchmarking parallel file systems using POSIX, MPIIO, or HDF5 interfaces. This software was developed at LLNL, with Bill Loewe.
- FSSTATS code release
- LANL PDSI Research Data and Open Source Code
- PFS code release
- Computer Failure Data Repository (CFDR)
PDSI Overview
Petascale computing infrastructures for scientific discovery make petascale demands on information storage capacity, performance, concurrency, reliability, availability, and manageability. The last decade has shown that parallel file systems can barely keep pace with high performance computing along these dimensions; this poses a critical challenge when petascale requirements are considered. The Petascale Data Storage Institute will focus on the data storage problems found in petascale scientific computing environments, with special attention to community issues such as interoperability, community buy-in, and shared tools. Leveraging experience in applications and diverse file and storage systems expertise of its members, the institute allows a group of researchers to collaborate extensively on developing requirements, standards, algorithms, and development and performance tools. Mechanisms for petascale storage and results will be made available to the petascale computing community. The institute will hold periodic workshops and develop educational materials on petascale data storage for science.
The Petascale Data Storage Institute is a collaboration between researchers at Carnegie Mellon University, National Energy Research Scientific Computing Center, Pacific Northwest National Laboratory, Oak Ridge National Laboratory, Sandia National Laboratory, Los Alamos National Laboratory, University of Michigan, and the University of California at Santa Cruz.
The Drive to Petascale ComputingFaster computers need more data, faster:
Challenges
|
![]() |
Contact Us
Garth Gibson, PDSI PI
School of Computer Science
Carnegie Mellon University
Pittsburgh, PA 15213
phone:412-268-5890
email: garth@cs.cmu.edu
Angela Miller, Administrative Asst.
School of Computer Science
Carnegie Mellon University
Pittsburgh, PA 15213
phone: 412-268-6645
email:amiller@cs.cmu.edu









