Coming Events | Code and Data Releases | PDSI Overview | Contact Us

Coming Events

November 21, 2008 - Supercomputing '08 Panel: Exa and Yotta Scale Data - Are We Ready?
Soon after Teraflops, HPC facilities were handling Peta-Byte data. The challenges of Exa-byte and Yotta-byte data will be a significant, possibly dominate, limiter on productivity of HPC users. This panel will address the question "Is the HPC community ready for Exa Byte data?" and will discuss challenges of Yotta-bytes.

November 17, 2008 - Petascale Data Storage Workshop at SC '08 - SUBMISSIONS NOW OPEN
Petascale computing infrastructures make petascale demands on information storage capacity, performance, concurrency, reliability, availability, and manageability. This one-day workshop focuses on the data storage problems and emerging solutions found in petascale scientific computing environments, with special attention to issues in which community collaboration can be crucial, problem identification, workload capture, solution interoperability, standards with community buy-in, and shared tools.

This workshop seeks contributions on relevant topics, including but not limited to: performance and benchmarking results and tools, failure tolerance problems and solutions, APIs for high performance features, parallel file systems, high bandwidth storage architectures, wide area file systems, metadata intensive workloads, autonomics for HPC storage, virtualization for storage systems, archival storage advances, resource management innovations, etc.

Download the Call for Papers.

August 3-6, 2008 - HEC FSIO Workshop, hosted by Los Alamos National Laboratory
HEC FSIO 2008 Presentations now available.
To collect a broader set of research needs in this area, government agencies, top universities in the I/O area, and commercial entities that fund file systems and I/O research were invited to help the HECIWG determine the most needed research topics within this area. The information gathered at this workshop will be used to facilitate better coordinated government funded research in this important area in the coming years. An advisory group will be formed to continue the file systems and I/O research coordination advisory effort for the coming years for the HECIWG.

 

Code and Data Releases

 

PDSI Overview

Petascale computing infrastructures for scientific discovery make petascale demands on information storage capacity, performance, concurrency, reliability, availability, and manageability. The last decade has shown that parallel file systems can barely keep pace with high performance computing along these dimensions; this poses a critical challenge when petascale requirements are considered. The Petascale Data Storage Institute will focus on the data storage problems found in petascale scientific computing environments, with special attention to community issues such as interoperability, community buy-in, and shared tools. Leveraging experience in applications and diverse file and storage systems expertise of its members, the institute allows a group of researchers to collaborate extensively on developing requirements, standards, algorithms, and development and performance tools. Mechanisms for petascale storage and results will be made available to the petascale computing community. The institute will hold periodic workshops and develop educational materials on petascale data storage for science.

The Petascale Data Storage Institute is a collaboration between researchers at Carnegie Mellon University, National Energy Research Scientific Computing Center, Pacific Northwest National Laboratory, Oak Ridge National Laboratory, Sandia National Laboratory, Los Alamos National Laboratory, University of Michigan, and the University of California at Santa Cruz.

The Drive to Petascale Computing

Faster computers need more data, faster:

  • Data movement at Terabytes/sec
  • Petabyte sized files (100 Library of Congress equivalents)
  • Trillions of files

Challenges

  • Scaling file system speeds and feeds
  • Scalable interoperable interfaces and protocols
  • Automating data distribution and fault mitigation
  • Enumerate and search metadata of trillions of files

 

Contact Us

Garth Gibson, PDSI PI
School of Computer Science
Carnegie Mellon University
Pittsburgh, PA 15213
phone:412-268-5890
email: garth@cs.cmu.edu

Angela Miller, Administrative Asst.
School of Computer Science
Carnegie Mellon University
Pittsburgh, PA 15213
phone: 412-268-6645
email:amiller@cs.cmu.edu

 

Last updated 2008-08-27 | ©2008 Carnegie Mellon University |