3rd Petascale Data Storage Workshop
Supercomputing '08

Held in conjunction with SC08 and sponsored by the DOE SciDAC Petascale Data Storage Institute (PDSI)

Session Chair: Garth Gibson, CMU

Monday, November 17, 2008
8:30 a.m. - 5:00 p.m.
Room 14
Austin Convention Center, Austin, Texas
SC08 Workshop Web Page

Abstract | Agenda | Poster Session | Other Workshops/Panels at SC08

Papers now available through IEEE Xplore


Petascale computing infrastructures make petascale demands on information storage capacity, performance, concurrency, reliability, availability, and manageability. This one-day workshop focuses on the data storage problems and emerging solutions found in petascale scientific computing environments, with special attention to issues in which community collaboration can be crucial, problem identification, workload capture, solution interoperability, standards with community buy-in, and shared tools.

NOTE: Please be aware that all attendees to the workshop, both speakers and participants, will have to pay an SC08 registration fee. However, new for SC'08 will be a one-day Workshop-only registration fee at a much reduced rate relative to the weeklong registration fee.


8:25am - 8:30am
Welcome - Garth Gibson, Workshop Chair
8:30am - 10:00am
SESSION 1: Structures for Out-of-order and Random Access
Session Chair: Carlos Maltzahn, University of California, Santa Cruz
  Input/Output APIs and Data Organization for High Performance Scientific Computing
Jay Lofstead, Fang Zheng, Scott Klasky, Karsten Schwan, Georgia Tech and
Oak Ridge National Laboratory
Paper | Slides

Fast Log-based Concurrent Writing of Checkpoints
Milo Polte, Jiri Simsa, Wittawat Tantisiriroj, Garth Gibson, CMU
Paper | Slides

Zest: Checkpoint Storage System for Large Supercomputers
Paul Nowoczynski, Nathan Stone, Jared Yanovich, Jason Sommerfield, PSC
Paper | Slides

Scalable Full-Text Search for Petascale File Systems
Andrew W. Leung and Ethan L. Miller, University of California, Santa Cruz
Paper | Slides
10:00am - 10:30am
POSTER SESSION 1 - List of participants and links to posters
10:30am - 12:00pm
SESSION 2: Tools and Devices
Session Chair: Evan Felix, Pacific Northwest National Laboratory
  Performance of RDMA-capable Storage Protocols on Wide-Area Network
Weikuan Yu, Nageswara S.V. Rao, Pete Wyckoff, Jeffrey S. Vetter,
Oak Ridge National Laboratory
Paper | Slides

Comparing Performance of Solid State Devices and Mechanical Disks
Milo Polte, Jiri Simsa, Garth Gibson, CMU
Paper | Slides

Arbitrary Dimension Reed-Solomon Coding and Decoding for
Extended RAID on GPUs

Matthew L. Curry, H. Lee Ward, Anthony Skjellum, and Ron Brightwell,
University of Alabama at Birmingham and Sandia National Laboratory
Paper | Slides

Pianola: A Script-based I/O Benchmark
John May, Lawrence Livermore National Laboratory
Paper | Slides
12:00pm - 1:00pm
1:00pm - 2:30pm
SESSION 3: Systems and Application Support
Session Chair: Bill Kramer, Lawrence Berkeley National Laboratory
  Introducing Map-Reduce to High End Computing
Grant Mackey, Saba Sehrish, Julio Lopez, John Bent, Salman Habib, Jun Wang, University of Central Florida, Carnegie Mellon University, and Los Alamos National Laboratory
Paper | Slides

Logan: Automatic Management for Evolvable, Large-Scale, Archival Storage
Mark W. Storer, Kevin M. Greenan, Ian F. Adams, Ethan L. Miller, Darrell D. E. Long, Kaladhar Voruganti, University of California, Santa Cruz
Paper | Slides

Just-in-time Staging of Large Input Data for Supercomputing Jobs
Henry M. Monti,  Ali R. Butt, Sudharshan S. Vazhkudai, Virginia Tech, ORNL
Paper | Slides

Revisiting the Metadata Architecture of Parallel File Systems
Nawab Ali, Ananth Devulapalli, Dennis Dalessandro, Pete Wyckoff, P. Sadayappan, Ohio State University
Paper | Slides
2:30pm -3:00pm
Short Announcements (sign up onsite)
3:00pm - 3:30pm
POSTER SESSION 2 - List of participants and links to posters
3:30pm - 4:30pm

PANEL: Rewarding the Public Release of Valuable Data and Resources
Panel Speakers:
Clem Cole, Intel Corp & USENIX Association
Garth Gibson, Carnegie Mellon University and Panasas Inc - slides
Gary Grider, Los Alamos National Laboratory
John May, Lawrence Livermore National Laboratory
Ethan L. Miller, UC Santa Cruz

4:30pm - 5:00pm
POSTER SESSION 3 - List of participants and links to posters


Poster Presentations:
There will also be a poster presentation opportunity at the workshop. Parties interested in presenting a related technical poster (posters with technical results for storage products are also encouraged) should submit a paragraph or two proposal describing a poster's contents through the above web site no later than November 14, 2008. Poster acceptance will be communicated on or before November 15, 2008.


This workshop seeks contributions on relevant topics, including but not limited to: performance and benchmarking results and tools, failure tolerance problems and solutions, APIs for high performance features, parallel file systems, high bandwidth storage architectures, wide area file systems, metadata intensive workloads, autonomics for HPC storage, virtualization for storage systems, archival storage advances, resource management innovations, etc.

Paper Submission Webpage: SUBMISSIONS NOW CLOSED
Paper (extended abstract in pdf format) due Fri Sept. 26, 2008, 11:59 p.m. EDT

Notification: Mon Oct. 20, 2008
Softcopy and slides due Nov. 16 BEFORE the workshop

Paper Submission Details:
The petascale data storage workshop holds a peer reviewed competitive process for selecting extended abstracts and short papers. Submit a not previously published extended abstract of 2 to 5 pages, not less than 10 point font, in a PDF file as instructed on the workshop web site. Submitted papers will be reviewed under the supervision of the workshop program committee. Submissions should indicate authors and affiliations. Selected papers and associated talks will be made available on the workshop web site. Selected final papers may be longer than in submission, but not longer than 10 pages. The workshop proceedings will be published in association with SC08 in the IEEE digital library and talk slides will be made available on the workshop web site.

Call for Papers flyer:

Garth A. Gibson, Carnegie Mellon University and Panasas Inc.
Darrell Long, University of California, Santa Cruz
J. Bruce Fields, University of Michigan, Ann Arbor, Center for Information Technology Integration
Gary A. Grider, Los Alamos National Laboratory
William T. C. Kramer, National Energy Research Scientific Computing Center, Lawrence Berkeley National Laboratory
Philip C. Roth, Oak Ridge National Laboratory
Evan J. Felix, Pacific Northwest National Laboratory
Lee Ward, Sandia National Laboratory
Rob Ross, Argonne National Laboratory
Karsten Schwan, Georgia Institute of Technology

Other Workshops & Panels of Interest at SC08

Exa and Yotta Scale Data - Are We Ready?
Panel Chair: Bill Kramer, NERSC
Friday, Nov 21, 2008
10:30AM - 12:00PM, Ballroom E
Austin Convention Center, Austin, Texas

pNFS Protocol after Final Draft and before RFC
Primary Session Leader: Sorin Faibish (EMC)
Wed, November 19, 2008
5:30PM - 7:00PM, Ballroom F
Austin Convention Center, Austin, Texas


Last updated 2010-12-02 | ©2011Carnegie Mellon University