6th Parallel Data Storage Workshop

held in conjunction with
Supercomputing '11

Chair: John Bent, EMC

Sunday, November 13, 2011
9:00 am - 5:30 pm
Grand Hyatt Princessa II, Seattle, WA

SC11 Workshop Web Page

PDSW Proceedings available online at the ACM Digital Library

Abstract | Agenda | Camera-Ready Instructions | Poster Instructions
Call for Papers | Call for Posters | Poster Session
Attending the Workshop | Workshop Committees

Previous Petascale Data Storage Workshops


Computational scientists are no longer satisfied with petascale infrastructures. Their demands for finer and finer spatial and temporal resolutions are driving parallel storage systems to larger and larger scales of parallelism and concurrency. This scale creates new problems and exacerbates old ones in areas such as storage capacity, performance, concurrency, data retrieval, reliability, availability, and manageability. Additionally, new technologies such as cloud storage are encouraging scientists to preserve more old data and to expand their analyses to include data from a wider range of previous computations. Paying special attention to issues in which community collaboration can be crucial such as problem identification, workload capture, solution interoperability, standards with community buy-in, and shared tools, this one-day workshop seeks contributions in the form of papers and posters on relevant topics, including but not limited to:

  • performance and benchmarking results and tools,
  • failure tolerance,
  • APIs and protocols for high performance features,
  • parallel file systems,
  • high bandwidth storage architectures,
  • wide area file systems,
  • metadata intensive workloads,
  • information extraction,
  • autonomics for HPC storage,
  • checkpoint/restart,
  • virtualization for storage systems,
  • archival storage advances, and
  • resource management innovations.


8:55am - 9:00am
Welcome - John Bent, EMC
9:00am - 9:45am
Keynote Speaker - Brent Welch (Panasas)
Lessons and Predictions from 10 Years of Parallel Data Storage System Development
Speaker Bio | Slides
9:45am - 10:15am
POSTER SESSION 1 - List of participants and links to posters
10:15am - 11:45am
Chair: Ethan Miller, UCSC

Robust Benchmarking for Archival Storage Tiers
DongJin Lee (The University of Auckland), Michael O'Sullivan (The University of Auckland), Cameron Walker (The University of Auckland), Monique MacKenzie (The University of St Andrews)
Speaker: DongJin Lee
Paper | Slides

Extending Scalability of Collective IO Through Nessie and Stagin
Jay Lofstead (Sandia National Laboratories), Ron Oldfield (Sandia National Laboratories), Todd Kordenbrock (Hewlett-Packard Company), Charles Reiss (University of California, Berkeley)
Speaker: Jay Lofstead
Paper | Slides

Parallel I/O and the Metadata Wall
Sadaf R Alam (Swiss National Supercomputing Centre), Hussein N El-Harake (Swiss National Supercomputing Centre), Kristopher Howard (Swiss National Supercomputing Centre), Neil Stringfellow (Swiss National Supercomputing Centre), Fabio Verzelloni (Swiss National Supercomputing Centre)
Speaker: Neil Stringfellow
Paper | Slides

11:45pm - 1:15pm
1:15pm - 2:45pm
Chair: Ron Oldfield, Sandia

Horus: Fine-Grained Encryption-Based Security for High Performance Petascale Storage
Ranjana Rajendran (Storage Systems Research Centre, University of California, Santa Cruz), Ethan Miller (Storage Systems Research Centre, University of California, Santa Cruz), Darrell Long (Storage Systems Research Centre, University of California, Santa Cruz)
Speaker: Ethan Miller
Paper | Slides

Easing the Burdens of HPC File Management
Stephanie N. Jones (University of California, Santa Cruz), Christina R. Strong (University of California, Santa Cruz), Aleatha Parker-Wood (University of California, Santa Cruz), Alexandra Holloway (University of California, Santa Cruz), Darrell D. E. Long (University of California, Santa Cruz)
Speaker: Stephanie Jones
Paper | Slides

The Purge Threat: Scientists' Thoughts on Usability in the Peta-Scale
Alexandra Holloway (University of California, Santa Cruz)
Speaker: Alexandra Holloway
Paper | Slides

2:45pm - 3:15pm
POSTER SESSION 2 - List of participants and links to posters
3:15pm - 4:45pm
Chair: Adam Manzanares, LANL

In-Situ I/O Processing: A Case for Location Flexibility
Fang Zheng (Georgia Tech), Hasan Abbasi (Georgia Tech), Jiangting Cao (Georgia Tech), Jai Dayal (Georgia Tech), Karsten Schwan (Georgia Tech), Matthew Wolf (Georgia Tech), Scott Klasky (Oak Ridge National Laboratory), Norbert Podhorszki (Oak Ridge National Laboratory)
Speaker: Fang Zheng
Paper | Slides

Pattern-Aware File Reorganization in MPI-IO
Jun He (Illinois Institute of Technology), Huaiming Song (Illinois Institute of Technology), Xian-He Sun (Illinois Institute of Technology), Yanlon g Yin (Illinois Institute of Technology), Rajeev Thakur (Argonne National Laboratory)
Speaker: Jun He
Paper | Slides

Power Use of Disk Subsystems in Supercomputers
Matthew L. Curry (Sandia National Laboratories), H. Lee Ward (Sandia National Laboratories), Gary Grider (Los Alamos National Laboratory), Jill Gemmill (Clemson University), Jay Harris (Clemson University), David Martinez (Sandia National Laboratories)
Speaker: Matthew Curry
Paper | Slides

4:45pm - 5:15pm
Short Announcements

NSF PRObE: A community facility for systems testing at scale - Garth Gibson

5:15pm - 5:45pm
POSTER SESSION 3 - List of participants and links to posters


Paper Submission Website: http://www2.pdl.cmu.edu/conferences/pdsw11/index
Due: Friday, September 16, 2011, 11:59 p.m. PDT
Notification: Tuesday, October 11, 2011
Camera-ready due: Sunday, November 6, 2011
Slides due: Friday, Nov. 11, 2011

CALL FOR PAPERS POSTER - download and hang one up at your office / department!

The parallel data storage workshop holds a peer reviewed competitive process for selecting extended abstracts and short papers. Submit a not previously published extended abstract of up to UPDATED - 6 PAGES, not less than 10 point font, in a PDF file as instructed on the workshop web site. Submitted papers will be reviewed under the supervision of the workshop program committee. Submissions should indicate authors and affiliations. Final papers must not be longer than 5 pages. Selected papers and associated talk slides will be made available on the workshop web site; the papers will also be published in the digital library of the IEEE or ACM.



Poster Submission Website: http://www2.pdl.cmu.edu/conferences/pdsw11poster/index
Due: Monday, November 7, 2011
Wednesday, November 9, 2011

The PDSW program committee highly encourages authors of accepted papers to present posters of their work - accepted papers ALWAYS be accepted for a poster.

Additional submissions for technical poster presentation will be considered if they are marked as such and include title and author list and a short abstract. Further specifications for poster production will be available on the workshop web site.



Please be aware that all attendees to the workshop, both speakers and participants, will have to pay an SC11 registration fee. However, there will be available a Workshop-only registration fee at a much reduced rate relative to the weeklong conference registration fee ($150). If you do register for the technical program, workshop admittance is included in that fee.

STUDENT VOLUNTEERS TO SC11: Student volunteers that have free SC11 registration also have free PDSW registration.

To attend the workshop, please register through the Supercomputing '11 registration web page.


John Bent, Los Alamos National Laboratory (PC Chair)
Randal Burns, Johns Hopkins University
Andreas Dilger, Whamcloud, Inc.
Yong Chen, Texas Tech University
Haryadi Gunawi, University of California, Berkeley
Adam Manzanares, Los Alamos National Laboratory
Dutch Meyer, University of British Columbia
Ethan Miller, University of California, Santa Cruz
Ron Oldfield, Sandia National Laboratory
Vijayan Prabhakaran, Microsoft Research
Karsten Schwan, Georgia Tech
Brad Settlemyer, Oak Ridge National Laboratory
Raju Rangaswami, Florida International University
Douglas Thain, University of Notre Dame
Rob Ross, Argonne National Laboratory


Scott Brandt, University of California, Santa Cruz
Evan J. Felix, Pacific Northwest National Laboratory
Garth A. Gibson, Carnegie Mellon University and Panasas Inc.
Gary Grider, Los Alamos National Laboratory
Peter Honeyman, University of Michigan, Ann Arbor, Center for Information Technology Integration
Bill Kramer, National Center for Supercomputing Applications/University of Illinois Urbana-Champaign
Darrell Long, University of California, Santa Cruz
Carlos Maltzahn, University of California, Santa Cruz
Philip C. Roth, Oak Ridge National Laboratory
John Shalf, National Energy Research Scientific Computing Center, Lawrence Berkeley National Laboratory
Lee Ward, Sandia National Laboratories

Last updated 2012-03-12 | ©2011Carnegie Mellon University