High-performance collaborative distributed storage (HP-CDS)

HP-CDS is an experimental storage prototype designed to deliver high throughput with low resource utilization at scale. It targets data-intensive distributed applications whose I/O patterns are non-trivial or become irregular due to multi-tenancy.

Its central idea is to organize the storage elements into a decentralized peer-to-peer network whose members constantly exchange information about locally observed content and I/O access patterns. This exchange lets the peers discover global trends that can be exploited through collaboration, such as dynamic prefetching of data blocks from peers with similar access patterns, on-the-fly de-duplication and dissemination of hot data, and automated system-level storage elasticity.
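
To make the prefetching idea more concrete, the following is a minimal Python sketch, not the HP-CDS implementation. It assumes each peer records the block IDs it has read and shares that list with the others; the `Peer` class, the use of Jaccard similarity, and the 0.5 threshold are all illustrative assumptions that do not come from the project itself.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Peer:
    """Hypothetical storage peer that remembers which block IDs it has read."""
    name: str
    accessed: list[int] = field(default_factory=list)  # unique block IDs, in access order

    def read(self, block_id: int) -> None:
        # Record only the first access to each block.
        if block_id not in self.accessed:
            self.accessed.append(block_id)


def jaccard(a: set[int], b: set[int]) -> float:
    """Overlap between two sets of block IDs, in the range [0, 1]."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def prefetch_candidates(me: Peer, others: list[Peer], threshold: float = 0.5) -> list[int]:
    """Blocks read by sufficiently similar peers that `me` has not read yet.

    The similarity measure and threshold are assumptions made for illustration.
    """
    mine = set(me.accessed)
    candidates: list[int] = []
    for peer in others:
        if jaccard(mine, set(peer.accessed)) >= threshold:
            for block in peer.accessed:
                if block not in mine and block not in candidates:
                    candidates.append(block)
    return candidates


# Three peers: `b` is further along the same access sequence as `a`,
# while `c` touches unrelated blocks.
a = Peer("a")
b = Peer("b")
c = Peer("c")
for block in (1, 2, 3):
    a.read(block)
for block in (1, 2, 3, 4, 5):
    b.read(block)
for block in (7, 8, 9):
    c.read(block)

print(prefetch_candidates(a, [b, c]))  # prints [4, 5]
```

In this toy run, peer `a` shares three of five distinct blocks with `b` (similarity 0.6) and none with `c`, so only the blocks that `b` has already read ahead are suggested for prefetching. A real system would presumably exchange compact summaries rather than full access lists, but that design is outside what this page describes.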

Contact

Bogdan Nicolae

HP-CDS use case

On-demand collaborative propagation of virtual disk content

Publications

Bogdan Nicolae, Alexei Karve, Andrzej Kochut
"Discovering and Leveraging Content Similarity to Optimize Collective On-Demand Data Access to IaaS Cloud Storage"
CCGrid'15: 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, 2015.

Andrzej Kochut, Alexei Karve, Bogdan Nicolae
"Towards Efficient On-demand VM Provisioning: Study of VM Runtime I/O Access Patterns to Shared Image Content"
IM'15: 14th IFIP/IEEE International Symposium on Integrated Network Management, 2015.

Bogdan Nicolae, Pierre Riteau, Kate Keahey
"Transparent Throughput Elasticity for IaaS Cloud Storage Using Guest-Side Block-Level Caching"
UCC'14: 7th IEEE/ACM International Conference on Utility and Cloud Computing, 2014.

Bogdan Nicolae, Pierre Riteau, Kate Keahey
"Bursting the Cloud Data Bubble: Towards Transparent Storage Elasticity in IaaS Clouds"
IPDPS'14: 28th IEEE International Parallel and Distributed Processing Symposium, pp. 135-144, 2014.