Toward Efficient Block Replication Management in Distributed Storage

Liao, Jianwei; Sha, Zhibing; Cai, Zhigang; Liu, Zhiming; Li, Kenli; Liao, Wei-Keng; Choudhary, Alok N.; Ishiakwa, Yutaka

Published in

Association for Computing Machinery (ACM), ACM Transactions on Modeling and Performance Evaluation of Computing Systems, 3(5), p. 1-27, 2020

DOI: 10.1145/3412450

Tools

Export citation

Search in Google Scholar

Toward Efficient Block Replication Management in Distributed Storage

Journal article published in 2020 by Jianwei Liao, Zhibing Sha, Zhigang Cai, Zhiming Liu, Kenli Li, Wei-Keng Liao, Alok N. Choudhary, Yutaka Ishiakwa

This paper was not found in any repository, but could be made available legally by the author.

Full text: Unavailable

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

Distributed/parallel file systems commonly suffer from load imbalance and resource contention due to the bursty characteristic exhibited in scientific applications. This article presents an adaptive scheme supporting dynamic block data replication and an efficient replica placement policy to improve the I/O performance of a distributed file system. Our goal is not only to yield a balanced data replication among storage servers but also a high degree of data access parallelism for the applications. We first present mathematical cost models to formulate the cost of data block replication by considering both the overhead and reduced data access time to the replicated data. To verify the validity and feasibility of the proposed cost model, we implement our proposal in a prototype distributed file system and evaluate it using a set of representative database-relevant application benchmarks. Our results demonstrate that the proposed approach can boost the usage efficiency of the data replicas with acceptable overhead of data replication management. Consequently, the overall data throughput of storage system can be noticeably improved. In summary, the proposed replication management scheme works well, especially for the database-relevant applications that exhibit an uneven access frequency and pattern to different parts of files.

Published in

Links

Tools

Toward Efficient Block Replication Management in Distributed Storage

Abstract