Published in

2010 International Conference on Complex, Intelligent and Software Intensive Systems

DOI: 10.1109/cisis.2010.45

Links

Tools

Export citation

Search in Google Scholar

Benchmarking a MapReduce Environment on a Full Virtualisation Platform

Proceedings article published in 2010 by Maryam Kontagora, Horacio González-Vélez ORCID
This paper is available in a repository.
This paper is available in a repository.

Full text: Download

Green circle
Preprint: archiving allowed
Green circle
Postprint: archiving allowed
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

This work analyses the performance of Hadoop, an implementation of the MapReduce programming model for distributed parallel computing, executing on a virtualisation environment comprised of 1 + 16 nodes running the VMWare workstation software. A set of experiments using the standard Hadoop benchmarks has been designed in order to determine whether or not significant reductions in the execution time of computations are experienced using Hadoop on this virtualisation platform on a local area network. Our findings indicate that a significant decrease in computing times is observed under these conditions. They also highlight how overheads and virtualisation in a distributed environment hinder the possibility of achieving the maximum (peak) performance.