Product SiteDocumentation Site

Chapter 1. Introduction

1.1. About our AMIs
Amazon's Elastic Compute Cloud (EC2) can be a cost-effective way to try out cluster computing before investing time and money into equipment and maintenance. In order to make it even easier to get started on Hadoop and parallel computing, we have created Amazon Machine Images (AMIs) which, following the process below, can be used to bring up and start testing a fully-operational Hadoop cluster in a matter of minutes.

1.1. About our AMIs

Amazon Machine Images are prebuilt virtual machine images specially designed for EC2. They are stored on Amazon's servers, and can be created, customized, and shared by any EC2 user. Users with access can use them to launch their own instances, running virtual machines over which they have complete control.
As of this writing, we have built the following AMI for creating Hadoop clusters with WebMapReduce (see http://webmapreduce.sf.net/ec2.php for the latest list):
AMI ID ami-40926129
OS/Architecture Ubuntu 10.04 Lucid Server (32-bit)
Preloaded Software
  • Cloudera Distribution for Hadoop 2, based on Apache Hadoop 0.20.1
  • Apache HTTPD 2 server with PHP 5
  • WebMapReduce frontend & backend
  • OpenSSL with working certificate authority for signing WMR certificates
Table 1.1. WebMapReduce AMIs

Our AMIs come with the following features:
  • Fully configurable through Amazon's web-based AWS Management Console, requiring only a browser for setup and an SSH client for logging in.
  • Persistent storage with Amazon's Elastic Block Store (EBS) allows you to shut down cluster machines without losing their data.
  • Easy to add cluster nodes with increasing demand.