Product SiteDocumentation Site

Chapter 2. Installation

2.1. Requirements
2.1.1. Frontend
2.1.2. Backend
2.2. Backend Installation
2.2.1. Download and Unpack
2.2.2. Create Account & Set Permissions
2.2.3. Configure
2.2.4. Start & Stop
2.2.5. Enable Test Jobs (Optional)
2.3. Frontend Installation
2.3.1. Download and Unpack
2.3.2. Configure
2.3.3. Set up Authentication
2.4. Securing Access
2.4.1. Password Authentication
2.4.2. SSL

2.1. Requirements

As explained in Section 1.2, “Architecture”, WebMapReduce is split into two components. We will discuss the requirements for each component separately.

2.1.1. Frontend

The WebMapReduce frontend requires the following:
  • A webserver (e.g., Apache 2) running PHP 5+, including the following PHP extensions:
    • cURL (with OpenSSL support)
    • LDAPor PAM (depending on which authentication option you choose)
    The first three extensions need to be compiled into the PHP core, but they may already be present on your installation. PAM comes as an installable module. You can find out what extensions are present on your PHP installation using phpinfo() as described here.
  • OpenSSL command-line suite
Apache 2 is the officially supported webserver for WebMapReduce; however, any PHP-capable server should work.

2.1.2. Backend

The WebMapReduce backend requires the following:
  • Hadoop 0.20.x
  • A POSIX-compatible OS (e.g., Linux, Mac OS X, BSD, Solaris) or support layer (e.g., Cygwin)
  • Java SE 6 or above
  • Recommended: OpenSSL command-line suite
  • Recommended: sudo or an equivalent system that allows unprivileged users to execute commands under other (unprivileged) accounts. (Some OSs have this support built in.)
Additional components may be required to support writing jobs in specific languages. For example, C++ requires GCC and GNU make, while Java requires the JDK. Interpreted languages (Python, Java, Scheme, etc.) require their interpreters to be installed on all Hadoop nodes.