  1. Introduction
  2. Prerequisites
  3. Set Environment Variables
  4. Set up SSH daemon
  5. Download Hadoop and place it in the home directory
  6. Unpack Hadoop
  7. Configure Hadoop
  8. Format the namenode
  9. Set up Hadoop plugin
  10. Start the cluster
  11. Set up Hadoop location
  12. Upload data
  13. Create and run a test project

Configure Hadoop

Now that we have unpacked Hadoop, we are ready to configure it.

  1. Open a new Cygwin window and execute the following commands:
    cd hadoop-0.19.1
    cd conf
    explorer .


    Bringing up explorer window


  2. The last command opens an Explorer window for the 'conf' directory. Minimize it for now or move it to the side.
  3. Launch Eclipse.
  4. Bring up the 'conf' Explorer window opened in step 1 and drag the file hadoop-site (hadoop-site.xml) into the Eclipse main window.
  5. Insert the following lines between the <configuration> and </configuration> tags:
    <property>
      <name>fs.default.name</name>
      <value>hdfs://localhost:9100</value>
    </property>
    <property>
      <name>mapred.job.tracker</name>
      <value>localhost:9101</value>
    </property>
    <property>
      <name>dfs.replication</name>
      <value>1</value>
    </property>


    Editing site configuration file


  6. Save the file, then close Eclipse, the Cygwin command window, and the Explorer window.
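
For reference, here is a rough sketch of how the whole hadoop-site.xml file might look once the properties from step 5 are in place. It assumes the stock file contains little more than an XML declaration and an empty <configuration> element; if your copy has additional header lines, leave them as they are. The comments are added here for explanation only and are not required.

```xml
<?xml version="1.0"?>
<!-- Sketch of conf/hadoop-site.xml after step 5 (header lines may differ in your copy). -->
<configuration>

  <!-- URI of the default file system: HDFS, with the namenode on this machine, port 9100. -->
  <property>
    <name>fs.default.name</name>
    <value>hdfs://localhost:9100</value>
  </property>

  <!-- Host and port of the MapReduce job tracker. -->
  <property>
    <name>mapred.job.tracker</name>
    <value>localhost:9101</value>
  </property>

  <!-- Number of copies kept of each block; 1 suits a single-machine cluster. -->
  <property>
    <name>dfs.replication</name>
    <value>1</value>
  </property>

</configuration>
```

Every <property> element must sit inside the single <configuration> element, and the file must remain well-formed XML, so check that each opening tag has its matching closing tag before saving.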

