What is the recommended disk configuration for slave nodes in your Hadoop cluster with 6 x 2 TB hard drives?

CCA-470

A. RAID 10
B. JBOD
C. RAID 5
D. RAID 1+0

1 Answer



  1. isamin on Jul 24, 2013

    Answer: B. JBOD

    Explanation:
    Note: Let me be clear here: there are absolutely times when using an enterprise-class storage
    device makes perfect sense. But for Hadoop it is very much unnecessary, and it is these three
    areas that I am going to hit, as well as some others, that I hope will demonstrate that Hadoop
    works best with inexpensive, internal storage in JBOD mode.

    Some of you might say, “If you lose a disk in a JBOD configuration, you’re toast; you lose
    everything.” While that might be true elsewhere, with Hadoop it isn’t. Not only do you get the
    speed benefit of JBOD, you also have the Hadoop Distributed File System (HDFS) negating this
    risk. By default, HDFS keeps three copies of each block of data on different nodes. This is a
    very robust way to guard against data loss due to a disk failure or node outage, so you can
    eliminate the need for performance-reducing RAID.
    Reference:
    Hadoop and Storage Area Networks
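
To put rough numbers on the options in the question, here is a minimal sketch of the capacity arithmetic for a node with 6 x 2 TB drives. The function names `usable_tb` and `hdfs_unique_tb` are made up for illustration, the figures are raw capacities that ignore filesystem overhead and reserved space, and the replication factor of 3 is the HDFS default, which a real cluster may override.

```python
# Illustrative arithmetic only; helper names are hypothetical, not a Hadoop API.
DISKS = 6       # drives per worker node, from the question
DISK_TB = 2.0   # capacity of each drive in TB, from the question


def usable_tb(layout: str, disks: int = DISKS, disk_tb: float = DISK_TB) -> float:
    """Raw usable capacity of one node for a given disk layout."""
    if layout == "JBOD":
        return disks * disk_tb        # every disk is exposed independently
    if layout == "RAID 5":
        return (disks - 1) * disk_tb  # one disk's worth of space goes to parity
    if layout in ("RAID 10", "RAID 1+0"):
        return disks / 2 * disk_tb    # every disk is mirrored by another
    raise ValueError(f"unknown layout: {layout}")


def hdfs_unique_tb(node_tb: float, replication: int = 3) -> float:
    """Unique data that node_tb of storage holds once HDFS keeps `replication` copies."""
    return node_tb / replication


for layout in ("JBOD", "RAID 5", "RAID 10"):
    raw = usable_tb(layout)
    print(f"{layout:8s} raw={raw:5.1f} TB  unique HDFS data={hdfs_unique_tb(raw):4.1f} TB")
```

Under these assumptions JBOD exposes all 12 TB per node, RAID 5 exposes 10 TB, and RAID 10 (the same thing as RAID 1+0) exposes 6 TB. Since HDFS already keeps three copies of each block, RAID redundancy on top of that gives up capacity to protect against a failure HDFS is already covering.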
