Step 5: Set Up a CDH Cluster This section describes how to set up a CDH cluster in an unmanaged environment, using the command line. Set up the components you are using in the order shown. Skip any components you are not using. ZooKeeper HDFS MapReduce v2 with YARN MapReduce v1 Crunch Installation Flume HBase Hive HCatalog Impala HttpFS Hue Kafka KMS Kudu Mahout Oozie Pig Search Sentry Spark Sqoop Sqoop 2 Whirr Step 4: Install CDH Packages ZooKeeper