Experiment and Discussion: ZooKeeper and Hadoop
The instructor provides a short and incomplete tutorial about setting up Hadoop cluster.
Complete the following:
- Setting up a Hadoop cluster capable of running MapReduce workload if not already.
- Adding the ZooKeeper service to the cluster (See References)
Prepare slides, discussing:
- Motivation. (e.g., Why do we need ZooKeeper?)
- Problem. (e.g., What problem does ZooKeeper solve? )
- Design. (e.g., How does ZooKeeper help? Is there any trade-offs? What are the trade-offs)
- Experiment. (e.g., completing setup, running experiments observing in what ways ZooKeeper helps)
Be prepared to present selected slides.
Reference:
- “HDFS High Availability”, https://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-hdfs/HDFSHighAvailabilityWithNFS.html, retrieved November 2025.
- “Apache ZooKeeper Project”, https://zookeeper.apache.org/, retrieved November 2025.