- Run all commands with the sudo user
- Run
initial.sh
- Change the path where the github project is downloaded in the
docker-compose.yml
file to the one on your local machine - Access Portainer-Web at http://localhost:9000
- Create Stack on Portainer
docker-compose.yml
and start the stack: https://docs.portainer.io/v/ce-2.9/user/docker/stacks/add - Run
start-project.sh
- Run
run-wordcount.sh
to run Word Count job - Enjoy!
- Have Ansible installed
- Enter the ansible folder
- Set your hosts in hosts file in ansible folder
- Run
./install-ssh-keys.sh hosts
and enter the hosts file in front of the script to set the SSH key with your remote host - Run
ansible-playbook playbook.yml
- This cluster consists of a master node and two slaves by default
- You might have to change resource configs. Current config uses 4 cores and 4 Gb RAM
- Enter directory /data/big-data-storage
- Create a text file or any other
- Access HDFS http://localhost:9870
- Navigate to Browser the file system
- Open the /flume folder
- See if the file you just created on the host machine is found
- Enjoy!
If you want to see the web UI, you have to access the following address/port:
http://localhost:9870
HDFS Web UIhttp://localhost:8088
YARN Web UIhttp://localhost:19888
MapReduce JobHistory Web UIhttp://localhost:10002
HiveServer2 Web UIhttp://localhost:9000
Portainer Web UI