Writing Your First MapReduce Program In Hadoop Using Python
Before we begin, a note to all readers, technical or not: when learning an advanced technology such as big data and Hadoop, start with the confidence that anyone can learn a new language or concept. With that motivation, let us move ahead.
Hadoop 1.x and Hadoop 2.x have two components in common: MapReduce and HDFS. There are other internal differences between the two versions, but we’ll ignore them here and focus on MapReduce.
In Hadoop 2.x, MapReduce is responsible for data processing.
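To make the idea of MapReduce as a data processing model concrete, here is a minimal sketch of a word count written in plain Python. It only simulates the map, shuffle, and reduce phases in memory; it does not use any Hadoop API, and the function names (`map_words`, `shuffle`, `reduce_counts`, `word_count`) are hypothetical names chosen for illustration, not part of Hadoop or of the program built later.

```python
from collections import defaultdict


def map_words(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle step: group emitted values by key (done by the framework in Hadoop)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_counts(word, counts):
    """Reduce step: collapse all values for one key into a single total."""
    return word, sum(counts)


def word_count(lines):
    """Run the three phases in order on an in-memory list of lines."""
    pairs = [pair for line in lines for pair in map_words(line)]
    return dict(reduce_counts(word, counts) for word, counts in shuffle(pairs).items())


if __name__ == "__main__":
    # Illustrative input only; a real job would read its input from HDFS.
    sample = ["Hadoop stores data in HDFS", "MapReduce processes data"]
    print(word_count(sample))
```

In a real cluster the map and reduce steps run on different machines and the framework handles the grouping between them, so this sketch should be read as a picture of the data flow rather than of how Hadoop executes a job.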
To begin, check whether virtualization is enabled on your system. On Windows, you can check this in Task Manager. If virtualization is disabled, you can enable it with the following steps:
1. Enter the BIOS by pressing the key for your laptop model, listed below, while the laptop boots.
2. Enable Virtualization Technology as shown below.
Once virtualization is enabled, follow the steps below, starting with downloading the requirements:
- You’ll need to install Oracle VirtualBox. As per your…