Running a MapReduce job


We will now run our first Hadoop MapReduce job. We will use the WordCount example job, which reads text files and
counts how often each word occurs.
The input is a set of text files, and the output is also a set of text files, in which each line contains a word and the number of
times it occurred, separated by a tab.
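For illustration only, a few lines of such an output file could look like the following; the words and counts shown here are made up, and the real values depend entirely on the input files:

foo	1
hadoop	17
the	4028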

Copy input data


$ ls -l /mnt/hgfs/Hadoopsw
total 3604
-rw-r--r-- 1 hduser hadoop  674566 Feb  3 10:17 pg20417.txt
-rw-r--r-- 1 hduser hadoop 1573112 Feb  3 10:18 pg4300.txt
-rw-r--r-- 1 hduser hadoop 1423801 Feb  3 10:18 pg5000.txt

Restart the Hadoop cluster


Restart your Hadoop cluster if it is not already running.
# bin/start-all.sh
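Once the daemons have started, you can check that they are running with the jps tool that ships with the JDK. On a typical single-node setup you would expect to see NameNode, SecondaryNameNode, DataNode, JobTracker and TaskTracker, although the exact list depends on your configuration:
# jps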

Copy local example data to HDFS
Before we run the actual MapReduce job, we first have to copy the files from our local file system to Hadoop's HDFS.
#bin/hadoop fs -mkdir /user/root
#bin/hadoop fs -mkdir /user/root/in
#bin/hadoop dfs -copyFromLocal /mnt/hgfs/Hadoopsw/*.txt /user/root/in
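To confirm the upload, you can list the target directory; it should now contain the three .txt files (output omitted here):
#bin/hadoop dfs -ls /user/root/in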

Run the MapReduce job


Now, we actually run the WordCount example job.
#cd $HADOOP_HOME
#bin/hadoop jar hadoop-examples-1.0.0.jar wordcount /user/root/in /user/root/out

This command will read all the files in the HDFS directory /user/root/in, process them, and store the result in the
HDFS directory /user/root/out.
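Note that the job will fail if the output directory already exists. If you want to re-run the job, remove the old output first (assuming you no longer need it):
#bin/hadoop dfs -rmr /user/root/out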

Check that the result was successfully stored in the HDFS directory /user/root/out:


#bin/hadoop dfs -ls /user/root

$ bin/hadoop dfs -ls /user/root/out
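With Hadoop 1.x you should typically see a _SUCCESS marker file, a _logs directory and one part-r-NNNNN file per reducer (here a single part-r-00000), although the exact contents depend on the job configuration.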

Retrieve the job result from HDFS
To inspect the file, you can copy it from HDFS to the local file system. Alternatively, you can use the command
# bin/hadoop dfs -cat /user/root/out/part-r-00000
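Since the output can be fairly large, it is often convenient to pipe it through standard Unix tools; for example, to show the 20 most frequent words (the counts will vary with your input):
# bin/hadoop dfs -cat /user/root/out/part-r-00000 | sort -k2 -n -r | head -n 20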

Copy the output to a local file.
$ mkdir /tmp/hadoop-output
# bin/hadoop dfs -getmerge /user/root/out/ /tmp/hadoop-output
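The getmerge command concatenates the part files from the HDFS directory into a single local file, which you can then inspect with ordinary tools. The local file name below is an assumption; check what getmerge actually created under /tmp/hadoop-output:
$ head /tmp/hadoop-output/out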

Hadoop Web Interfaces
Hadoop comes with several web interfaces, which are by default (see conf/hadoop-default.xml) available at the
following locations:


http://localhost:50030/ - web UI for the MapReduce job tracker(s)

http://localhost:50060/ - web UI for the task tracker(s)

http://localhost:50070/ - web UI for the HDFS name node(s)

These web interfaces provide concise information about what's happening in your Hadoop cluster. You might want to give
them a try.

MapReduce Job Tracker Web Interface


The job tracker web UI provides information about general job statistics of the Hadoop cluster, running/completed/failed
jobs and a job history log file. It also gives access to the Hadoop log files of the local machine (the machine on which the web
UI is running).
By default, it's available at http://localhost:50030/.
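You can also query the job tracker from the command line; for example, to list the currently running jobs (run from $HADOOP_HOME):
# bin/hadoop job -list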

A screenshot of Hadoop's Job Tracker web interface.

Task Tracker Web Interface
The task tracker web UI shows you running and non-running tasks. It also gives access to the Hadoop log files of the local
machine.
By default, it's available at http://localhost:50060/.

A screenshot of Hadoop's Task Tracker web interface.

HDFS Name Node Web Interface
The name node web UI shows you a cluster summary including information about total/remaining capacity, and live and dead
nodes. Additionally, it allows you to browse the HDFS namespace and view the contents of its files in the web browser. It
also gives access to the Hadoop log files of the local machine.
By default, it's available at http://localhost:50070/.
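A similar cluster summary (configured/remaining capacity, live and dead datanodes) is also available from the command line via the dfsadmin report, which can be handy when no browser is at hand:
# bin/hadoop dfsadmin -report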

A screenshot of Hadoop's Name Node web interface.
