您当前的位置: 首页 >  ar

段智华

暂无认证

  • 0浏览

    0关注

    1232博文

    0收益

  • 0浏览

    0点赞

    0打赏

    0留言

私信
关注
热门博文

Hadoop/spark安装实战(系列篇4) Hadoop MapReduce词频统计之小试牛刀

段智华 发布时间:2015-09-12 21:52:51 ,浏览量:0

 Hadoop/spark安装实战(系列篇4) Hadoop MapReduce词频统计之小试牛刀

运行hadoop 自带的例子的MapReduce 计算

1 上传文件到hadoop的hdfs的根目录 [root@localhost hadoop-1.2.1]# hadoop fs -put README.txt / 检验 [root@localhost hadoop-1.2.1]# hadoop fs -ls / Found 2 items -rw-r--r--   1 root supergroup       1366 2015-09-12 06:45 /README.txt drwxr-xr-x   - root supergroup          0 2015-09-12 06:34 /home [root@localhost hadoop-1.2.1]#

2 运行MapReduce 分词统计

[root@localhost hadoop-1.2.1]# hadoop jar  hadoop-examples-1.2.1.jar  wordcount /README.txt /wordcountoutput Warning: $HADOOP_HOME is deprecated.

15/09/12 06:48:00 INFO input.FileInputFormat: Total input paths to process : 1 15/09/12 06:48:00 INFO util.NativeCodeLoader: Loaded the native-hadoop library 15/09/12 06:48:00 WARN snappy.LoadSnappy: Snappy native library not loaded 15/09/12 06:48:03 INFO mapred.JobClient: Running job: job_201509120634_0001 15/09/12 06:48:04 INFO mapred.JobClient:  map 0% reduce 0% 15/09/12 06:48:27 INFO mapred.JobClient:  map 100% reduce 0% 15/09/12 06:48:34 INFO mapred.JobClient:  map 100% reduce 33% 15/09/12 06:48:36 INFO mapred.JobClient:  map 100% reduce 100% 15/09/12 06:48:36 INFO mapred.JobClient: Job complete: job_201509120634_0001 15/09/12 06:48:36 INFO mapred.JobClient: Counters: 29 15/09/12 06:48:36 INFO mapred.JobClient:   Job Counters 15/09/12 06:48:36 INFO mapred.JobClient:     Launched reduce tasks=1

3/运行结果 [root@localhost hadoop-1.2.1]# hadoop fs -ls /wordcountoutput Warning: $HADOOP_HOME is deprecated.

Found 3 items -rw-r--r--   1 root supergroup          0 2015-09-12 06:48 /wordcountoutput/_SUCCESS drwxr-xr-x   - root supergroup          0 2015-09-12 06:48 /wordcountoutput/_logs -rw-r--r--   1 root supergroup       1306 2015-09-12 06:48 /wordcountoutput/part-r-00000

[root@localhost hadoop-1.2.1]#  hadoop fs -cat  /wordcountoutput/part-r-00000 Warning: $HADOOP_HOME is deprecated.

(BIS),  1 (ECCN)  1 (TSU)   1 (see    1 5D002.C.1,      1 740.13) 1      1 Administration  1 Apache  1 BEFORE  1 BIS     1 Bureau  1 Commerce,       1 Commodity       1 Control 1 Core    1 Department      1 ENC     1 Exception       1 Export  2 For     1

。。。。。

hadoop小试牛刀 OK

关注
打赏
1659361485
查看更多评论
立即登录/注册

微信扫码登录

0.7408s