时间: 2020-09-17|65次围观|0 条评论


1. 编辑WordCount.java文件,在下载的hadoop安装包里有WordCount的例子

http://mirrors.hust.edu.cn/apache/hadoop/common/hadoop-2.2.0/hadoop-2.2.0.tar.gz

2. 把WordCount编译成一个jar文件

 mkdir wordcount_classes javac -cp classpath -d wordcount_classes WordCount.java jar -cvf wordcount.jar -C wordcount_classes/ . 
这里的classpath和之前的hadoop版本有所区别,需要按照新的设置方法,这一点网上很少提及!
新的classpath为:
$HADOOP_HOME/share/hadoop/common/hadoop-common-2.2.0.jar:$HADOOP_HOME/share/hadoop/mapreduce/hadoop-mapreduce-client-core-2.2.0.jar:$HADOOP_HOME/share/hadoop/common/lib/commons-cli-1.2.jar
 

3.创建HDFS文件夹

hadoop fs -mkdir wordCounthadoop fs -mkdir wordCount/input
echo "Hello World Bye World" > file0
echo "Hello Hadoop Goodbye Hadoop" > file1
hadoop fs -put file* wordCount/input

4.运行

hadoop jar wordcount.jar org.myorg.WordCount wordCount/input wordCount/outputhadoop fs -cat /user/cloudera/wordcount/output/part-00000

 

 

文章转载于:https://www.cnblogs.com/kxdblog/p/4115230.html

原著是一个有趣的人,若有侵权,请通知删除

本博客所有文章如无特别注明均为原创。
复制或转载请以超链接形式注明转自起风了,原文地址《命令行下编译Wordcount
   

还没有人抢沙发呢~