本文記錄sqoop test
星期一, 11月 19, 2012
星期一, 11月 12, 2012
[Hadoop] WordCount Sample
記錄一下第一次跑Hadoop WordCount Job的過程 :)
1. 建立HDFS資料夾
#全部的資料夾會自動建立
hduser@hadoop-master:/usr/local/hadoop$hadoop dfs -mkdir /home/hduser/wordcount
2. 匯入要分析的文件資料(local-dir)到HDFS資料夾
$hadoop dfs -copyFromLocal
$hadoop dfs -copyFromLocal
#匯入
hduser@hadoop-master:/usr/local/hadoop$hadoop dfs -copyFromLocal /home/hduser/wordcount /home/hduser/wordcount
#查看匯入的資料
hduser@hadoop-master:/usr/local/hadoop$ hadoop dfs -ls /home/hduser/wordcount
Warning: $HADOOP_HOME is deprecated.
Warning: $HADOOP_HOME is deprecated.
[Hadoop] WordCount
匯出.jar的時候請記得選取Main Class進入點,如下所示
然後再執行job的時候就不會說找不到了
hadoop@client:~/wordcount$ hadoop jar exercise.wordco.jar WordCo /user/hadoop/wordcount/pg5000.txt /user/hadoop/wordcount/output
訂閱:
意見 (Atom)
