Installing Spark and a Python Exercise

I. Installing Spark

1. Check the base environment: Hadoop and the JDK
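
The check in this step can be sketched in Python as follows. This is a minimal sketch, not the procedure used in the original post: it assumes `java` and `hadoop` are expected on the PATH, and the helper names `check_tool` and `check_environment` are made up for illustration.

```python
import shutil
import subprocess


def check_tool(name, version_args):
    """Return the first line of a tool's version output, or None if it is not on PATH."""
    exe = shutil.which(name)
    if exe is None:
        return None
    result = subprocess.run(
        [exe, *version_args],
        capture_output=True,
        text=True,
        check=False,
    )
    # `java -version` writes to stderr and `hadoop version` to stdout, so read both.
    output = (result.stdout + result.stderr).strip()
    return output.splitlines()[0] if output else ""


def check_environment():
    """Map each prerequisite to its reported version line (None means not found)."""
    return {
        "java": check_tool("java", ["-version"]),
        "hadoop": check_tool("hadoop", ["version"]),
    }


if __name__ == "__main__":
    for tool, version in check_environment().items():
        print(f"{tool}: {version if version is not None else 'not found on PATH'}")
```

A `None` entry means the tool has to be installed, or added to the PATH, before continuing with the Spark setup.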

     

     

 

 

2. Download Spark

II. Python programming exercise: word-frequency count of an English text

1. Prepare the text (f1.txt)

Please send this message to those people who mean something to you,to those who have touched your life in one way or another,to those who make you smile when you really need it,to those that make you see the brighter side of things when you are really down,to those who you want to let them know that you appreciate their friendship.And if you don’t, don’t worry,nothing bad will happen to you,you will just miss out on the opportunity to brighten someone’s day with this message.

2. The code

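# Path to the sample text prepared in step 1
path = '/home/hadoop/sb/f1.txt'
with open(path, encoding='utf-8') as f:
    text = f.read()
# Split on whitespace only; punctuation stays attached to the words
words = text.split()
# Count how many times each token occurs
sb = {}
for word in words:
    sb[word] = sb.get(word, 0) + 1
# Sort the (word, count) pairs by count, most frequent first
sblist = list(sb.items())
sblist.sort(key=lambda x: x[1], reverse=True)
print(sblist)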
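
Because the script above splits on whitespace only, a token such as "you,to" is counted as a single word, and "Please" and "please" are counted separately. One possible variation, offered as a sketch and not as the original author's method, normalizes the text first. The function name `word_frequencies` and the regular expression are assumptions chosen for illustration.

```python
import re
from collections import Counter


def word_frequencies(text):
    """Count words case-insensitively, ignoring surrounding punctuation."""
    # Keep straight and curly apostrophes inside a word (e.g. "don’t").
    words = re.findall(r"[a-z]+(?:['’][a-z]+)*", text.lower())
    # most_common() returns (word, count) pairs sorted by descending count.
    return Counter(words).most_common()


if __name__ == "__main__":
    sample = "to those who you want,to those who care.And if you don’t, don’t worry"
    print(word_frequencies(sample)[:3])
```

To apply it to the exercise file, read f1.txt as in the script above and pass the resulting string to `word_frequencies`.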

3. Output

 


Source: https://www.cnblogs.com/ssssuqi/p/15971517.html