STA9760_Yelp_Data_Analysis 源码
分析10Gb的Yelp评论数据 For this project, I will be tasked with provisioning a Spark Cluster on AWS EMR for loading and running some analysis on Yelp’s Reviews and Businesses dataset (about 10gb) from Kaggle. I will run my analysis via Jupyter Notebook and the expected output artifact is a .ipynb file 第
用户评论