Apache Zeppelin is an interactive development system that enables big data visualization and analytics. It can undertake tasks such as data ingestion, discovery, analytics, visualization, and collaboration. It provides a rich set of visual graph libraries such as SparkSQL on the frontend and supports big data systems like HBase and Flink in the form of plugin extension on the backend. In addition, it allows you to perform data preprocessing, algorithm development and debugging, and algorithm job scheduling for machine learning.
For EMR versions later than 3.1.0, the default login permission is set, and both the username and password are admin. To change the password, you can modify the
roles options in the configuration file
/usr/local/service/zeppelin-0.8.2/conf/shiro.ini. For more configuration instructions, see here.
val data = sc.textFile("cosn://huanan/zeppelin-spark-randomint-test") case class WordCount(word: String, count: Integer) val result = data.flatMap(x => x.split(" ")).map(x => (x, 1)).reduceByKey(_ + _).map(x => WordCount(x._1, x._2)) result.toDF().registerTempTable("result") %sql select * from result