Apache Superset is a web-based data browsing and visualization application.
Superset features:
Superset on EMR supports MySQL, Hive, Presto, Impala, Kylin, Druid, and ClickHouse.
Enter http://${master_ip}:18088
in your browser (or go to EMR Console > Component Management) to open the login page of Supserset. The default username is admin
, and the password is the one you set when creating the cluster.
Enter the Sources > Databases page and click Filter List.
Enter the following page and add the URI of the component to be added in SQLAlchemy URI.
The SQLAlchemy URI for each database is as follows:
Name | SQLAlchemy URI | Remarks |
---|---|---|
MySQL | mysql+pymysql://<mysqlname>:<password>@<mysql_ip>:<mysql_port>/<your_database> |
|
Hive | hive://hadoop@<master_ip>:7001/default?auth=NONE |
Master_ip: master_ip of EMR cluster |
Presto | presto://hive@<master_ip>:9000/hive/<hive_db_name> |
master_ip of EMR clusterdefault by default |
Impala | impala://<core_ip>:27000 |
core_ip: core IP of EMR cluster |
Kylin | kylin://<kylin_user>:<password>@<master_ip>:16500/<kylin_project> |
master_ip of EMR cluster |
ClickHouse | clickhouse://<user_name>:<password>@<clickhouse-server-endpoint>:8123/<database_name> |
clickhouse://default:password@localhost:8123/default |
Superset supports databases. To install another database, follow the steps below:
source /usr/local/service/superset/bin/activate
command.
Was this page helpful?