Tencent Cloud

Superset Overview

Last updated: 2022-05-16 12:52:26

Apache Superset is a web-based data browsing and visualization application. Superset on EMR supports MySQL, Hive, Presto, Impala, Kylin, Druid, and ClickHouse.

Superset Features

Supports almost all major databases such as MySQL, PostgresSQL, Oracle, SQL Server, SQLite, and Spark SQL as well as Druid.
Provides a wide variety of visual displays and allows you to create custom dashboards.
Makes data display controllable and enables customization of displayed fields, aggregated data, and data sources.

Prerequisites

You have created an EMR Hadoop or Druid cluster and selected the Superset service. For more information, see Creating EMR Cluster.
By default, Superset is installed on the master node of your cluster. Enable the security group policy for the master node and make sure that your network can access port 18088 of the master node.

Enter http://${master_ip}:18088 in your browser (or go to the EMR console > Cluster Service) to open the login page of Supserset. The default username is admin, and the password is the one you set when creating the cluster.

Adding Databases

Go to Sources > Databases and click Filter List.

On the following page, add the URI of the component to be added in SQLAlchemy URI.

The SQLAlchemy URI for each database is as follows:

Name	SQLAlchemy URI	Remarks
MySQL	`mysql+pymysql://<mysqlname>:<password>@<mysql_ip>:<mysql_port>/<your_database>`	`mysqlname`: Username used to connect to MySQL. `password`: MySQL password. `your_database`: The MySQL database to be connected to.

    | Hive | `hive://hadoop@<master_ip>:7001/default?auth=NONE` | `master_ip`: Master IP of the EMR cluster. |

| Presto | presto://hive@<master_ip>:9000/hive/<hive_db_name> |

Master_ip: master_ip of the EMR cluster

hive_db_name: Name of the database in Hive. If this parameter is left empty, it will be default by default |
| Impala | impala://<core_ip>:27000 | core_ip: core IP of EMR cluster. |
| Kylin | kylin://<kylin_user>:<password>@<master_ip>:16500/<kylin_project> |

kylin_user: Kylin username

password: Kylin password

master_ip: master_ip of the EMR cluster

kylin_project: Kylin project |
|ClickHouse |clickhouse://<user_name>:<password>@<clickhouse-server-endpoint>:8123/<database_name>| clickhouse://default:password@localhost:8123/default

user_name: Username

password: Password

clickhouse-server-endpoint: ClickHouse service endpoint

database_name: Name of the database to be accessed |

Adding New Databases on Your Own

Superset supports databases. To install another database, follow the steps below:

Log in to the server where the master node of EMR cluster resides.
Run the source /usr/local/service/superset/bin/activate command.
Install the corresponding Python library with pip3.
Restart Superset.

tencent cloud

Recent Pages

Superset Overview

Superset Features

Prerequisites

Adding Databases

Adding New Databases on Your Own

Was this page helpful?

Was this page helpful?

tencent cloud

Sign Up

Log in

Recent Pages

Superset Overview

Superset Features

Prerequisites

Login

Adding Databases

Adding New Databases on Your Own

Was this page helpful?

Was this page helpful?