Elastic MapReduce (EMR) consists of a series of open source applications in the big data ecosystem. EMR supports three types of clusters: Hadoop, Druid, and ClickHouse. Each EMR version contains a specific set of open source applications. When you create a cluster, you can choose the most appropriate EMR version based on your actual needs.
EMR comes with two component editions. One is the Standard edition, which is based on the stable open source community component edition to provide services. To keep your EMR clusters consistent with the evolution of the community, you are advised to choose the Standard edition. The other one is the TianQiong edition, which has incorporated Tencent Cloud's proprietary, mature, and stable features on the basis of the stable open source community component edition for better performance and stability. Please choose an edition that suits your business when you purchase a cluster. It is not recommended to switch between the two editions after the cluster is launched.
EMR version numbers are in the format of EMR va.b.c
as detailed below:
a
for different clusters are as follows:a
indicates the Hadoop versions supported by the current version. When a
is 1
or 2
, Hadoop v2.X is supported; when a
is 3
, Hadoop v3.X is supported.a
indicates the Druid versions supported by the current version. When a
is 1
, Druid v0.17.X is supported.a
indicates the ClickHouse versions supported by the current version. When a
is 1
, ClickHouse v19.X and v20.X are supported.b
indicates that the version has new components or supports component version upgrade.c
indicates feature optimization.Note:
- The components and their versions bundled with each EMR version are fixed. Currently, neither selecting multiple versions of a component nor changing a component version in one EMR version is supported. For example, Hadoop v2.7.3 and Spark v2.2.1 are built into EMR v2.0.1.
- Once a version of EMR is selected for cluster creation, the EMR and components used by the cluster will not be automatically upgraded. For example, if EMR v2.0.1 is selected, then Hadoop will always be v2.7.3, and Spark will always be v2.2.1. Even if you subsequently upgrade EMR to v2.1.0 where Hadoop v2.8.4 and Spark v2.3.2 are included, the previously created cluster will not be affected, and only new clusters will use the new versions.
- When you upgrade the cluster through data migration (for example, from EMR v2.0.1 to EMR v2.1.0), in order to avoid issues such as version incompatibility or environment changes, be sure to test the tasks to be migrated and ensure that they can work properly in the new software environment.
Hadoop cluster is available in Standard Edition and TianQiong Edition. For more information on the differences between the two editions, please see EMR TianQiong Introduction.
Currently, EMR Standard supports the Hadoop, Druid, and ClickHouse clusters.
Hadoop v2.X Standard supports the component versions listed in the following table:
Component | EMR v1.3.1 | EMR v2.0.1 | EMR v2.1.0 | EMR v2.2.0 | EMR v2.3.0 | EMR v2.4.0 | EMR v2.5.0 |
---|---|---|---|---|---|---|---|
Release Date | - | - | May 2019 | March 2020 | May 2020 | August 2020 | September 2020 |
Hadoop | 2.7.3 | 2.7.3 | 2.8.4 | 2.8.5 | 2.8.5 | 2.8.5 | 2.8.5 |
Spark | 2.0.2 | 2.2.1 | 2.3.2 | 2.4.3 | 2.4.3 | 3.0.0 | 3.0.0 |
Hive | 2.1.1 | 2.3.2 | 2.3.3 | 2.3.5 | 2.3.5 | 2.3.7 | 2.3.7 |
Tez | 0.8.5 | 0.8.5 | 0.8.5 | 0.9.2 | 0.9.2 | 0.9.2 | 0.9.2 |
Presto | 0.161 | 0.188 | 0.215 | 0.228 | 0.228 | 332 | 332 |
Storm | 1.1.0 | 1.1.0 | 1.1.0 | 1.2.3 | 1.2.3 | 1.2.3 | 1.2.3 |
Flink | 1.2.0 | 1.2.0 | 1.4.2 | 1.9.2 | 1.9.2 | 1.10.0 | 1.10.0 |
HBase | 1.2.4 | 1.3.1 | 1.3.1 | 1.4.9 | 1.4.9 | 1.4.9 | 1.4.9 |
Phoenix | 4.8.1 | 4.11.0 | 4.13.0 | 4.13.0 | 4.13.0 | 4.13.0 | 4.13.0 |
Ganglia | 3.7.2 | 3.7.2 | 3.7.2 | 3.7.2 | 3.7.2 | 3.7.2 | 3.7.2 |
Hue | 3.12.0 | 3.12.0 | 4.4.0 | 4.6.0 | 4.6.0 | 4.6.0 | 4.6.0 |
Sqoop | 1.4.6 | 1.4.6 | 1.4.7 | 1.4.7 | 1.4.7 | 1.4.7 | 1.4.7 |
Ooize | 4.3.1 | 4.3.1 | 4.3.1 | 5.1.0 | 5.1.0 | 5.1.0 | 5.1.0 |
Ranger | - | 0.7.1 | 0.7.1 | 1.2.0 | 1.2.0 | 1.2.0 | 1.2.0 |
ZooKeeper | 3.4.9 | 3.4.9 | 3.4.9 | 3.5.5 | 3.5.5 | 3.6.1 | 3.6.1 |
Flume | - | - | 1.8.0 | 1.9.0 | 1.9.0 | 1.9.0 | 1.9.0 |
Impala | - | - | - | 2.10.0 | 2.10.0 | 2.10.0 | 2.10.0 |
Kylin | - | - | - | 2.5.2 | 2.5.2 | 2.5.2 | 2.5.2 |
Zeppelin | - | - | - | 0.8.2 | 0.8.2 | 0.8.2 | 0.8.2 |
Alluxio | - | - | 1.8.1 | 1.8.1 | 1.8.1 | 1.8.1 | 2.3.0 |
Knox | 1.2.0 | 1.2.0 | 1.2.0 | 1.2.0 | 1.2.0 | 1.2.0 | 1.2.0 |
Kerberos | - | - | 1.15.0 | 1.15.0 | 1.15.0 | 1.15.0 | 1.15.0 |
Hudi | - | - | - | 0.5.1 | 0.5.1 | 0.5.1 | 0.5.1 |
Superset | - | - | - | 0.35.2 | 0.35.2 | 0.35.2 | 0.35.2 |
Livy | - | - | - | 0.7.0 | 0.7.0 | 0.7.0 | 0.7.0 |
TensorFlow | - | - | - | - | 1.4.4 | 1.4.4 | 1.4.4 |
Jupyter | - | - | - | - | 4.6.3 | 4.6.3 | 4.6.3 |
Hadoop v3.X Standard supports the component versions listed in the following table:
Component | EMR v3.0.0 | EMR v3.1.0 |
---|---|---|
Release Date | November 2019 | December 2020 |
Hadoop | 3.1.2 | 3.1.2 |
Spark | 2.4.3 | 2.4.3 |
Hive | 3.1.1 | 3.1.1 |
Tez | 0.9.2 | 0.9.2 |
Presto | 0.222 | 332 |
Flink | 1.8.1 | 1.10.0 |
HBase | 2.2.0 | 2.3.3 |
Hue | 4.4.0 | 4.4.0 |
Sqoop | 1.4.7 | 1.4.7 |
Ooize | 5.1.0 | 5.1.0 |
Ranger | 2.0.0 | 2..0.0 |
ZooKeeper | 3.4.9 | 3.6.1 |
Flume | 1.9.0 | 1.9.0 |
Impala | - | 3.4.0 |
Alluxio | 1.8.1 | 2.3.0 |
Knox | 1.2.0 | 1.2.0 |
Kudu | - | 1.13.0 |
Kerberos | 1.15.1 | 1.15.1 |
Druid clusters support the component versions listed in the following table:
Component | Druid v1.0.0 |
---|---|
Release Date | April 2020 |
Hadoop | 2.8.5 |
Druid | 0.17.0 |
ZooKeeper | 3.5.5 |
Knox | 1.2.0 |
Superset | 0.35.2 |
Ganglia | 3.7.2 |
ClickHouse clusters support the component versions listed in the following table:
Component | ClickHouse v1.0.0 | ClickHouse v1.1.0 |
---|---|---|
Release Date | April 2020 | May 2020 |
ClickHouse | 19.16.12.49 | 20.3.10.75 |
ZooKeeper | 3.4.9 | 3.4.9 |
Superset | - | 0.35.2 |
Currently, EMR TianQiong supports Hadoop clusters only. It has integrated the enhanced edition of Spark and Tencent's proprietary JDK Kona.
Hadoop v2.X TianQiong supports the component versions listed in the following table:
Component | EMR TianQiong v1.0.0 |
---|---|
Release Date | November 2020 |
Hadoop | 2.8.5 |
Spark | 3.0.1 (Enhanced) |
Hive | 2.3.7 |
Tez | 0.9.2 |
PrestoSQL | 332 |
Storm | 1.2.3 |
Flink | 1.10.0 |
HBase | 1.4.9 |
Phoenix | 4.13.0 |
Ganglia | 3.7.2 |
Hue | 4.6.0 |
Sqoop | 1.4.7 |
Ooize | 5.1.0 |
Ranger | 1.2.0 |
ZooKeeper | 3.6.1 |
Flume | 1.9.0 |
Impala | 2.10.0 |
Kylin | 2.5.2 |
Alluxio | 2.3.0 |
Knox | 1.2.0 |
Kerberos | 1.15.0 |
Hudi | 0.5.1 |
Superset | 0.35.2 |
Livy | 0.7.0 |
TensorFlow | 1.4.4 |
Jupyter | 4.6.3 |
Was this page helpful?