Node Type Description

Last updated: 2020-05-28 17:44:44

    EMR offers five types of nodes for your choice based on the cluster type:

    Hadoop Cluster

               
    Node Type Description HA Quantity Non-HA Quantity
    Master Processes such as NameNode, ResourceManager, and HMaster are deployed here. 2 1
    Core Processes such as DataNode, NodeManager, and RegionServer are deployed here. ≥ 3 ≥ 2
    Task Processes such as NodeManger and PrestoWork are deployed here.The number of task nodes can be changed at any time to achieve elastic scalability of the cluster. The minimum value is 0.
    Common Distributed coordinator components such as ZooKeeper and JournalNode are deployed here. ≥ 3 0
    Router Hadoop packages, including software programs and processes such as Hive, Hue, and Spark, are deployed here.The number of router nodes can be changed at any time. The minimum value is 0.
    • A master node is a management node that ensures that the scheduling of the cluster works properly.
    • A core node is a compute and storage node. All your data in HDFS is stored in core nodes. Therefore, in order to ensure data security, once core nodes are scale out, they cannot be scaled in.
    • A task node is a pure compute node and does not store any data. The computed data comes from a core node or COS. Therefore, it is often used as an elastic node and can be scaled in or out at any time.
    • A common node provides data sharing sync and HA fault tolerance services for the master node in an HA cluster.
    • A router node is used to share the load of a master node or as the task submitter of the cluster. It can be scaled in or out at any time.

    ClickHouse Cluster

    Node Type Description HA Quantity Non-HA Quantity
    Core The ClickHouseServer process is deployed here. ≥ 2 ≥ 1
    Common Nodes of the distributed coordinator component ZooKeeper are deployed here. ≥ 3 0
    • A core node is a compute and storage node.
    • A common node provides data sharing sync and HA fault tolerance services for the master node in an HA cluster.

    Druid Cluster

               
    Node Type Description HA Quantity Non-HA Quantity
    Master Processes such as NameNode, ResourceManager, Overlord, coordinator, ZKFailoverController, and JobHistoryServer are deployed here. 2 1
    Core Processes such as DataNode, NodeManager, middlemanager, and historical are deployed here. ≥ 3 ≥ 2
    Task Processes such as NodeManager and middlemanager are deployed here.The number of task nodes can be changed at any time to achieve elastic scalability of the cluster. The minimum value is 0.
    Common Distributed coordinator components such as ZooKeeper and JournalNode are deployed here. ≥ 3 0
    Router Hadoop packages, including software programs and processes such as broker, are deployed here.The number of router nodes can be changed at any time. The minimum value is 0.
    • A master node is a management node that ensures that the scheduling of the cluster works properly.
    • A core node is a compute and storage node. All your data in HDFS is stored in core nodes. Therefore, in order to ensure data security, once core nodes are scale out, they cannot be scaled in.
    • A task node is a pure compute node and does not store any data. The computed data comes from a core node or COS. Therefore, it is often used as an elastic node and can be scaled in or out at any time.
    • A common node provides data sharing sync and HA fault tolerance services for the master node in an HA cluster.
    • A router node is used to share the load of a master node or as the task submitter of the cluster. It can be scaled in or out at any time.

    Was this page helpful?

    Was this page helpful?

    • Not at all
    • Not very helpful
    • Somewhat helpful
    • Very helpful
    • Extremely helpful
    Send Feedback
    Help