Using COS as Deep Storage of Druid

Last updated: 2020-05-29 10:57:55

    Environment Dependencies

    • HADOOP-COS and Hadoop-COS-Java-SDK (included in the dep directory of HADOOP-COS)
    • Druid version: Druid-0.12.1

    Download and Installation

    Downloading HADOOP-COS

    Download HADOOP-COS on Github.

    Installing HADOOP-COS

    Druid-hdfs-extension is required if Druid uses COS for Deep Storage.
    After downloading HADOOP-COS, copy the version you want displayed as hadoop-cos-2.x.x.jar under the dep directory to the Druid installation path extensions/druid-hdfs-storage and the hadoop-dependencies/hadoop-client/2.x.x. Since Druid accesses COS using HDFS plugin, the version you selected needs to be the same as that of the HDFS plugin.


    Modifying configuration

    1. Modify the file `conf/druid/_common/· under Druid installation path, add the extension of hdfs to ·druid.extensions.loadList·, specify hdfs as Druid's deep storage, and enter the path of cosn:
    1. Create a hdfs configuration file hdfs-site.xml under the directory conf/druid/_common/, and enter your COS keys and other information:
    <?xml version="1.0" encoding="UTF-8"?>
    <?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
      Licensed under the Apache License, Version 2.0 (the "License");
      you may not use this file except in compliance with the License.
      You may obtain a copy of the License at
      Unless required by applicable law or agreed to in writing, software
      distributed under the License is distributed on an "AS IS" BASIS,
      WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
      See the License for the specific language governing permissions and
      limitations under the License. See accompanying LICENSE file.
    <!-- Put site-specific property overrides in this file. -->

    The supported items for the above configuration are exactly the same as those described in the HADOOP-COS official documentation. For more information, see HADOOP Tool.

    Getting started

    After the Druid processes are started in turn, the Druid data can be loaded into the COS.