Step 3: Installing, Configuring, and Starting GDS on a Data Server

This section describes how to install, configure, and start GDS on the server where source data files are stored. GDS will be used for importing data to DWS.

  1. Before using GDS to import or export data, complete the operations "Preparing an ECS as the GDS Data Server" and "Downloading the GDS Package and SSL Certificate" described in Tutorial: Using GDS to Import Data > Step 1: Preparing an ECS as the GDS Data Server.
  2. Log in as user root to the data server (192.168.0.90) where GDS is to be installed and run the following command to create the /opt/bin/dws directory for storing the GDS package:

    mkdir -p /opt/bin/dws

  3. Upload the GDS package dws_client_redhat_x64.tar.gz from the software installation package to the /opt/bin/dws directory.
  4. Go to the new directory and decompress the package:

    cd /opt/bin/dws
    tar -zxvf dws_client_redhat_x64.tar.gz

  5. (Optional) If SSL is used, upload the SSL certificate to the directory created in Step 2.

  6. Create the user gds_user and the user group gdsgrp it belongs to. This user is used to start GDS and must have the permission to read the source data file directory.

    groupadd gdsgrp
    useradd -g gdsgrp gds_user

  7. Change the owner of the GDS package and source data file directory to gds_user and the user group to gdsgrp.

    chown -R gds_user:gdsgrp /opt/bin/dws/gds
    chown -R gds_user:gdsgrp /input_data

  8. Switch to user gds_user.

    su - gds_user

  9. Start GDS.

    • If SSL encryption is not used, run the following command to start GDS:
      /opt/bin/dws/gds/gds -d /input_data/ -p 192.168.0.90:5000 -H 10.10.0.1/24 -l /opt/bin/dws/gds/gds_log.txt -D
    • If SSL encryption is used, run the following command to start GDS after performing Step 5:
      /opt/bin/dws/gds/gds -d /input_data/ -p 192.168.0.90:5000 -H 10.10.0.1/24 -l /opt/bin/dws/gds/gds_log.txt -D --enable-ssl --ssl-dir /opt/bin/dws/gds

    Replace the parameter values in the commands as required.

    • -d dir: directory storing data files that contain data to be imported. It is /input_data/ in this tutorial.
    • -p ip:port: listening IP address and port for GDS. The default IP address is 127.0.0.1; replace it with the IP address of a 10GE network that can communicate with DWS. The listening port can be any value from 1024 to 65535; the default port is 8098. This parameter is 192.168.0.90:5000 in this tutorial.
    • -H address_string: network segment for hosts that can connect to and use the GDS service. The value must be in CIDR format. This parameter is used to allow the DWS cluster to access the GDS service for data import. Ensure that the configured network segment covers all hosts in the DWS cluster.
    • -l log_file: GDS log directory and log file name. It is /opt/bin/dws/gds/gds_log.txt in this tutorial.
    • -D: runs GDS in daemon mode. This parameter is supported only on Linux.
    • --enable-ssl: enables SSL encryption for data transmission.
    • --ssl-dir: directory storing the SSL certificate. It must be the same as the certificate directory used in Step 5.
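
The parameter rules above can be checked before GDS is started. The following is a minimal sketch, not part of the GDS package: the helper names valid_port and gds_start_cmd are hypothetical, and the paths and addresses are the tutorial values. It only validates the listening port against the 1024 to 65535 range and prints the start command for review; it does not run GDS.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with GDS): validates the listening port
# and prints the GDS start command instead of running it.

# Tutorial values; replace as required.
GDS_BIN=/opt/bin/dws/gds/gds
DATA_DIR=/input_data/
LOG_FILE=/opt/bin/dws/gds/gds_log.txt
SSL_DIR=/opt/bin/dws/gds

# Succeeds only if $1 is an integer in the range 1024-65535.
valid_port() {
    case "$1" in
        ''|*[!0-9]*) return 1 ;;   # empty or contains a non-digit
    esac
    [ "$1" -ge 1024 ] && [ "$1" -le 65535 ]
}

# Usage: gds_start_cmd LISTEN_IP PORT ALLOWED_CIDR [ssl]
gds_start_cmd() {
    ip=$1 port=$2 cidr=$3 ssl=${4:-}
    if ! valid_port "$port"; then
        echo "invalid port: $port (expected 1024-65535)" >&2
        return 1
    fi
    cmd="$GDS_BIN -d $DATA_DIR -p $ip:$port -H $cidr -l $LOG_FILE -D"
    # Append the SSL options only when the fourth argument is "ssl".
    if [ "$ssl" = "ssl" ]; then
        cmd="$cmd --enable-ssl --ssl-dir $SSL_DIR"
    fi
    echo "$cmd"
}

# Print both variants used in this tutorial.
gds_start_cmd 192.168.0.90 5000 10.10.0.1/24
gds_start_cmd 192.168.0.90 5000 10.10.0.1/24 ssl
```

Review the printed command, then run it as gds_user. The sketch does not check that the -H network segment covers all hosts in the DWS cluster; verify that separately.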