• Data Warehouse Service

dws
  1. Help Center
  2. Data Warehouse Service
  3. User Guide
  4. Managing Clusters
  5. Creating a Cluster

Creating a Cluster

To use DWS in the public cloud environment, you need to create a data warehouse cluster by specifying only the node flavor and scale.

This section describes how to create a data warehouse cluster on the DWS management console.

Preparations Before Creating a Cluster

  • The flavor of nodes in the cluster has been evaluated.

    Choose as many nodes as possible to ensure high-performance storage and computing capabilities for service applications based on service requirements. A node in a data warehouse cluster contains two DataNodes. The DataNode instance stores service data (support for row-based, column-based, and hybrid storage), executes the data query tasks, and returns the execution results.

  • A network access topology has been designed.

    Plan an appropriate AZ and configure the network to isolate the data warehouse cluster from other public cloud services based on the service application requirements.

  • Ensure that the number of available nodes meets the following conditions. Otherwise, the cluster cannot be created. In this case, click Increase quota to submit a work order and apply for higher node quota.
    • The number of available nodes must be greater than or equal to three, because at least three nodes are required for creating a cluster. You can view the number of available nodes on the Cluster Management page.
    • The number of nodes in the cluster to be created must be less than or equal to the number of available nodes.

Creating a Cluster

  1. Log in to the management console at https://console.otc.t-systems.com/dws/.
  2. On the Cluster Management page, click Create DWS Cluster.
  3. Select the Region.

    Table 1 Region parameters

    Parameter

    Description

    Example Value

    Region

    Select the actual region where the cluster nodes run.

    For more information about AZs, see Regions and Endpoints.

    eu-de

    AZ

    Select an AZ associated with the cluster region.

    An AZ contains one or more physical DCs. It has independent cooling, fire extinguishing, moisture-proof, and electricity facilities. Within an AZ, computing, network, storage, and other resources are logically divided into multiple clusters. AZs within a region are interconnected using high-speed optical fibers to realize cross-AZ high availability for the system.

    eu-de-01

  4. Configure the node-related parameters.

    NOTE:

    The number of nodes in a new cluster cannot exceed the quota that can be used by a user or 32. If the node quota is insufficient, click Increase quota to submit a work order and apply for higher node quota.

    Figure 1 Configuring node-related parameters
    Table 2 Node configuration parameters

    Parameter

    Description

    Example Value

    Node Flavor

    Select vCPU and memory resources for the nodes in the cluster. Table 3 lists the node flavors supported by DWS.

    dws.m3.xlarge

    Nodes

    Specify the number of nodes in the cluster.

    The number of nodes ranges from 3 to 32.

    3

    Table 3 Flavor description

    Type

    vCPU Cores

    Memory

    Disk Size

    Disk Type

    dws.m1.xlarge.ultrahigh

    4

    32 GB

    256 GB

    General-purpose generation I, SSD

    dws.d1.xlarge

    4

    32 GB

    1.68 TB

    Local disk (HDD)

    dws.d2.15xlarge

    60

    540 GB

    13.41 TB

    Disk-intensive generation II (KVM), local disk (HDD)

    dws.d2.xlarge

    4

    32 GB

    1.68 TB

    Disk-intensive generation II (KVM), local disk (HDD)

    dws.m3.xlarge

    4

    32 GB

    160 GB

    General-purpose generation III (KVM), SSD

      

  5. Configure cluster-related parameters.

    Figure 2 Configuring the cluster
    Table 4 Parameter description

    Parameter

    Description

    Example Value

    Cluster Name

    Set the name of the data warehouse cluster.

    Enter 4 to 64 characters. Only case-insensitive letters, digits, hyphens (-), and underscores (_) are allowed. The value must start with a letter.

    dws-demo

    Cluster Version

    Display the version of the database instance installed in the cluster.

    NOTE:

    The version number is the default one and, as of publication, cannot be changed.

    1.2.1

    Default Database

    The default database name of the cluster is postgres.

    NOTE:

    This name cannot be changed.

    -

    Administrator Account

    Set the administrator name of the database.

    The administrator username must:

    • Consist of lowercase letters, digits, or underscores.
    • Start with a lowercase letter or an underscore.
    • Contain 1 to 63 characters.
    • Cannot be a keyword of the DWS database. For details about the keywords of the DWS database, see section Keyword in the Data Warehouse Service Database Developer Guide.

    dbadmin

    Administrator Password

    Set the password of the database administrator account.

    The password complexity requirements are as follows:
    • Consists of 8 to 32 characters.
    • Cannot be the same as the username or the username written in reverse order.
    • Must contain at least 3 of the following character types: uppercase letters, lowercase letters, digits, and special characters ~!@#%^&*()-_=+|[{}];:,<.>/?
    • Passes the weak password check.
    NOTE:

    Change the password regularly and keep it secure.

    Dws2018demo!

    Confirm Password

    Enter the database administrator password again.

    Dws2018demo!

    Database Port

    Set the port used when the client or application connects to the database in the cluster.

    The port ranges from 8000 to 10000.

    8000

  6. Configure network parameters.

    Figure 3 Configuring the network
    Table 5 Network parameters

    Parameter

    Description

    Example Value

    VPC

    Specify a virtual private network for nodes in a cluster to isolate networks of different services.

    If you create a data warehouse cluster for the first time and have not configured the VPC, click View VPC. On the VPC management console that is displayed, create a new VPC that satisfies your needs.

    For details about how to create a VPC, see section Creating a VPC in the Virtual Private Cloud User Guide.

    After selecting a VPC from the drop-down list, click View VPC to enter the VPC management console and view the detailed information about the VPC.

    You can click to refresh the options in the VPC drop-down list.

    vpc-dws

    Subnet

    Specify a subnet for the VPC.

    A subnet provides dedicated network resources that are isolated from other networks, improving network security.

    subnet-dws

    Security Group

    Specify a security group for the VPC.

    A security group restricts access rules to enhance security when DWS and other services access each other.

    • Automatically create a security group.

      If Automatic creation is selected, the system automatically creates a default security group. This option is selected by default.

      The rule of the default security group is as follows: The outbound allows all access requests, while the inbound is only open to the database port that you set to connect to the data warehouse cluster.

      The format of the default security group's name is dws-<cluster name>-<database port of the data warehouse cluster>, for example, dws-dws-demo-8000.

      NOTE:

      If the quotas of the security group and the security group rule are insufficient, an error message will be displayed after you submit the cluster creation application. Select an existing group and retry.

    • Manually create and configure a security group.

      You can also log in to the VPC management console to manually create a security group. Then, go back to the page for creating data warehouse clusters, click the button next to the Security Group drop-down list to refresh the page, and select the new security group.

      To enable the DWS client to connect to the cluster, you need to add an inbound rule to the new security group to grant the access permission to the database port of the data warehouse cluster. An example of the inbound rule is as follows:

      • Protocol: TCP
      • Port: 8000 Use the database port set when creating the data warehouse cluster. This port is used for receiving client connections to DWS.
      • Source: Select IP address and use the host IP address of the client, for example, 192.168.0.10/32.

    Automatic creation

    EIP

    Specify whether users can use a client to connect to a cluster's database over the Internet. The following methods are supported:

    • Do not use: The EIP is not required.
    • Automatically assign: Users specify the bandwidth of the EIP and the system automatically assigns an EIP that exclusively uses bandwidth to each cluster so that users can use the EIP to access the cluster over the Internet.
    • Specify: A specified EIP is bound to the cluster. If no available EIPs are displayed in the drop-down list, click Create EIP to go to the Elastic IP page and create an EIP that satisfies your needs. You can set the bandwidth as needed.

    Automatically assign

    Bandwidth

    When EIP is set to Automatically assign, you specify the bandwidth of the used EIP, which ranges from 1 Mbit/s to 100 Mbit/s.

    50 Mbit/s

  7. Select either of the following options for Advanced Settings:

    • Default: Indicates that the following advanced settings use the default configurations.
      • Parameter Group: The default database parameter group Default-Parameter-Group-DWS is associated with the cluster.
      • Tag: By default, no tag is added to the cluster.
      • Automated Snapshot: By default, the policy for automatically generating cluster snapshots is disabled.
    • Custom: If you select this option, set the following advanced settings:
      Figure 4 Custom advanced settings
      • Parameter Group

        A parameter group is a set of database parameters. You need to select a parameter group from the drop-down list and associate it with the cluster during cluster creation. You can select the default parameter group Default-Parameter-Group-DWS or a customized parameter group.

        For details about parameter groups, see section Managing Parameter Groups.

      • Tag

        A tag is a key-value pair used to identify a cluster. For details about the keys and values, see Table 6.

        For more information about tags, see section Tagging Overview.

        Table 6 Tag parameters

        Parameter

        Description

        Example Value

        Tag key

        You can:

        • Select a predefined tag key or an existing resource tag key from the drop-down list of the text box.
          NOTE:

          To add a predefined tag, you need to create one on TMS and select it from the drop-down list of Tag key. You can click View predefined tags to enter the Predefined Tag page of TMS. Then, click Create Tag to create a predefined tag. For details, see section Creating Predefined Tags in the Tag Management Service User Guide.

        • Enter a tag key in the text box. A tag key can contain a maximum of 36 Unicode characters, which cannot be null. The first and last characters cannot be spaces.

          Contain only uppercase letters (A to Z), lowercase letters (a to z), digits (0-9), hyphens (-), and underscores (_).

          NOTE:

          The key name must be unique in the same cluster.

        tagkey01

        Tag value

        You can:

        • Select a predefined tag value or resource tag value from the drop-down list of the text box.
        • Enter a tag value in the text box. A tag key can contain a maximum of 43 Unicode characters, which can be null. The first and last characters cannot be spaces.

          Contain only uppercase letters (A to Z), lowercase letters (a to z), digits (0-9), hyphens (-), and underscores (_).

        value01

      • Automated Snapshot

        Click Automated Snapshot to enable or disable the automatic snapshot policy for the cluster. After the automated snapshot policy is enabled, the system automatically creates snapshots based on the preset time and period.

        indicates that the policy is enabled.

        indicates that the policy is disabled (default).

        When it is enabled, configure the following parameters:

        Table 7 Parameter description

        Parameter

        Description

        Retention Days

        It is used to set the retention days of the snapshots that are automatically created. The value ranges from 1 to 10 days.

        NOTE:

        You cannot delete the snapshots that are automatically created. The system automatically deletes these snapshots when their retention duration expires.

        Start Time

        It is used to set the time when the automatic snapshot creation begins. The time must be set to an integer. The automatic creation task is triggered within one hour after the creation start time you set.

        Execution Period

        It is used to set the automatic snapshot creation cycle.

  8. Click Create Now. The Details page is displayed.

    NOTE:

    If the number of applied nodes, vCPU (cores), or memory (GB) exceed the user's remaining quota, a warning dialog box is displayed indicating insufficient quota and displaying the detailed remaining quota and the current quota application information. In this case, you can click Increase quota in the warning dialog box to submit a work order and apply for higher node quota.

    For details about quota, see What Is User Quota?.

  9. Click Submit.

    After the submission, Cluster Status of the newly created cluster is Creating. Wait several minutes. Clusters in the Available state are ready for use.