Resource Pools

ModelArts Resource Pools

When using ModelArts to implement AI Development Lifecycle, you can use two different resource pools to train and deploy models.

Dedicated Resource Pool

  • Dedicated resource pools can be used in the following jobs and tasks: deployment (including real-time services).

  • Dedicated resource pools are available only when they are in the Running status. If a dedicated resource pool is Unavailable or Abnormal, rectify the fault before using it.

Creating a Dedicated Resource Pool

  1. Log in to the ModelArts management console and choose Dedicated Resource Pools on the left.

  2. On the Dedicated Resource Pools page, the list of the created resource pools is displayed.

  3. Click Create in the upper left corner. The page for creating a dedicated resource pool is displayed.

  4. Set the parameters on the page. For details about how to set parameters, see Table 1.

    Table 1 Parameters of the Dedicated for Service Deployment type

    Parameter

    Description

    Resource Type

    The default value is Dedicated for Service Deployment and cannot be changed.

    Name

    Name of a dedicated resource pool.

    The value can contain letters, digits, hyphens (-), and underscores (_).

    Description

    Brief description of a dedicated resource pool.

    Nodes

    Select the number of nodes in a dedicated resource pool. More nodes mean higher computing performance.

    Specifications

    Required specifications. The GPU delivers better performance, and the CPU is more cost-effective.

    • CPU: modelarts.vm.cpu.8ud

    • GPU: modelarts.vm.gpu.16g.v100

  5. After confirming that the specifications are correct, create a dedicated resource pool as prompted. After a dedicated resource pool is created, its status changes to Running.

Scaling a Dedicated Resource Pool

After a dedicated resource pool is used for a period of time, you can scale out or in the capacity of the resource pool by increasing or decreasing the number of nodes.

The procedure for scaling is as follows:

  1. Go to the dedicated resource pool management page, locate the row that contains the desired dedicated resource pool, and click Scale in the Operation column.

  2. On the scaling page, increase or decrease the number of nodes. Increasing the node quantity scales out the resource pool whereas decreasing the node quantity scales in the resource pool. Scale the capacity based on service requirements.

    • During capacity expansion,

    • During capacity reduction, delete the target nodes in the Operation column. To reduce one node, you need to switch off the node in Node List to delete the node.

  3. Click Submit. After the request is submitted, the dedicated resource pool management page is displayed.

Deleting a Dedicated Resource Pool

If a dedicated resource pool is no longer needed during AI service development, you can delete the resource pool to release resources and reduce costs.

Note

  • After a dedicated resource pool is deleted, the training jobs, notebook instances, and deployment that depend on the resource pool are unavailable. A dedicated resource pool cannot be restored after being deleted. Exercise caution when deleting a dedicated resource pool.

  1. Go to the dedicated resource pool management page, locate the row that contains the desired dedicated resource pool, and click Delete in the Operation column.

  2. In the dialog box that is displayed, click OK.