• Data Warehouse Service

dws
  1. Help Center
  2. Data Warehouse Service
  3. Developer Guide
  4. Data Export
  5. Using GDS to Export Data to a Remote Server
  6. Exporting Data

Exporting Data

Prerequisites

Ensure that the IP addresses and ports of servers where CNs and DNs are deployed can connect to those of the GDS server.

Procedure

  1. Export data.

    INSERT INTO  [Foreign table name] SELECT * FROM [Source table name];
    NOTE:
    • Create batch processing scripts to export data in parallel. The degree of parallelism depends on the server resource usage. You can test several tables and monitor resource usage to determine whether to increase or reduce the amount. Common resource monitoring commands include top for monitoring memory and CPU usage, iostat for monitoring I/O usage, and sar for monitoring networks. Examples are provided in Example: Exporting Data Using Multiple Threads.
    • Only a single internal table can be exported at a time. Multi-table join is not supported during export. Results of the aggregation, sort, subquery, and limit operations on a single table also cannot be exported.

Examples

  • Example 1: Export data from the reasons table to data files through the foreign_tpcds_reasons foreign table.
    INSERT INTO foreign_tpcds_reasons SELECT * FROM reasons;
  • Example 2: Export part of the data to data files by specifying the filter condition r_reason_sk =1.
    INSERT INTO foreign_tpcds_reasons SELECT * FROM reasons WHERE r_reason_sk=1;
  • Example 3: Data of a special type, such as RAW, is exported as a binary file, which cannot be recognized by the import tool. You need to use the RAWTOHEX() function to convert it to hexadecimal the format before export.
    INSERT INTO foreign_blob_type_tab SELECT RAWTOHEX(c) FROM blob_type_tab;