section> Computing
  • Auto Scaling
  • Bare Metal Server
  • Dedicated Host
  • Elastic Cloud Server
  • FunctionGraph
  • Image Management Service
Network
  • Direct Connect
  • Domain Name Service
  • Elastic IP
  • Elastic Load Balancing
  • Enterprise Router
  • NAT Gateway
  • Private Link Access Service
  • Secure Mail Gateway
  • Virtual Private Cloud
  • Virtual Private Network
  • VPC Endpoint
Storage
  • Cloud Backup and Recovery
  • Cloud Server Backup Service
  • Elastic Volume Service
  • Object Storage Service
  • Scalable File Service
  • Storage Disaster Recovery Service
  • Volume Backup Service
Application
  • API Gateway (APIG)
  • Application Operations Management
  • Application Performance Management
  • Distributed Message Service (for Kafka)
  • Simple Message Notification
Data Analysis
  • Cloud Search Service
  • Data Lake Insight
  • Data Warehouse Service
  • DataArts Studio
  • MapReduce Service
  • ModelArts
  • Optical Character Recognition
Container
  • Application Service Mesh
  • Cloud Container Engine
  • Cloud Container Instance
  • Software Repository for Containers
Databases
  • Data Replication Service
  • Distributed Cache Service
  • Distributed Database Middleware
  • Document Database Service
  • GeminiDB
  • Relational Database Service
  • TaurusDB
Management & Deployment
  • Cloud Create
  • Cloud Eye
  • Cloud Trace Service
  • Config
  • Log Tank Service
  • Resource Formation Service
  • Tag Management Service
Security Services
  • Anti-DDoS
  • Cloud Firewall
  • Database Security Service
  • Dedicated Web Application Firewall
  • Host Security Service
  • Identity and Access Management
  • Key Management Service
  • Web Application Firewall
Other
  • Enterprise Dashboard
  • Marketplace
  • Price Calculator
  • Status Dashboard
APIs
  • REST API
  • API Usage Guidelines
  • Endpoints
Development and Automation
  • SDKs
  • Drivers and Tools
  • Terraform
  • Ansible
  • Cloud Create
Architecture Center
  • Best Practices
  • Blueprints
IaaSComputingAuto ScalingBare Metal ServerDedicated HostElastic Cloud ServerFunctionGraphImage Management ServiceNetworkDirect ConnectDomain Name ServiceElastic IPElastic Load BalancingEnterprise RouterNAT GatewayPrivate Link Access ServiceSecure Mail GatewayVirtual Private CloudVirtual Private NetworkVPC EndpointStorageCloud Backup and RecoveryCloud Server Backup ServiceElastic Volume ServiceObject Storage ServiceScalable File ServiceStorage Disaster Recovery ServiceVolume Backup ServicePaaSApplicationAPI Gateway (APIG)Application Operations ManagementApplication Performance ManagementDistributed Message Service (for Kafka)Simple Message NotificationData AnalysisCloud Search ServiceData Lake InsightData Warehouse ServiceDataArts StudioMapReduce ServiceModelArtsOptical Character RecognitionContainerApplication Service MeshCloud Container EngineCloud Container InstanceSoftware Repository for ContainersDatabasesData Replication ServiceDistributed Cache ServiceDistributed Database MiddlewareDocument Database ServiceGeminiDBRelational Database ServiceTaurusDBManagementManagement & DeploymentCloud CreateCloud EyeCloud Trace ServiceConfigLog Tank ServiceResource Formation ServiceTag Management ServiceSecuritySecurity ServicesAnti-DDoSCloud FirewallDatabase Security ServiceDedicated Web Application FirewallHost Security ServiceIdentity and Access ManagementKey Management ServiceWeb Application FirewallOtherOtherEnterprise DashboardMarketplacePrice CalculatorStatus Dashboard

MapReduce Service

  • Using Alluxio
  • Using CarbonData (for Versions Earlier Than MRS 3.x)
  • Using CarbonData (for MRS 3.x or Later)
  • Using ClickHouse
  • Using DBService
  • Using Flink
  • Using Flume
  • Using HBase
  • Using HDFS
  • Using Hive
  • Using Hudi
  • Using Hue (Versions Earlier Than MRS 3.x)
  • Using Hue (MRS 3.x or Later)
  • Using Kafka
  • Using KafkaManager
  • Using Loader
    • Using Loader from Scratch
    • How to Use Loader
    • Loader Link Configuration
    • Managing Loader Links (Versions Earlier Than MRS 3.x)
    • Source Link Configurations of Loader Jobs
    • Destination Link Configurations of Loader Jobs
    • Managing Loader Jobs
    • Preparing a Driver for MySQL Database Link
    • Loader Log Overview
    • Example: Using Loader to Import Data from OBS to HDFS
    • Common Issues About Loader
  • Using MapReduce
  • Using Oozie
  • Using OpenTSDB
  • Using Presto
  • Using Ranger (MRS 1.9.2)
  • Using Ranger (MRS 3.x)
  • Using Spark
  • Using Spark2x
  • Using Sqoop
  • Using Storm
  • Using Tez
  • Using Yarn
  • Using ZooKeeper
  • Appendix
  • Change History
  • Component Operation Guide
  • Using Loader
  • Source Link Configurations of Loader Jobs

Source Link Configurations of Loader Jobs¶

Overview¶

When Loader jobs obtain data from different data sources, a link corresponding to a data source type needs to be selected and the link properties need to be configured.

This section applies to versions earlier than MRS 3.x.

obs-connector¶

Table 1 Data source link properties of obs-connector¶

Parameter

Description

Bucket Name

OBS file system for storing source data.

Source Directory/File

Actual storage form of source data. It can be either all data files in a directory or a single data file contained in the file system.

File Format

Loader supports the following file formats of data stored in OBS:

  • CSV_FILE: Specifies a text file. When the destination link is a database link, only the text file is supported.

  • BINARY_FILE: Specifies binary files excluding text files.

Line Separator

Identifier of each line end of source data

Field Separator

Identifier of each field end of source data

Encoding Type

Text encoding type of source data. It takes effect on text files only.

File Split Type

The following types are supported:

  • File: The number of files is assigned to a map task by the total number of files. The calculation formula is Total number of files/Extractors.

  • Size: A file size is assigned to a map task by the total file size. The calculation formula is Total file size/Extractors.

generic-jdbc-connector¶

Table 2 Data source link properties of generic-jdbc-connector¶

Parameter

Description

Schema/Tablespace

Name of the database storing source data. You can query and select it on the interface.

Table Name

Data table storing the source data. You can query and select it on the interface.

Partition Column

If multiple columns need to be read, use this column to split the result and obtain data.

Where Clause

Query statement used when accessing the database

ftp-connector or sftp-connector¶

Table 3 Data source link properties of ftp-connector or sftp-connector¶

Parameter

Description

Source Directory/File

Actual storage form of source data. It can be either all data files in a directory or single data file contained in the file server.

File Format

Loader supports the following file formats of data stored in the file server:

  • CSV_FILE: Specifies a text file. When the destination link is a database link, only the text file is supported.

  • BINARY_FILE: Specifies binary files excluding text files.

Line Separator

Identifier of each line end of source data

Note

If FTP or SFTP serves as a source link and File Format is set to BINARY_FILE, the value of Line Separator in the advanced properties is invalid.

Field Separator

Identifier of each field end of source data

Note

If FTP or SFTP serves as a source link and File Format is set to BINARY_FILE, the value of Field Separator in the advanced properties is invalid.

Encoding Type

Text encoding type of source data. It takes effect on text files only.

File Split Type

The following types are supported:

  • File: The number of files is assigned to a map task by the total number of files. The calculation formula is Total number of files/Extractors.

  • Size: A file size is assigned to a map task by the total file size. The calculation formula is Total file size/Extractors.

hbase-connector¶

Table 4 Data source link properties of hbase-connector¶

Parameter

Description

Table Name

HBase table storing source data

hdfs-connector¶

Table 5 Data source link properties of hdfs-connector¶

Parameter

Description

Source Directory/File

Actual storage form of source data. It can be either all data files in a directory or single data file contained in HDFS.

File Format

Loader supports the following file formats of data stored in HDFS:

  • CSV_FILE: Specifies a text file. When the destination link is a database link, only the text file is supported.

  • BINARY_FILE: Specifies binary files excluding text files.

Line Separator

Identifier of each line end of source data

Note

If HDFS serves as a source link and File Format is set to BINARY_FILE, the value of Line Separator in the advanced properties is invalid.

Field Separator

Identifier of each field end of source data

Note

If HDFS serves as a source link and File Format is set to BINARY_FILE, the value of Field Separator in the advanced properties is invalid.

File Split Type

The following types are supported:

  • File: The number of files is assigned to a map task by the total number of files. The calculation formula is Total number of files/Extractors.

  • Size: A file size is assigned to a map task by the total file size. The calculation formula is Total file size/Extractors.

hive-connector¶

Table 6 Data source link properties of hive-connector¶

Parameter

Description

Database Name

Name of the Hive database storing the data source. You can query and select it on the interface.

Table

Name of the Hive table storing the data source. You can query and select it on the interface.

  • Prev
  • Next
last updated: 2025-07-09 15:07 UTC - commit: cb943fa3145d5c3e150bb4fa1a987d24c3077fe9
Edit pageReport Documentation Bug
Page Contents
  • Source Link Configurations of Loader Jobs
    • Overview
    • obs-connector
    • generic-jdbc-connector
    • ftp-connector or sftp-connector
    • hbase-connector
    • hdfs-connector
    • hive-connector
© T-Systems International GmbH
  • Contact
  • Data privacy
  • Disclaimer of Liabilities
  • Imprint