System Container Exits Unexpectedly¶
Symptom¶
After a training job is created, the system container exits unexpectedly.
Possible Causes¶
The possible causes are as follows:
An error occurred in OBS.
Unavailable file: The specified key does not exist.
Insufficient OBS permissions
OBS traffic limiting
Others
The disk space is insufficient.
Solution¶
For an OBS error:
Unavailable file: The specified key does not exist.
For details, see Error Message "errorMessage:The specified key does not exist" Displayed in Logs.
Insufficient OBS permissions
For details, see What Should I Do If Error "stat:403 reason:Forbidden" Is Displayed in Logs When a Training Job Accesses OBS.
OBS traffic limiting
For details, see Error Message "BrokenPipeError: Broken pipe" Displayed When OBS Data Is Copied.
Others
Alternatively, collect the request ID and contact OBS customer service.
For insufficient disk space:
For details, see Common Issues Related to Insufficient Disk Space and Solutions.