If a backup job fails or one of the tasks within the job fails, you can enable automatic restart of the failure by configuring auto retry in the
entrypoint.sh file. Auto retry can be useful in situations where the failure is due to an intermittent issue, such as a network or service interruption.
Prerequisites
In
PowerProtect Data Manager, some services that are required for auto retry, such as the workflow service, have been moved into a docker container. In order to enable auto retry, ensure that the workflow service is running in a docker.
About this task
Auto retry is only supported for daily, weekly, or monthly schedules for virtual machine and
File System agent protection operations.
Steps
Log in to the
PowerProtect Data Manager server by using SSH.
Copy the
entrypoint.sh file from the workflow container by typing the following:
docker cp workflow:/workflow/bin/entrypoint.sh .
Configure auto retry by adding a line to
entrypoint.sh:
Type
vi entrypoint.sh
Before the last line in the output,
-jar /${APP_NAME}/lib/workflow-manager.jar), add the following:
-Denable.auto.retry.scheduler=true \
NOTE Auto retry is disabled by default. After adding this line, if you want to disable this setting at any point, change the entry to
-Denable.auto.retry.scheduler=false \
Optionally, add the following application properties to the file to specify a maximum number of auto retries and a time interval at which subsequent auto retry attempts will occur:
-Dfailed.job.retry.max.count=2 \
-Dfailed.job.retry.interval=PT30M \
NOTE The values specified above are the recommended default values. Auto retries will only occur during the activity window. If you perform a manual retry in the
PowerProtect Data Manager UI, this retry will not count towards the auto retry max count.
For the interval duration, the value must be specified in ISO-8601 format.
Save the
entrypoint.sh file to the workflow container by typing the following:
docker cp entrypoint.sh workflow:/workflow/bin/
Restart the workflow service by using one of the following methods:
Type
docker container restart workflow
NOTE For the configuration to be applied successfully using this method, you can only restart the container. If you restart your workflow service or your
PowerProtect Data Manager operating system, the configuration will be lost.
Type the following to save the docker image and restart the workflow service. For example:
docker commit workflow dpd/ppdm/ppdmc-workflow:PowerProtect Data Manager version
workflow restart
where
PowerProtect Data Manager version is the
PowerProtect Data Manager version that is deployed on your system.
You can use this method to permanently apply the configuration change after restoring the docker image.
Results
Upon configuration, the workflow service is scheduled to run every 30 minutes to determine if any jobs or tasks have failed. If a restart occurred, the status will indicate
Running or
Queued. To view whether a failed job or task has been restarted, go to the
Jobs window in the
PowerProtect Data Manager UI and select
Running or
Queued.
Data is not available for the Topic
Please provide ratings (1-5 stars).
Please provide ratings (1-5 stars).
Please provide ratings (1-5 stars).
Please select whether the article was helpful or not.
Comments cannot contain these special characters: <>()\