1508324ce7
In the database backup framework (_backup_main.sh.tpl), the backup_databases function exits with code 1 if the store_backup_remotely function fails to send the backup to the remote RGW. This causes the pod to fail and be restarted by the cronjob repeatedly until the backoff retry limit (6 by default) is reached, which creates many copies of the same backup on the file system. Because the default k8s behavior is to delete the job/pods once the backoff limit has been exceeded, troubleshooting then becomes more difficult (although the logs may still be available in Elasticsearch). This patch changes the return code to 0 so that the pod does not fail in that scenario. The error logs generated should be enough to flag the failure (via Nagios or whatever alerting system is in use). Change-Id: Ie1c3a7aef290bf6de4752798821d96451c1f2fa5 |
||
---|---|---|
.. | ||
db-backup-restore | ||
_create-s3-bucket.sh.tpl | ||
_create-s3-user.sh.tpl | ||
_db-drop.py.tpl | ||
_db-init.py.tpl | ||
_db-pg-init.sh.tpl | ||
_image-repo-sync.sh.tpl | ||
_ks-domain-user.sh.tpl | ||
_ks-endpoints.sh.tpl | ||
_ks-service.sh.tpl | ||
_ks-user.sh.tpl | ||
_rabbit-init.sh.tpl | ||
_rally_test.sh.tpl |
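
The behavior described in the commit message above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the actual contents of _backup_main.sh.tpl: only the names backup_databases and store_backup_remotely come from the commit message, while the function bodies, the log_error helper, and the archive name are hypothetical.

```shell
#!/bin/bash

# Hypothetical stand-in for the real upload step; it always fails here
# so that the failure path can be exercised.
store_backup_remotely() {
  return 1
}

# Hypothetical logging helper; writes an error line to stderr.
log_error() {
  echo "ERROR: $*" >&2
}

# Sketch of the patched behavior: a failed remote upload is logged,
# but the function returns 0 so the pod is not marked as failed.
backup_databases() {
  local archive="$1"
  if ! store_backup_remotely "${archive}"; then
    log_error "Backup ${archive} could not be sent to the remote RGW; local copy kept."
    return 0
  fi
  echo "Backup ${archive} stored remotely."
  return 0
}

backup_databases "example-backup.tar.gz"
status=$?
echo "exit status: ${status}"
```

With a return code of 0 the job completes instead of retrying up to the backoff limit, so detecting the failed upload depends entirely on the alerting system picking up the error line in the logs.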