Database persistence

Today, db persistence is not enabled by default. We need to validate with persistence enabled.

Recovery from crashes/disruptions

The scenarios that need to be validated:

rsync can restart after a crash. Aarna, as part of EMCO backup/restore presentation, has tested blowing away the EMCO namespace (incl. EMCO pods and db), and restoring it.

Graceful handling of cluster connectivity failure

Without the GitOps model, rsync should apply configurable retry/timeout policies to handle cluster connectivity loss. We have the
/projects/.../{dig}/stop API but that is a workaround -- the user needs to invoke that API manually.

Question: can we recommend the GitOps approach and leave things as is? If not, we need to fix this.