Announcing backups for GCP-hosted hubs!

2i2c are pleased to announce the development and deployment of automated backups of home directories on GCP-hosted hubs!

We have developed the gcp-filestore-backups project, which regularly creates backups of JupyterHub home directories for disaster recovery purposes. The project is a Python wrapper around the gcloud tool that requests backups of the Filestore hosting JupyterHub’s user home directories, by default once a day. The script also manages retention: it checks how recently the last backup was made and the age of existing backups, and by default deletes any backup older than 5 days.
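To make the scheduling and retention behaviour concrete, here is a minimal sketch of the two decisions described above: whether a new backup is due, and which existing backups have aged out. It is an illustration only, not the project's actual code. The function names `backup_is_due` and `expired_backups` are hypothetical, and the sketch works on plain timestamps rather than calling gcloud.

```python
from datetime import datetime, timedelta, timezone


def backup_is_due(backup_times, now, interval=timedelta(days=1)):
    """Return True if there is no backup yet, or the newest one is at least `interval` old.

    Hypothetical helper for illustration; the 1-day default mirrors the
    daily schedule described in this post.
    """
    if not backup_times:
        return True
    return now - max(backup_times) >= interval


def expired_backups(backup_times, now, retention=timedelta(days=5)):
    """Return the creation times of backups older than `retention`.

    Hypothetical helper for illustration; the 5-day default mirrors the
    default retention period described in this post.
    """
    return [t for t in backup_times if now - t > retention]


# Example: three backups taken 12 hours, 2 days, and 6 days ago.
now = datetime(2024, 1, 10, tzinfo=timezone.utc)
backups = [now - timedelta(hours=12), now - timedelta(days=2), now - timedelta(days=6)]

due = backup_is_due(backups, now)        # newest backup is only 12 hours old
to_delete = expired_backups(backups, now)  # only the 6-day-old backup exceeds 5 days
```

In a real deployment, the timestamps would come from listing the Filestore backups, and the results would drive the corresponding create and delete requests.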

Having these backups enabled means that, in the unlikely and unfortunate case of data loss or corruption, we can restore the hub's home directories to a recent state, at most 1 day prior to the incident.

We have deployed gcp-filestore-backups to all our currently running GCP hubs, with a retention period of 2 days. If you would like to discuss this further with us, please get in touch!

As ever, this project has been developed openly in line with our Right to Replicate so you can deploy it against your own infrastructure!

Sarah Gibson
Open Source Infrastructure Engineer