EventStoreDb snapshots and restore

Hello,

We have EventstoreDb v24.10 installed in a cluster of three.

I’ve read the v24.10 documentation on backups and restores, but wanted to confirm a few things if I can.

We currently have the DB, Index and Logs files on a single data disk (let’s say E:) on a virtual machine in Azure.
At the moment we are thinking of configuring a nightly full snapshot of the E: drive on the first node (1 of 3).
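
For illustration, here is a rough sketch of how such a nightly snapshot could be scripted. It only builds the Azure CLI command rather than running it. The resource group and disk names are made-up placeholders, and the date-stamped naming scheme is an assumption, not something from the documentation.

```python
from datetime import date


def snapshot_command(resource_group, disk_name, day):
    """Build an `az snapshot create` command for the given data disk.

    The snapshot name (<disk>-<YYYYMMDD>) is a hypothetical naming scheme.
    """
    snapshot_name = f"{disk_name}-{day:%Y%m%d}"
    return [
        "az", "snapshot", "create",
        "--resource-group", resource_group,
        "--name", snapshot_name,
        "--source", disk_name,
    ]


if __name__ == "__main__":
    # "my-rg" and "node1-data-disk" are placeholder names.
    print(" ".join(snapshot_command("my-rg", "node1-data-disk", date.today())))
```

The printed command could then be run from whatever scheduler handles the nightly job.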

Am I correct in thinking if we needed to restore from this snapshot I can perform the following:

  1. Stop the node (the EventStoreDb service) on that virtual machine
  2. Shutdown the virtual machine
  3. Detach the E: Drive disk
  4. Create a new disk from the snapshot
  5. Attach the new E: drive created from the snapshot
  6. Start the virtual machine
  7. Start EventStoreDb pointing it to the config file
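
To make the steps above concrete, here is a minimal sketch that lists them alongside the Azure CLI command each one might map to. It prints a plan and does not execute anything. All resource names are placeholders, and steps 1 and 7 are left as manual steps because the service name and config path are not given here.

```python
def restore_plan(resource_group, vm_name, old_disk, snapshot_name, new_disk):
    """Return (description, az command or None) pairs for a single-node restore.

    None marks a step carried out inside the VM rather than via the Azure CLI.
    """
    rg = ["--resource-group", resource_group]
    return [
        ("Stop the EventStoreDb service on the VM", None),
        ("Shut down (deallocate) the VM",
         ["az", "vm", "deallocate", *rg, "--name", vm_name]),
        ("Detach the current data disk",
         ["az", "vm", "disk", "detach", *rg, "--vm-name", vm_name, "--name", old_disk]),
        ("Create a new disk from the snapshot",
         ["az", "disk", "create", *rg, "--name", new_disk, "--source", snapshot_name]),
        ("Attach the new disk (check that it comes up as E:)",
         ["az", "vm", "disk", "attach", *rg, "--vm-name", vm_name, "--name", new_disk]),
        ("Start the VM",
         ["az", "vm", "start", *rg, "--name", vm_name]),
        ("Start EventStoreDb pointing it to the config file", None),
    ]


if __name__ == "__main__":
    # Every name below is a hypothetical placeholder.
    plan = restore_plan("my-rg", "esdb-node1", "node1-data-disk",
                        "node1-data-disk-20250102", "node1-data-disk-restored")
    for number, (description, command) in enumerate(plan, start=1):
        print(f"{number}. {description}")
        print("   " + (" ".join(command) if command else "(manual step inside the VM)"))
```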

My questions:

  1. Am I missing anything above?
  2. Can / do we do the same for the other two nodes in the cluster from the same snapshot?
  3. If so, do I stop all 3 nodes at the same time or am I okay to bring them down one by one and restore the E: Drive on each (if needed)?
  4. Is this still something we need to do when taking Azure snapshots nightly?

Any responses or further information you think is important is greatly appreciated.

Thank you

Hi, great questions!

Personally, I think you can skip this if you are taking regular Azure snapshots, as many of our customers do. But, keep a few things in mind:

  • If you only take nightly snapshots, your worst-case RPO is approximately 24 hrs. This might not be acceptable, so you might consider more frequent snapshots on the Azure side.
  • You can do the same for all three nodes, which might increase your RTO. You can also take a snapshot of another node and repurpose it into a new cluster node with (almost) the same configuration; node-specific settings such as hostnames would need to be modified after restoring the snapshot.
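
As a quick illustration of the RPO point, here is a small sketch of how the worst-case data-loss window shrinks as snapshots become more frequent. It assumes snapshots are evenly spaced through the day and ignores the time a snapshot takes to complete, so treat the numbers as rough upper bounds.

```python
from datetime import timedelta


def worst_case_rpo(snapshots_per_day):
    """Worst-case data-loss window, assuming evenly spaced snapshots."""
    if snapshots_per_day < 1:
        raise ValueError("need at least one snapshot per day")
    return timedelta(hours=24) / snapshots_per_day


if __name__ == "__main__":
    for count in (1, 4, 24):
        print(f"{count} snapshot(s) per day: worst-case RPO {worst_case_rpo(count)}")
```

With one snapshot per day this gives the 24-hour figure above; four per day brings it down to six hours.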

I hope this helps!

Hi Tony,

That’s great, thank you.

So I could use the same snapshot of the data disk (where the EventStore DB, Index and Logs are) for all Virtual machines (3 nodes) in theory?

We’ll definitely be looking to make more frequent snapshots

Yes, you should be good to do that

Please make certain you test your recovery process to ensure that it works end-to-end!

That’s great. Thank you Tony