3 votes

What happens when one of several local SSDs attached to a Compute Engine instance has a hardware failure? Specifically:

  1. Is the failure automatically detected by Google Cloud Platform?
  2. Is there a notification, such as by email?
  3. How long does it take for the drive to be replaced?
  4. Is the VM stopped and restarted after the replacement, or is it a hot-swap?
  5. Obviously, the data on that SSD is lost; however, what happens to the data on the other SSDs attached to the same virtual machine?

Edit: I am aware of the "ephemeral" nature of local SSDs, and plan to replicate my data on multiple machines across different zones in my primary region, with at least one replica in a completely different region. The database I plan to use is "data-center/rack aware". I am specifically looking for documentation/information about how Google Cloud Platform handles hardware failures in local SSDs.

Does this answer your question? Google Cloud - Local SSD hardware failure? - Martin Zeitler
@MartinZeitler Not really. I am aware of the "ephemeral" nature of local SSDs, and I will have data replicated across multiple zones, possibly even across multiple regions. I am looking for more information about what happens when a local SSD fails; I couldn't find anything in the GCP documentation. - user2101712
If the local SSD fails, the instance fails. A new instance will be launched with blank SSDs, and all data stored on all SSDs will be lost. You will need to set up Stackdriver monitoring and alerting to be notified (see the sketch after these comments). The drive is not replaced on the same VM instance. - John Hanley
@JohnHanley Thanks. Is this behavior documented anywhere by Google? While I do not doubt your knowledge on this topic, a link to an official document would be much appreciated! - user2101712
There is no document that I am aware of. I am a Google GDE, and my comment comes from personal experience and knowledge. I would have posted an answer if I had an authoritative reference to link to. - John Hanley
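
For reference, a minimal sketch of the kind of detection described above, assuming the google-cloud-logging Python client and a hypothetical project ID. The filter targets compute.instances.hostError entries in the system event audit log; verify the exact log name and method name against your own project's logs before relying on this:

    from google.cloud import logging

    # Hypothetical project ID -- replace with your own.
    client = logging.Client(project="my-project")

    # Assumption: host errors surface as compute.instances.hostError
    # entries in the system event audit log. Check your own project's
    # system event logs to confirm the exact filter.
    log_filter = (
        'resource.type="gce_instance" AND '
        'logName="projects/my-project/logs/'
        'cloudaudit.googleapis.com%2Fsystem_event" AND '
        'protoPayload.methodName="compute.instances.hostError"'
    )

    for entry in client.list_entries(filter_=log_filter):
        # Each entry identifies the affected instance and the event time,
        # which is enough to drive an email or pager notification.
        print(entry.timestamp, entry.resource.labels.get("instance_id"))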

3 Answers

2 votes

You might want to use persistent disks instead, because local SSDs may not fit your use case.

As the Adding Local SSDs documentation reads:

Local SSDs are suitable only for temporary storage such as caches, processing space, or low value data. If you store important data in a local SSD device, you must also store that same data in a durable storage option.
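
If you do keep important data on a local SSD, here is a minimal sketch of the "also store that same data in a durable storage option" part, using the google-cloud-storage Python client with a hypothetical bucket name and a hypothetical file path on the local SSD mount:

    from google.cloud import storage

    # Hypothetical names -- replace with your own bucket and paths.
    client = storage.Client()
    bucket = client.bucket("my-durable-backup-bucket")

    # Copy a file from the local SSD mount to Cloud Storage, so a host
    # error that wipes the local SSD does not take the only copy with it.
    blob = bucket.blob("backups/data.db")
    blob.upload_from_filename("/mnt/disks/local-ssd/data.db")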

1 vote
  1. Yes
  2. It depends. Block-level failures are just that and are passed directly through to the guest, so you might see read errors in your dmesg or similar. If an entire device fails, you get a hostError in your Cloud Logging logs for the instance. What happens next depends on your maintenance policy (see the sketch below).
  3. Drives are not replaced from the user's point of view; you can only get a new instance. (Of course, Google internally replaces broken hardware, but this is not exposed to the customer.)

Points 4 and 5 are a bit tricky to answer: when an automatic restart after a hostError happens, you have a 60-minute recovery timeout. In practice, this can mean your instance spends 60 minutes in a RUNNING but not booted state while Compute Engine tries to get a broken local SSD back, only to eventually fail and boot up with blank local SSDs.
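
To see which maintenance policy applies to your instance, a sketch using the google-cloud-compute Python client, with hypothetical project, zone, and instance names:

    from google.cloud import compute_v1

    # Hypothetical identifiers -- replace with your own.
    project, zone, instance_name = "my-project", "us-central1-a", "my-vm"

    client = compute_v1.InstancesClient()
    instance = client.get(project=project, zone=zone, instance=instance_name)

    # on_host_maintenance and automatic_restart together determine what
    # Compute Engine does with the VM after a host error.
    print(instance.scheduling.on_host_maintenance)  # e.g. "MIGRATE" or "TERMINATE"
    print(instance.scheduling.automatic_restart)    # True => restarted automatically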

Overall, I would recommend treating the instance as the failure domain rather than the individual disks, as any sort of issue is likely to lead to a hostError for the whole instance rather than a partial failure.

0 votes

I'd like to clarify #5.

If your VM experiences a host error, the Google documentation states:

If the host system experiences a host error, Compute Engine makes a best effort to reconnect to the VM and preserve the local SSD data, but might not succeed. If the attempt is successful, the VM restarts automatically. However, if the attempt to reconnect fails, the VM restarts without the data.

This means you aren't guaranteed to get your data back. That isn't fun, so plan accordingly and store your data in more reliable solutions such as persistent disks or Cloud Storage buckets.
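
One simple way to act on this at boot time, sketched with a hypothetical marker file on a hypothetical local SSD mount point: write the marker once the disk holds good data, and on every boot check whether it survived. If it is gone, the VM restarted without the data and re-replication from another node should be triggered:

    import os
    import sys

    # Hypothetical mount point and marker path -- adjust to your setup.
    MARKER = "/mnt/disks/local-ssd/.data-intact"

    def local_ssd_survived() -> bool:
        # The marker is written after the node is fully replicated.
        # If a host error wiped the local SSD, the marker is gone too.
        return os.path.exists(MARKER)

    if __name__ == "__main__":
        if local_ssd_survived():
            print("Local SSD data survived the restart.")
        else:
            print("Local SSD came back blank; trigger re-replication.")
            sys.exit(1)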