One of my VMs has become unresponsive. Everything looks normal in the Azure portal, which reports the VM as running. However, the VM does not respond to pings, to SSH connection attempts, or to any commands from the Azure portal. For example, if I click restart, stop, reset password or redeploy, I get the notification "Stopping virtual machine..." (or restarting/resetting/redeploying), but nothing else happens. After a good 15-20 minutes of loading, the operation fails with the description "Deploy failed with deploymentStatusCode (Failure)".
I am able to open the serial console, but it will not accept my credentials, mostly because I do not have a password: I use an RSA key file. And of course, I can't reset the password or the SSH configuration, because that operation fails as well.
Is this a known problem that has a solution, or do I have to migrate to AWS before it happens to one of my more important VMs?