Understanding fail over and fail back in disaster recovery

Handinata Tanudjaja 230 Reputation points
2025-04-29T21:31:11.1566667+00:00

Hi everyone,

I would like to make sure I have a proper understanding of disaster recovery process.
The following are my understanding:

  1. When Microsoft declares a disaster in the primary region, the fail over from primary to the secondary region is initiated.
    This will make the secondary becomes the new primary and the original primary becomes new secondary.
    Fail over cannot be initiated before Microsoft declares the region as disaster unless Cross Region Restore is enabled.
  2. It's possible to initiate a fail back from the new primary to the new secondary, which will make the new secondary becomes primary again.
    However, it's important to check Last Sync Time value before failing back in order to avoid major data loss.

Am I capturing the high level points of fail over and fail back in disaster recovery?
Thank you

Azure Site Recovery
Azure Site Recovery
An Azure native disaster recovery service. Previously known as Microsoft Azure Hyper-V Recovery Manager.
793 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Ashok Gandhi Kotnana 6,355 Reputation points Microsoft External Staff
    2025-04-30T03:25:13.8866667+00:00

    Hi @Handinata Tanudjaja

    Thanks for the response back

    Services like Azure Storage (with GRS) and Azure SQL (with Auto-failover groups) support geo-replication to a paired region. However, Microsoft controls the failover process and will only promote the secondary region to primary if the source region is officially declared a disaster. Until then, manual failover is not possible for these platform-managed services.

     For Virtual Machines (VMs) protected with Azure Site Recovery (ASR), failover can be initiated at any time — without needing Microsoft to declare a disaster.

     You have full control over the failover process, which includes:

     Test Failover: Simulates failover to validate your recovery plans.

      Planned Failover: Used for maintenance or pre-scheduled downtime.

    Unplanned Failover: Triggered during unexpected outages.

    The replication policy plays a key role in defining the Recovery Point Objective (RPO) and governs the types of recovery points:

    1.Crash-Consistent Recovery Points: Captures the on-disk state at the time of the snapshot — like pulling the power plug on a server. These do not include data in memory or in-flight transactions.

     2.Application-Consistent Recovery Points: Include all the data in a crash-consistent snapshot, plus in-memory data, in-flight transactions, and application state. These are created using VSS snapshots (Windows) or custom scripts (Linux).

     3.By default, ASR creates crash-consistent snapshots every 5 minutes. App-consistent snapshots are created less frequently, based on OS and workload capability.

     Reprotection and Failback:

     1.Commit Failover After validating that the failovered VM is functioning correctly, commit the failover. This designates the VM as the new primary, and the original primary is deactivated.

     2.Reprotection The replication direction is reversed, and the failovered VM begins replicating back to the original region using the same replication policy.

     3.Failback Once the original region is operational, you can initiate failback. The Recovery Services Vault automatically synchronizes all delta changes between the fail overed VM and the original source VM.

     Note: Azure Site Recovery uses disk-level replication, ensuring disk consistency throughout reprotection and failback.

     Please refer to the following link for detailed information on failover, reprotection, and failback processes in Azure Site Recovery.

    Reference:
    https://learn.microsoft.com/en-us/azure/site-recovery/azure-to-azure-tutorial-failback

    Hope I have answered your questions!


    If you found this informative, please consider accepting an answer and upvote it as a token of appreciation. And don't forget to give it a thumbs up 👍 if it was helpful.


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.