Редактиране

Споделяне чрез


Back up and recover Oracle Database on an Azure Linux VM by using Azure Backup

Applies to: ✔️ Linux VMs

This article demonstrates the use of Azure Backup to take disk snapshots of virtual machine (VM) disks, which include the Oracle Database files and the Oracle fast recovery area. By using Azure Backup, you can take full disk snapshots that are suitable as backups and are stored in a Recovery Services vault.

Azure Backup also provides application-consistent backups, which ensure that more fixes aren't required to restore the data. Application-consistent backups work with both file system and Oracle Automatic Storage Management (ASM) databases.

Restoring application-consistent data reduces restoration time, so you can quickly return to a running state. Oracle Database recovery is still necessary after restore. You facilitate the recovery by using Oracle archived redo log files that are captured and stored in a separate Azure file share.

This article walks you through the following tasks:

  • Back up the database with application-consistent backup.
  • Restore and recover the database from a recovery point.
  • Restore the VM from a recovery point.

Prerequisites

  • To perform the backup and recovery process, you must first create a Linux VM that has an installed instance of Oracle Database 12.1 or later.

  • Create an Oracle Database instance by following the steps in Create an Oracle Database instance in an Azure VM.

Prepare the environment

To prepare the environment, complete these steps:

  1. Connect to the VM.
  2. Set up Azure Files storage.
  3. Prepare the databases.

Connect to the VM

  1. To create a Secure Shell (SSH) session with the VM, use the following command. Replace <publicIpAddress> with the public address value for your VM.

    ssh azureuser@<publicIpAddress>
    
  2. Switch to the root user:

    sudo su -
    
  3. Add the oracle user to the /etc/sudoers file:

    echo "oracle   ALL=(ALL)      NOPASSWD: ALL" >> /etc/sudoers
    

Set up Azure Files storage for the Oracle archived redo log files

The Oracle Database instance's archived redo log files play a crucial role in database recovery. They store the committed transactions needed to roll forward from a database snapshot taken in the past.

When the database is in ARCHIVELOG mode, it archives the contents of online redo log files when they become full and switch. Together with a backup, they're required to achieve point-in-time recovery when the database is lost.

Oracle provides the capability to archive redo log files to different locations. The industry best practice is that at least one of those destinations should be on remote storage, so it's separate from the host storage and protected with independent snapshots. Azure Files meets those requirements.

An Azure file share is storage that you an attach to a Linux or Windows VM as a regular file-system component, by using the Server Message Block (SMB) or Network File System (NFS) protocol. To set up an Azure file share on Linux (by using the SMB 3.0 protocol) for use as archive log storage, see Mount an SMB Azure file share on Linux. When you complete the setup, return to this guide and complete all remaining steps.

Prepare the databases

This part of the process assumes that you followed Create an Oracle Database instance in an Azure VM. As a result:

  • You have an Oracle instance named oratest1 that's running on a VM named vmoracle19c.
  • You're using the standard Oracle oraenv script with its dependency on the standard Oracle configuration file /etc/oratab to set up environment variables in a shell session.

Perform the following steps for each database on the VM:

  1. Switch to the oracle user:

     sudo su - oracle
    
  2. Set the environment variable ORACLE_SID by running the oraenv script. It prompts you to enter the ORACLE_SID name.

    . oraenv
    
  3. Add the Azure file share as another destination for database archive log files.

    This step assumes that you configured and mounted an Azure file share on the Linux VM. For each database installed on the VM, make a subdirectory named after your database security identifier (SID).

    In this example, the mount point name is /backup and the SID is oratest1. So you create the subdirectory /backup/oratest1 and change ownership to the oracle user. Substitute /backup/SID for your mount point name and database SID.

    sudo mkdir /backup/oratest1
    sudo chown oracle:oinstall /backup/oratest1
    
  4. Connect to the database:

    sqlplus / as sysdba
    
  5. Start the database if it's not already running:

    SQL> startup
    
  6. Set the first archive log destination of the database to the file-share directory that you created earlier:

    SQL> alter system set log_archive_dest_1='LOCATION=/backup/oratest1' scope=both;
    
  7. Define the recovery point objective (RPO) for the database.

    To achieve a consistent RPO, consider the frequency at which the online redo log files get archived. These factors control the frequency:

    • The size of the online redo log files. As an online log file becomes full, it gets switched and archived. The larger the online log file, the longer it takes to fill up. The added time decreases the frequency of archive generation.
    • The setting of the ARCHIVE_LAG_TARGET parameter controls the maximum number of seconds permitted before the current online log file must be switched and archived.

    To minimize the frequency of switching and archiving, along with the accompanying checkpoint operation, Oracle online redo log files generally have a large size (for example, 1,024M, 4,096M, or 8,192M). In a busy database environment, logs are still likely to switch and archive every few seconds or minutes. In a less active database, they might go hours or days before the most recent transactions are archived, which would dramatically decrease archival frequency.

    We recommend that you set ARCHIVE_LAG_TARGET to ensure a consistent RPO. A setting of 5 minutes (300 seconds) is a prudent value for ARCHIVE_LAG_TARGET. It ensures that any database recovery operation can recover to within 5 minutes of the time of failure.

    To set ARCHIVE_LAG_TARGET, run this command:

    SQL> alter system set archive_lag_target=300 scope=both;
    

    To better understand how to deploy highly available Oracle Database instances in Azure with zero RPO, see Reference architectures for Oracle Database.

  8. Make sure the database is in archive log mode to enable online backups.

    Check the log archive status first:

    SQL> SELECT log_mode FROM v$database;
    
    LOG_MODE
    ------------
    NOARCHIVELOG
    

    If it's in NOARCHIVELOG mode, run the following commands:

    SQL> SHUTDOWN IMMEDIATE;
    SQL> STARTUP MOUNT;
    SQL> ALTER DATABASE ARCHIVELOG;
    SQL> ALTER DATABASE OPEN;
    SQL> ALTER SYSTEM SWITCH LOGFILE;
    
  9. Create a table to test the backup and restore operations:

    SQL> create user scott identified by tiger quota 100M on users;
    SQL> grant create session, create table to scott;
    SQL> connect scott/tiger
    SQL> create table scott_table(col1 number, col2 varchar2(50));
    SQL> insert into scott_table VALUES(1,'Line 1');
    SQL> commit;
    SQL> quit
    

Back up your data by using Azure Backup

The Azure Backup service provides solutions to back up your data and recover it from the Microsoft Azure cloud. Azure Backup provides independent and isolated backups to guard against accidental destruction of original data. Backups are stored in a Recovery Services vault with built-in management of recovery points, so you can restore as needed.

In this section, you use Azure Backup to take application-consistent snapshots of your running VM and Oracle Database instances. The databases are placed into backup mode, which allows a transactionally consistent online backup to occur while Azure Backup takes a snapshot of the VM disks. The snapshot is a full copy of the storage and not an incremental or copy-on-write snapshot. It's an effective medium to restore your database from.

The advantage of using Azure Backup application-consistent snapshots is that they're fast to take, no matter how large your database is. You can use a snapshot for restore operations as soon as you take it, without having to wait for it to be transferred to the Recovery Services vault.

To use Azure Backup to back up the database, complete these steps:

  1. Understand the Azure Backup framework.
  2. Prepare the environment for an application-consistent backup.
  3. Set up application-consistent backups.
  4. Trigger an application-consistent backup of the VM.

Understand the Azure Backup framework

The Azure Backup service provides a framework to achieve application consistency during backups of Windows and Linux VMs for various applications. This framework involves invoking a prescript to quiesce the applications before taking a snapshot of disks. It calls a postscript to unfreeze the applications after the snapshot is completed.

Microsoft has enhanced the framework so that the Azure Backup service provides packaged prescripts and postscripts for selected applications. These prescripts and postscripts are already loaded on the Linux image, so there's nothing for you to install. You just name the application, and then Azure Backup automatically invokes the relevant scripts. Microsoft manages the packaged prescripts and postscripts, so you can be assured of the support, ownership, and validity of them.

Currently, the supported applications for the enhanced framework are Oracle 12.x or later and MySQL. For details, see Support matrix for managed Azure VM backups.

You can author your own scripts for Azure Backup to use with pre-12.x databases. Example scripts are available on GitHub.

Each time you do a backup, the enhanced framework runs the prescripts and postscripts on all Oracle Database instances installed on the VM. The configuration_path parameter in the workload.conf file points to the location of the Oracle /etc/oratab file (or a user-defined file that follows the oratab syntax). For details, see Set up application-consistent backups.

Azure Backup runs the prescripts and postscripts for each database listed in the file that configuration_path points to. Exceptions are lines that begin with # (treated as comment) or +ASM (an Oracle ASM instance).

The Azure Backup enhanced framework takes online backups of Oracle Database instances that operate in ARCHIVELOG mode. The prescripts and postscripts use the ALTER DATABASE BEGIN and END BACKUP commands to achieve application consistency.

For the database backup to be consistent, databases in NOARCHIVELOG mode must be shut down cleanly before the snapshot starts.

Prepare the environment for an application-consistent backup

Oracle Database employs job role separation to provide separation of duties by using least privilege. It associates separate operating system (OS) groups with separate database administrative roles. Users can then have different database privileges granted to them, depending on their membership in OS groups.

The SYSBACKUP database role (generic name OSBACKUPDBA) provides limited privileges to perform backup operations in the database. Azure Backup requires it.

During Oracle installation, we recommend that you use backupdba as the OS group name to associate with the SYSBACKUP role. But you can use any name, so you need to determine the name of the OS group that represents the Oracle SYSBACKUP role first.

  1. Switch to the oracle user:

    sudo su - oracle
    
  2. Set the Oracle environment:

    export ORACLE_SID=oratest1
    export ORAENV_ASK=NO
    . oraenv
    
  3. Determine the name of the OS group that represents the Oracle SYSBACKUP role:

    grep "define SS_BKP" $ORACLE_HOME/rdbms/lib/config.c
    

    The output looks similar to the following example:

    #define SS_BKP_GRP "backupdba"
    

    In the output, the value enclosed within double quotation marks is the name of the Linux OS group to which the Oracle SYSBACKUP role is externally authenticated. In this example, it's backupdba. Note down the actual value.

  4. Verify that the OS group exists by running the following command. Substitute <group name> with the value that the previous command returned (without the quotation marks).

    grep <group name> /etc/group
    

    The output looks similar to the following example:

    backupdba:x:54324:oracle
    

    Important

    If the output doesn't match the Oracle OS group value that you retrieved in step 3, use the following command to create the OS group that represents the Oracle SYSBACKUP role. Substitute <group name> with the group name that you retrieved in step 3.

    sudo groupadd <group name>
    
  5. Create a new backup user named azbackup that belongs to the OS group that you verified or created in the previous steps. Substitute <group name> with the name of the verified group. The user is also added to the oinstall group to enable it to open ASM disks.

    sudo useradd -g <group name> -G oinstall azbackup
    
  6. Set up external authentication for the new backup user.

    The backup user azbackup needs to be able to access the database by using external authentication, so it isn't challenged by a password. To enable this access, you must create a database user that authenticates externally through azbackup. The database uses a prefix for the user name, which you need to find.

    Perform the following steps for each database installed on the VM:

    1. Sign in to the database by using SQL Plus, and check the default settings for external authentication:

      sqlplus / as sysdba
      SQL> show parameter os_authent_prefix
      SQL> show parameter remote_os_authent
      

      The output should look like this example, which shows ops$ as the database username prefix:

      NAME                                 TYPE        VALUE
      ------------------------------------ ----------- ------------------------------
      os_authent_prefix                    string      ops$
      remote_os_authent                    boolean     FALSE
      
    2. Create a database user named ops$azbackup for external authentication to the azbackup user, and grant SYSBACKUP privileges:

      SQL> CREATE USER ops$azbackup IDENTIFIED EXTERNALLY;
      SQL> GRANT CREATE SESSION, ALTER SESSION, SYSBACKUP TO ops$azbackup;
      
  7. If you receive the error ORA-46953: The password file is not in the 12.2 format when you run the GRANT statement, follow these steps to migrate the orapwd file to 12.2 format. Perform these steps for every Oracle Database instance on the VM.

    1. Exit SQL Plus.

    2. Move the password file with the old format to a new name.

    3. Migrate the password file.

    4. Remove the old file.

    5. Run the following commands:

      mv $ORACLE_HOME/dbs/orapworatest1 $ORACLE_HOME/dbs/orapworatest1.tmp
      orapwd file=$ORACLE_HOME/dbs/orapworatest1 input_file=$ORACLE_HOME/dbs/orapworatest1.tmp
      rm $ORACLE_HOME/dbs/orapworatest1.tmp
      
    6. Rerun the GRANT operation in SQL Plus.

  8. Create a stored procedure to log backup messages to the database alert log. Use the following code for each database installed on the VM:

    sqlplus / as sysdba
    SQL> GRANT EXECUTE ON DBMS_SYSTEM TO SYSBACKUP;
    SQL> CREATE PROCEDURE sysbackup.azmessage(in_msg IN VARCHAR2)
    AS
      v_timestamp     VARCHAR2(32);
    BEGIN
      SELECT TO_CHAR(SYSDATE, 'YYYY-MM-DD HH24:MI:SS')
      INTO v_timestamp FROM DUAL;
      DBMS_OUTPUT.PUT_LINE(v_timestamp || ' - ' || in_msg);
      SYS.DBMS_SYSTEM.KSDWRT(SYS.DBMS_SYSTEM.ALERT_FILE, in_msg);
    END azmessage;
    /
    SQL> SHOW ERRORS
    SQL> QUIT
    

Set up application-consistent backups

  1. Switch to the root user:

    sudo su -
    
  2. Check for the /etc/azure folder. If it isn't present, create the working directory for the application-consistent backup:

    if [ ! -d "/etc/azure" ]; then
       mkdir /etc/azure
    fi
    
  3. Check for the workload.conf file within the folder. If it isn't present, create it in the /etc/azure directory and give it the following contents. The comments must begin with [workload]. If the file is already present, just edit the fields so that they match the following contents. Otherwise, the following command creates the file and populates the contents:

    echo "[workload]
    workload_name = oracle
    configuration_path = /etc/oratab
    timeout = 90
    linux_user = azbackup" > /etc/azure/workload.conf
    

    The workload.conf file uses the following format:

    • The workload_name parameter indicates the database workload type. In this case, setting the parameter to Oracle allows Azure Backup to run the correct prescripts and postscripts (consistency commands) for Oracle Database instances.
    • The timeout parameter indicates the maximum time, in seconds, that each database must complete storage snapshots.
    • The linux_user parameter indicates the Linux user account that Azure Backup uses to run database quiesce operations. You created this user, azbackup, previously.
    • The configuration_path parameter indicates the absolute path name for a text file on the VM. Each line lists a database instance running on the VM. This is typically the /etc/oratab file that Oracle generates during database installation, but it can be any file with any name that you choose. It must follow these format rules:
      • The file is a text file. Each field is delimited with the colon character (:).
      • The first field in each line is the name for an ORACLE_SID instance.
      • The second field in each line is the absolute path name for ORACLE_HOME for that ORACLE_SID instance.
      • All text after the first two fields is ignored.
      • If the line starts with a pound sign (#), the entire line is ignored as a comment.
      • If the first field has the value +ASM, denoting an Oracle ASM instance, it's ignored.

Trigger an application-consistent backup of the VM

  1. In the Azure portal, go to your rg-oracle resource group and select your vmoracle19c virtual machine.

  2. On the Backup pane:

    1. Under Recovery Services vault, select Create new.
    2. For the name of the vault, use myVault.
    3. For Resource group, select rg-oracle.
    4. For Choose backup policy, use (new) DailyPolicy. If you want to change the backup frequency or retention range, select Create a new policy instead.

    Screenshot that shows the pane for configuring a new Recovery Services vault.

  3. Select Enable Backup.

    The backup process doesn't start until the scheduled time expires. To set up an immediate backup, complete the next step.

  4. From the resource group pane, select your newly created Recovery Services vault named myVault. You might need to refresh the page to see it.

  5. On the myVault - Backup items pane, under BACKUP ITEM COUNT, select the backup item count.

    Screenshot that shows details for a Recovery Services vault.

  6. On the Backup Items (Azure Virtual Machine) pane, select the ellipsis (...) button, and then select Backup now.

    Screenshot that shows the command for backing up Recovery Services vaults now.

  7. Accept the default Retain Backup Till value, and then select OK. Wait for the backup process to finish.

  8. To view the status of the backup job, select Backup Jobs.

    Screenshot that shows the pane that lists backup jobs for a Recovery Services vault.

    Select the backup job to see details about its status.

    Screenshot that shows detailed status information for a backup job.

    Although it takes seconds to execute the snapshot, it can take more time to transfer it to the vault. The backup job isn't completed until the transfer is finished.

  9. For an application-consistent backup, address any errors in the log file at /var/log/azure/Microsoft.Azure.RecoveryServices.VMSnapshotLinux/extension.log.

Restore the VM

Restoring an entire VM means that you restore the VM and its attached disks to a new VM from a selected restore point. This action also restores all databases that run on the VM. Afterward, you need to recover each database.

To restore an entire VM, complete these steps:

  1. Stop and delete the VM.
  2. Recover the VM.
  3. Set the public IP address.
  4. Recover the database.

There are two main choices when you're restoring a VM:

  • Restore the VM from which the backups were originally taken.
  • Restore (clone) a new VM without affecting the VM from which the backups were originally taken.

The first steps in this exercise (stopping, deleting, and then recovering the VM) simulate the first use case.

Stop and delete the VM

  1. In the Azure portal, go to the vmoracle19c virtual machine, and then select Stop.

  2. When the virtual machine is no longer running, select Delete and then Yes.

    Screenshot that shows the confirmation message for deleting a virtual machine.

Recover the VM

  1. Create a storage account for staging in the Azure portal:

    1. In the Azure portal, select + Create a resource, and then search for and select Storage account.

      Screenshot that shows where to create a resource.

    2. On the Create storage account pane:

      1. For Resource group, select your existing resource group, rg-oracle.
      2. For Storage account name, enter oracrestore.
      3. Ensure that Location is set to the same region as all your other resources in the resource group.
      4. Set Performance to Standard.
      5. For Account kind, select StorageV2 (general purpose v2).
      6. For Replication, select Locally-redundant storage (LRS).

      Screenshot that shows basic information for creating a storage account.

    3. Select Review + Create, and then select Create.

  2. In the Azure portal, search for the myVault Recovery Services vault and select it.

    Screenshot that shows selecting a Recovery Services vault.

  3. On the Overview pane, select Backup items. Then select Azure Virtual Machine, which should have a nonzero number for BACKUP ITEM COUNT.

    Screenshot that shows selections for a virtual machine backup item for a Recovery Services vault.

  4. On the Backups Items (Azure Virtual Machine) pane, select the vmoracle19c VM.

    Screenshot that shows the pane for virtual machine backup items.

  5. On the vmoracle19c pane, choose a restore point that has a consistency type of Application Consistent. Select the ellipsis (...), and then select Restore VM.

    Screenshot that shows the command for restoring a VM.

  6. On the Restore Virtual Machine pane:

    1. Select Create New.

    2. For Restore Type, select Create new virtual machine.

    3. For Virtual machine name, enter vmoracle19c.

    4. For Virtual network, select vmoracle19cVNET.

      The subnet is automatically populated based on your selection for the virtual network.

    5. For Staging Location, the process of restoring a VM requires an Azure storage account in the same resource group and region. You can choose a storage account or a restore task that you set up earlier.

    Screenshot that shows values for restoring a virtual machine.

  7. To restore the VM, select the Restore button.

  8. To view the status of the restore process, select Jobs, and then select Backup Jobs.

    Screenshot that shows a list of backup jobs.

    Select the In Progress restore operation to show details about the status of the restore process.

    Screenshot that shows detailed status information about a restore process.

Set the public IP address

After the VM is restored, you should reassign the original IP address to the new VM.

  1. In the Azure portal, go to your virtual machine named vmoracle19c. It gets assigned a new public IP and NIC similar to vmoracle19c-nic-XXXXXXXXXXXX, but it doesn't have a DNS address. When the original VM was deleted, its public IP and NIC were retained. The next steps reattach them to the new VM.

    Screenshot that shows a list of public IP addresses.

  2. Stop the VM.

    Screenshot that shows selections for stopping a virtual machine.

  3. Go to Networking.

    Screenshot that shows networking information.

  4. Select Attach network interface. Select the original NIC vmoracle19cVMNic, which the original public IP address is still associated with. Then select OK.

    Screenshot that shows selecting resource type and NIC values.

  5. Detach the NIC that you created with the VM restore operation, because it's configured as the primary interface. Select Detach network interface, select the NIC that's similar to vmoracle19c-nic-XXXXXXXXXXXX, and then select OK.

    Screenshot that shows the pane for detaching a network interface.

    Your re-created VM now has the original NIC, which is associated with the original IP address and network security group rules.

    Screenshot that shows an IP address value.

  6. Go back to the Overview pane and select Start.

Recover the database

To recover a database after a complete VM restore:

  1. Reconnect to the VM:

    ssh azureuser@<publicIpAddress>
    
    

    When the whole VM gets restored, it's important to recover the databases on the VM by performing the following steps on each database.

  2. You might find that the instance is running, because the autostart attempted to start the database on VM startup. However, the database requires recovery and is likely to solely be at the mount stage. Run a preparatory shutdown before starting the mount stage:

    sudo su - oracle
    sqlplus / as sysdba
    SQL> shutdown immediate
    SQL> startup mount
    
  3. Perform database recovery.

    It's important to specify the USING BACKUP CONTROLFILE syntax to inform the RECOVER AUTOMATIC DATABASE command that recovery shouldn't stop at the Oracle system change number (SCN) recorded in the restored database control file.

    The restored database control file was a snapshot, along with the rest of the database. The SCN stored within it is from the point in time of the snapshot. There might be transactions recorded after this point, and you want to recover to the point of the last transaction committed to the database.

    SQL> recover automatic database using backup controlfile until cancel;
    
  4. When the last available archive log file gets applied, enter CANCEL to end recovery.

    When recovery finishes successfully, the message Media recovery complete appears.

    However, when you're using the BACKUP CONTROLFILE clause, the recover command ignores online log files. It's possible that changes in the current online redo log are required to complete point-in-time recovery. In this situation, you might see messages similar to these examples:

    SQL> recover automatic database until cancel using backup controlfile;
    ORA-00279: change 2172930 generated at 04/08/2021 12:27:06 needed for thread 1
    ORA-00289: suggestion :
    /u02/fast_recovery_area/ORATEST1/archivelog/2021_04_08/o1_mf_1_13_%u_.arc
    ORA-00280: change 2172930 for thread 1 is in sequence #13
    ORA-00278: log file
    '/u02/fast_recovery_area/ORATEST1/archivelog/2021_04_08/o1_mf_1_13_%u_.arc' no
    longer needed for this recovery
    ORA-00308: cannot open archived log
    '/u02/fast_recovery_area/ORATEST1/archivelog/2021_04_08/o1_mf_1_13_%u_.arc'
    ORA-27037: unable to obtain file status
    Linux-x86_64 Error: 2: No such file or directory
    Additional information: 7
    
    Specify log: {<RET>=suggested | filename | AUTO | CANCEL}
    

    Important

    If the current online redo log is lost or corrupted and you can't use it, you might cancel recovery at this point.

    To correct this situation, you can identify which online log wasn't archived and supply the fully qualified file name to the prompt.

  5. Open the database.

    The RESETLOGS option is required when the RECOVER command uses the USING BACKUP CONTROLFILE option. RESETLOGS creates a new incarnation of the database by resetting the history back to the beginning, because there's no way to determine how much of the previous database incarnation was skipped in the recovery.

    SQL> alter database open resetlogs;
    
  6. Check that the database content was recovered:

    SQL> select * from scott.scott_table;
    

The backup and recovery of Oracle Database on an Azure Linux VM are now finished.

You can find more information about Oracle commands and concepts in the Oracle documentation, including:

Delete the VM

When you no longer need the VM, you can use the following commands to remove the resource group, the VM, and all related resources:

  1. Disable soft delete of backups in the vault:

    az backup vault backup-properties set --name myVault --resource-group rg-oracle --soft-delete-feature-state disable
    
  2. Stop protection for the VM and delete backups:

    az backup protection disable --resource-group rg-oracle --vault-name myVault --container-name vmoracle19c --item-name vmoracle19c --delete-backup-data true --yes
    
  3. Remove the resource group, including all resources:

    az group delete --name rg-oracle
    

Next steps

Create highly available VMs

Explore Azure CLI samples for VM deployment