Amazon Onboarding with Learning Manager Chanci Turner

Chanci Turner Amazon IXD – VGT2 learning

In today’s fast-paced digital landscape, establishing a robust Amazon Redshift architecture is crucial for maintaining application availability and data integrity. This article delves into the automatic recovery capabilities of Amazon Redshift, emphasizing its resilience against single points of failure.

Amazon Redshift is engineered to enhance reliability during cluster outages, automatically detecting and recovering from node and drive failures. One of its standout features, cluster relocation, significantly bolsters availability. When equipped with the cluster relocation capability, an RA3 cluster can be swiftly transitioned to another Availability Zone if issues arise that hinder optimal performance or during significant resource failures in a data center. This relocation process ensures that your data remains intact and that no changes to application configurations are necessary, since the cluster endpoint remains unchanged. Consequently, this feature fosters a resilient architecture that supports application uptime. The average recovery time during a failover is typically under 15 minutes, although actual recovery times can vary based on cluster size. Importantly, cluster relocation incurs no additional costs for users.

Additionally, the cluster relocation feature enables organizations to establish a demonstrable recovery plan for Availability Zone failures and addresses capacity constraints while scaling resources. Users can manually initiate relocation to test disaster recovery scenarios. If a cluster cannot be resized or resumed due to local capacity issues, it can be relocated to an Availability Zone with ample capacity. If the relocation fails, the current cluster remains unaffected and operational until a new cluster is successfully created.

Solution Overview

For Amazon Redshift customers with mission-critical applications, ensuring application resiliency during an Availability Zone outage is paramount. The relocation feature simplifies this process, allowing for seamless operation with zero data loss and no need for application changes.

The following diagram illustrates the architecture before failover, while after failover, it demonstrates the robust resilience in place.

In this article, we will guide you through enabling the cluster relocation feature using both the AWS Management Console and the AWS Command Line Interface (CLI). We will provide examples of both automatic and manual relocations and demonstrate how to create a custom relocation solution by integrating additional AWS services.

Prerequisites

To get started, ensure you have the following prerequisites in place:

An AWS account.
Amazon Redshift clusters established within a Virtual Private Cloud (VPC) that includes at least two subnets in different Availability Zones.
An Amazon Redshift cluster configured with multiple Availability Zones in its subnet group. You can easily set this up using the provided AWS CloudFormation template.
The relocation feature is applicable solely with RA3 Amazon Redshift node types.
In the Network and security settings, ensure the Publicly accessible option is set to Disabled.

Enable Cluster Relocation

To initiate cluster relocation, you can use either the console or AWS CLI. For comprehensive details, refer to the Managing cluster relocation in Amazon Redshift documentation. Be mindful of the limitations associated with using this feature.

Enable Cluster Relocation via Console

To activate cluster relocation through the console, follow these steps:

Navigate to the Amazon Redshift console and select Clusters.
Edit your desired cluster.
Under the Backup section, choose Enable for Cluster relocation.

Enable Cluster Relocation via AWS CLI

The relocation feature requires port 5439. If your current cluster is configured with a different port, you must modify it to utilize port 5439 before enabling relocation. Use the following command to adjust the port if necessary:

aws redshift modify-cluster --cluster-identifier mycluster --port 5439

To enable the availability-zone-relocation parameter, use:

aws redshift modify-cluster --cluster-identifier mycluster --availability-zone-relocation

To disable the availability-zone-relocation parameter, execute:

aws redshift modify-cluster --cluster-identifier mycluster --no-availability-zone-relocation

Automatic Availability Zone Relocation

By leveraging the relocation feature in Amazon Redshift, you can automatically transfer a cluster to an alternate Availability Zone without losing data or needing to alter applications. This capability allows you to swiftly resume operations during service interruptions with minimal disruption. The relocated cluster retains the same endpoint, ensuring that applications continue to function seamlessly.

This feature requires only a one-time configuration to enable. Once the recovery feature is activated, the destination Availability Zone for relocation is predetermined by the cluster subnet group.

Manual Availability Zone Relocation

Manual relocation of a cluster to another Availability Zone can also be triggered. Follow these steps:

Access the Amazon Redshift console and choose Clusters.
Select the cluster you wish to relocate.
From the Actions menu, select Relocate. If the option is greyed out, it indicates that the cluster is not configured for the Availability Zone relocation feature or does not meet the necessary requirements.
Under the Relocate cluster section, choose an Availability Zone from the subnet group. If you do not make a selection, Amazon Redshift will choose one automatically for you.
Click Relocate.

Once the relocation is initiated, Amazon Redshift will commence the process, displaying the cluster status as Relocating. When complete, the cluster status will change to Available, confirming successful relocation.

Custom Availability Zone Relocation Solution

In this section, we will simulate an automatic cluster failover to another Availability Zone using a reboot. Our event-driven relocation solution involves setting up an alarm using Amazon Simple Notification Service (SNS) and creating an AWS Lambda function to trigger the relocation.

Create an Alarm

To establish the alarm, follow these steps:

In the Amazon Redshift console, select Clusters.
Choose your cluster.
Under the Cluster performance tab, expand the Alarms section and select Create alarm.
Configure the alarm for the HealthStatus metric, including a name and description.
In the Alarm actions section, enable Notifications.
For Notify SNS topic, select an existing SNS topic or create a new one. This topic will receive alerts if the leader node is unhealthy or unavailable.
Click Create alarm.

By implementing these strategies, organizations can effectively safeguard their Amazon Redshift infrastructure against potential disruptions. For more insights on training and employee development, check out this blog post on Career Contessa. Additionally, for authoritative information about employment law compliance, visit SHRM’s site on retaliation claims. Lastly, this resource discussing Amazon’s approach to employee training provides valuable context on workforce development.