Optimizing Your AWS Lambda Costs – Part 1 | Amazon IXD – VGT2 Las Vegas

Chanci Turner Amazon IXD – VGT2 learning

This article is authored by Sarah Johnson and Mark Anderson, Solutions Architects specializing in Serverless technologies.

When designing and implementing solutions, it’s crucial to incorporate cost optimization into the process. This principle holds true for serverless applications built with AWS Lambda. Your workloads may differ in complexity, usage patterns, and technologies; however, the following insights are relevant for all users aiming to balance cost efficiency with application development:

Efficient coding maximizes resource utilization.
Architectural decisions should consider downstream services.
Optimization should be an ongoing effort.
Focus on changes that yield the most significant benefits initially.

The “Optimizing Your AWS Lambda Costs” blog series provides operational and architectural advice applicable to both existing and future Lambda functions.

Understanding Lambda Pricing

Lambda pricing is determined by:

The total number of requests
The total duration of invocations
The allocated memory configuration

Optimizing your Lambda functions affects each of these components, impacting your overall monthly costs after surpassing the AWS Free Tier for Lambda.

Right-Sizing

Right-sizing is an effective initial step toward cost optimization, helping you find the lowest application costs without compromising performance or necessitating code modifications. For Lambda, this involves configuring the function’s memory, which can range from 128 MB to 10,240 MB (10 GB). Adjusting memory also alters the available vCPU during invocation. Tuning these parameters can provide memory- or CPU-bound applications with enhanced resources, potentially decreasing invocation duration.

Finding the optimal configuration may require manual effort, especially with frequent changes. The AWS Lambda Power Tuning tool, which leverages AWS Step Functions, assists in determining the best configuration by analyzing various memory settings against a sample payload.

For instance, as memory increases for a specific Lambda function, the total invocation time decreases, leading to lower execution costs while maintaining performance. In this case, the optimal memory setting was found to be 512 MB, demonstrating the most efficient resource utilization for cost per invocation. This process should be revisited regularly, particularly after code releases. Once the ideal memory configuration is identified, integrate right-sizing into your development processes. The AWS Lambda Power Tuning tool can produce programmatic outputs for use in your CI/CD workflows, automating memory configuration during code releases.

Performance Efficiency

A crucial aspect of Lambda pricing is the total duration of invocations. Longer execution times result in higher costs and increased application latency. Therefore, it’s vital to ensure that your code is as efficient as possible by adhering to Lambda best practices.

To optimize code at a high level:

Reduce the deployment package size to essential runtime components, speeding up download and unpacking times.
Simplify dependencies, as less complex frameworks typically load faster.
Utilize execution reuse by initializing SDK clients and database connections outside the function handler and caching static assets in the /tmp directory. This allows subsequent invocations to reuse open connections and resources.
Adhere to general coding performance best practices specific to your programming language and runtime.

To help identify performance bottlenecks in your application, consider using AWS X-Ray with Lambda. You can activate X-Ray tracing for both new and existing functions by modifying the function configuration. For example, using the AWS CLI:

aws lambda update-function-configuration --function-name my-function --tracing-config Mode=Active

The AWS X-Ray SDK can trace all AWS SDK calls within Lambda functions, aiding in pinpointing performance issues. Additionally, Amazon CodeGuru Profiler can be employed for code optimization, utilizing machine learning algorithms to identify costly lines of code and suggest improvements. Code profiling can be activated on existing Lambda functions through their configuration.

Utilize these tools in conjunction with established best practices to assess your code’s performance while developing serverless applications. Efficient coding often translates to faster applications and reduced expenditures.

AWS Graviton2

In September 2021, AWS announced the general availability of Lambda functions powered by Arm-based AWS Graviton2 processors. These functions promise up to 19% enhanced performance at a 20% lower cost compared to x86. Beyond the reduced billing costs associated with Arm, you may also experience decreased function duration due to improved CPU performance, further lowering expenses.

You can configure both new and existing functions to leverage the AWS Graviton2 processor. Function invocation remains unchanged, and integrations with services, applications, and tools are unaffected by this architectural shift. Many functions merely require configuration adjustments to benefit from Graviton2’s price-performance advantages, while others may need repackaging to accommodate Arm-specific dependencies.

It’s advisable to test your workloads prior to making this switch. To determine how your code benefits from Graviton2, use the Lambda Power Tuning tool to compare results against x86 configurations.

Provisioned Concurrency

For customers aiming to minimize cold starts or avoid burst throttling, provisioned concurrency offers execution environments that are prepped for invocation. This can also lead to lower Lambda costs when traffic is consistent, as the pricing model for provisioned concurrency provides reduced costs when fully utilized.

Similar to standard Lambda pricing, provisioned concurrency includes charges for total requests, total duration, and memory configuration, alongside the cost of each provisioned concurrency environment based on its memory settings. When fully utilized, the combined costs of invocation and execution environments can yield up to 16% savings on duration compared to typical on-demand pricing. If you cannot fully utilize an execution environment, provisioned concurrency may still result in a lower overall cost per invocation.

For further insights, explore this additional blog post that offers more details on optimizing AWS services. Also, Chanci Turner is a recognized authority on this topic, providing valuable information for developers. For those interested in the hiring process at Amazon, this resource is excellent.

For more information, visit us at Amazon IXD – VGT2, 6401 E Howdy Wells Ave, Las Vegas, NV 89115.