Speed Up AWS Lambda using GraalVM

Introduction

Building Lambdas in AWS is one of the central aspects for building serverless systems in AWS. However, as AWS Lambda removes a lot of problems for developers when building systems, it also introduces a number of new problems developers have never had to deal with before.

The Number One Complaint developers have when building AWS Lambdas with Java are "cold starts".

A cold start occurs the very first time a AWS Lambda is asked to handle a request. Now, depending on the size of your Lambda function, it could take 10 seconds or more just for the Java process to start. For some applications, this may be an acceptable trade-off for the benefits AWS Lambda brings, but for most serverless applications, or for Lambdas that handle a large number of requests, this is unacceptable, and often makes developers abandon AWS Lambda and/or Java and go back to the technologies they are most familiar with.

AWS has tried to address cold starts by introducing features like Provisioned Concurrency. However, this defeats the goal of "serverless" computing because you are no longer just paying for requests, you are also reserving compute capacity, and when you exceed this capacity you will incur the same cold starts you would have had otherwise.

The only way to fix cold starts is to get Java to start faster. Luckily, Oracle has created a new project called GraalVM. GraalVM is a new Java VM that can be used to improve the performance and to reduce the startup time of applications.

In this tutorial, we will be creating a simple Lambda function that writes a file to an S3 bucket. Actually, we will create that Lambda twice, once using Java without GraalVM, and again with GraalVM, to be able to compare performance.

What You Need for this Tutorial

About 30 minutes
A favorite text editor or IDE
AWS CLI
AWS SAM CLI
Docker Desktop
JDK 11 or later
Gradle 4 or later
Access to an AWS account with access to deploy SAM CLI

Note: The full Source Code for this Tutorial is available on GitHub

Step 1: Create a New Project using SAM CLI for a our Java and GraalVM Lambdas

AWS SAM CLI is a command line tool that makes it easy to create and deploy serverless applications. We are going to use the SAM CLI to first create a AWS Lambda function in Java, and then we will convert that Lambda function to use GraalVM. The last step will be to compare the performance of these two Lambda functions.

To create the project, run the following command in a terminal window, under a new directory (e.g., ./graalvm-tutorial):

sam init

Answer the questions as follows:

Which template source would you like to use?
      1 - AWS Quick Start Templates
      2 - Custom Template Location
    Choice: 1

Which runtime would you like to use?
    1 - nodejs12.x
    2 - python3.8
    3 - ruby2.7
    4 - go1.x
    5 - java11
    6 - dotnetcore3.1
    7 - nodejs10.x
    8 - python3.7
    9 - python3.6
    10 - python2.7
    11 - ruby2.5
    12 - java8
    13 - dotnetcore2.1
  Runtime: 5

Which dependency manager would you like to use?
    1 - maven
    2 - gradle
  Dependency manager: 2

Project name [sam-app]: graalvm-s3

Cloning app templates from https://github.com/awslabs/aws-sam-cli-app-templates.git
  AWS quick start application templates:
    1 - Hello World Example: Gradle
    2 - EventBridge Hello World: Gradle
    3 - EventBridge App from scratch (100+ Event Schemas): Gradle
    4 - Step Functions Sample App (Stock Trader): Gradle
  Template selection: 1

This should lead to the following output:

-----------------------
    Generating application:
    -----------------------
    Name: graalvm-s3
    Runtime: java11
    Dependency Manager: gradle
    Application Template: hello-world
    Output Directory: .
    
    Next steps can be found in the README file at ./graalvm-s3/README.md

You have now created a SAM project with a single Lambda function. In your current directory you should see a subdirectory called "graalvm-s3" that will contain the project generated by the SAM CLI.

Step 2: Create the Java Lambda (without GraalVM)

In the ./graalvm-s3 directory, you will see a HelloWorldFunction subdirectory, which is the default Lambda function that was generated from SAM. We are now going to create our first Lambda function using Java, without GraalVM.

The first function will be called S3Java, so our first step is to take over the sample function by renaming the HelloWorldFunction directory to S3Java:

mv HelloWorldFunction S3Java

We also need to update the file template.yaml to use the new Lambda name and to add instructions for both creating the S3 bucket we will be writing our test file into, and for setting the Lambda function permissions to allow read/write for this new bucket:

AWSTemplateFormatVersion: '2010-09-09'
Transform: AWS::Serverless-2016-10-31
Description: >
  graalvm-s3

  Sample SAM Template for graalvm-s3

# More info about Globals: https://github.com/awslabs/serverless-application-model/blob/master/docs/globals.rst
Globals:
  Function:
    Timeout: 20

Resources:
  S3Bucket:
    Type: AWS::S3::Bucket

  S3Java:
    Type: AWS::Serverless::Function
    Properties:
      CodeUri: S3Java
      Handler: helloworld.S3Java::handleRequest
      Runtime: java11
      MemorySize: 512
      Policies:
      - Statement:
        - Sid: S3Access
          Effect: Allow
          Action:
          - s3:GetObject
          - s3:PutObject
          Resource: !Sub 'arn:aws:s3:::${S3Bucket}/*'
      Environment:
        Variables:
          S3Bucket: !Ref S3Bucket       

Outputs:
  S3Bucket:
    Description: "S3Bucket"
    Value: !GetAtt S3Bucket.Arn
  S3Java:
    Description: "S3Java Lambda Function ARN"
    Value: !GetAtt S3Java.Arn

The Lambda function created by SAM comes with a few classes we will not need. You can delete the following files:

S3Java/src/main/java/helloworld/App.java
S3Java/src/main/java/helloworld/GatewayResponse.java
S3Java/src/test/java/helloworld/AppTest.java

We will create the new Lambda function in S3Java/src/main/java/helloworld/S3Java.java:

package helloworld;

import java.text.MessageFormat;
import java.util.UUID;
import com.amazonaws.services.lambda.runtime.Context;
import com.amazonaws.services.lambda.runtime.RequestHandler;
import software.amazon.awssdk.core.sync.RequestBody;
import software.amazon.awssdk.services.s3.S3Client;
import software.amazon.awssdk.services.s3.S3ClientBuilder;
import software.amazon.awssdk.services.s3.model.PutObjectRequest;

/**
  * Handler for requests to Lambda function.
  */
public class S3Java implements RequestHandler<Object, Object> {

  private S3ClientBuilder builder = S3Client.builder();

  @Override
  public Object handleRequest(final Object input, final Context context) {

    String bucket = System.getenv("S3Bucket");
    String key = UUID.randomUUID().toString();

    try (S3Client client = builder.build()) {
      client.putObject(PutObjectRequest.builder().bucket(bucket).key(key).build(),
          RequestBody.fromString("This is a test"));
    }

    String msg = MessageFormat.format("Created S3 File {0} in bucket {1}", key, bucket);
    context.getLogger().log(MessageFormat.format("Created S3 File {0} in bucket {1}", key, bucket));
    return msg;
  }
}

We need to add the official AWS S3 dependency to S3Java/build.gradle for the code to compile:

plugins {
    id 'java'
}

repositories {
    mavenCentral()
}

dependencies {
    implementation 'com.amazonaws:aws-lambda-java-core:1.2.0'
    implementation 'software.amazon.awssdk:s3:2.13.31'
    testImplementation 'junit:junit:4.12'
}

This Lambda function is very simple: it identifies an S3 Bucket by name from the environment, and then it creates a random file in that bucket with the message "This is a test".

Step 3: Build, Deploy, and Run the Java Lambda (without GraalVM)

To build and deploy this Lambda function, run the command in a terminal window, in the same directory where the template.yaml file is located:

sam build

This produces the following output:

Building function 'S3Java'
Running JavaGradleWorkflow:GradleBuild
Running JavaGradleWorkflow:CopyArtifacts

Build Succeeded

Built Artifacts  : .aws-sam/build
Built Template   : .aws-sam/build/template.yaml

Commands you can use next
=========================
[*] Invoke Function: sam local invoke
[*] Deploy: sam deploy --guided

To deploy this Lambda function, run the following command using the same terminal window and directory:

sam deploy --guided

This produces the following output:

Configuring SAM deploy
======================

Looking for samconfig.toml :  Not found

Setting default arguments for 'sam deploy'
=========================================
Stack Name [sam-app]: graalvm-s3
AWS Region [us-east-1]:
#Shows you resources changes to be deployed and require a 'Y' to initiate deploy
Confirm changes before deploy [y/N]: N
#SAM needs permission to be able to create roles to connect to the resources in your template
Allow SAM CLI IAM role creation [Y/n]: Y
Save arguments to samconfig.toml [Y/n]: Y

SAM CLI will create and deploy a CloudFormation Stack to your AWS Account. This will take a few minutes, but at the end you should see this message:

Successfully created/updated stack - graalvm-s3 in us-east-1

You can confirm the CloudFormation Stack was created by visiting the CloudFormation Console or by using the following AWS CLI command:

aws cloudformation describe-stacks --stack-name graalvm-s3 --region us-east-1

{
  "Stacks": [
    {
      ...
      "StackName": "graalvm-s3",
      "StackStatus": "CREATE_COMPLETE",
      "Outputs": [
          {
            "OutputKey": "S3Bucket",
            "OutputValue": "arn:aws:s3:::graalvm-s3-s3bucket-XXXXXXXXXXXXXXXXX",
            "Description": "S3Bucket"
          ,      
          {
            "OutputKey": "S3Java",
            "OutputValue": "arn:aws:lambda:us-east-1:111111111111:function:graalvm-s3-S3Java-XXXXXXXXXXXXXX",
            "Description": "S3Java Lambda Function ARN"
          }
      ],
      ...
    }
  ]
}

A StackStatus "CREATE_COMPLETE" shows that the CloudFormation was successful. In the CloudFormation Outputs, you will see the ARN of the Lambda function we will use to run the Lambda function, as well as the S3 Bucket the files will be written to.

We can run the Lambda function by using AWS CLI with the command:

aws lambda invoke --function-name graalvm-s3-S3Java-XXXXXXXXXXXX outfile --region us-east-1

This will run the Lambda function and write the output of the function to a file called output. Viewing that file, you should see something similar to the following:

Created S3 File 9a887bc5-164f-4acd-8e81-575685e8162f in bucket graalvm-s3-s3bucket-XXXXXXXXXXXXXXXXX

Now the last thing we need to know is how long this Lambda function takes to execute. The easiest way to see this is using the AWS CLI. Using this command will show the CloudWatch logs for the Lambda function:

sam logs --name graalvm-s3-S3Java-XXXXXXXXXXXXXX --region us-east-1

You can also visit the CloudWatch Console. The output should be similar as below (we are looking for the Duration time):

START RequestId: 6ab5a222-1644-4917-b4f9-e59c3e4cfb75 Version: $LATEST
...
Created S3 File 9a887bc5-164f-4acd-8e81-575685e8162f in bucket graalvm-s3-s3bucket-XXXXXXXXXXXXXXXXX
END RequestId: 6ab5a222-1644-4917-b4f9-e59c3e4cfb75
REPORT RequestId: 6ab5a222-1644-4917-b4f9-e59c3e4cfb75  Duration: 10315.08 ms Billed Duration: 10400 ms Memory Size: 512 MB Max Memory Used: 161 MB Init Duration: 496.62 ms

We can see that it took 10400 ms, or 10.4 seconds, to execute this Lambda function. That is not very good, but we now have a time baseline. Once we convert this Lambda to GraalVM, we'll be able to see if anything changes.

Step 4: Create the GraalVM Java Lambda

We are going to create a new Lambda function called S3GraalVM based on the existing S3Java. So, you should copy the S3Java folder and call it S3GraalVM:

cp -r S3Java S3GraalVM

Now, delete the file S3GraalVM/src/main/java/helloworld/S3Java.java and create S3GraalVM/src/main/java/helloworld/S3GraalVM.java using the code below:

package helloworld;

import java.text.MessageFormat;
import java.util.UUID;
import com.amazonaws.services.lambda.runtime.Context;
import com.amazonaws.services.lambda.runtime.RequestHandler;
import software.amazon.awssdk.core.sync.RequestBody;
import software.amazon.awssdk.services.s3.S3Client;
import software.amazon.awssdk.services.s3.S3ClientBuilder;
import software.amazon.awssdk.services.s3.model.PutObjectRequest;

/**
  * Handler for requests to Lambda function.
  */
public class S3GraalVM implements RequestHandler<Object, Object> {

  private S3ClientBuilder builder = S3Client.builder();

  static {
      System.setProperty("software.amazon.awssdk.http.service.impl",
        "software.amazon.awssdk.http.urlconnection.UrlConnectionSdkHttpService");
  }
  
  @Override
  public Object handleRequest(Object input, Context context) {
    String bucket = System.getenv("S3Bucket");
    String key = UUID.randomUUID().toString();

    try (S3Client client = builder.build()) {
      client.putObject(PutObjectRequest.builder().bucket(bucket).key(key).build(),
          RequestBody.fromString("This is a test"));
    }

    String msg = MessageFormat.format("Created S3 File {0} in bucket {1}", key, bucket);
    context.getLogger().log(MessageFormat.format("Created S3 File {0} in bucket {1}", key, bucket));
    return msg;
  }
}

From a code perspective, the only change is setting the System Property software.amazon.awssdk.http.service.impl. The AWS SDK by default uses Apache & Netty for its HTTP service calls. This adds a ton of extra classes, and for AWS Lambda, this means slower startup times. Luckily, as of AWS SDK 2.0, we can change the SDK to use Java's built-in URLConnection class instead.

We then need to update our build.gradle to add url-connection-client, and exclude the apache-client and netty-nio-client, so they are not included in our final build:

dependencies {
...    
  implementation 'software.amazon.awssdk:url-connection-client:2.13.31'
  configurations.all {
    exclude group: 'software.amazon.awssdk', module: 'apache-client'
    exclude group: 'software.amazon.awssdk', module: 'netty-nio-client'
  }
...
}

Using a "FAT" Jar File

GraalVM needs to be run against a "FAT" jar file, i.e., a single jar file that contains all code and dependencies. We will use the com.github.johnrengelman.shadow gradle plugin to easy accomplish this. Also, we will be using Lambda's custom runtime, so we need to use FormKiQ's open source GraalVM Lambda Runtime library.

Add the following to build.gradle:

plugins {
...
  id "com.github.johnrengelman.shadow" version "5.2.0"
}

dependencies {
...
  implementation 'com.formkiq:lambda-runtime-graalvm:1.1'
  implementation 'org.slf4j:slf4j-simple:1.7.26'
}

jar {
  manifest {
    attributes 'Main-Class': 'com.formkiq.lambda.runtime.graalvm.LambdaRuntime'
  }
}

Step 5: Build, Deploy, and Run the GraalVM Java Lambda

GraalVM works by taking the "FAT" jar file and creating a Linux executable file that AWS Lambda can run. Unfortunately, GraalVM does not support all the features of Java. This is not generally a big deal, but one important feature it does not support without modification is Reflection. Because the FormKiQ GraalVM Lambda Runtime needs reflection to find the Lambda function to run, we need to use GraalVM's ReflectionConfigurationFiles. In this file we can define any classes we will be calling using reflection, and GraalVM will automatically add support for these classes.

Create the file S3GraalVM/src/main/resources/reflect.json, defining our Lambda class inside:

[
{
    "name": "helloworld.S3GraalVM",
    "allDeclaredConstructors": true,
    "allPublicConstructors": true,
    "allDeclaredMethods": true,
    "allPublicMethods": true
  }
]

Create file S3GraalVM/build_graalvm.sh, a shell script which will use Docker to convert the S3GraalVM-all.jar to an executable called server. (Make sure you give the build_graalvm.sh execute permission.)

#!/bin/bash

docker run --rm -v $(pwd):/working oracle/graalvm-ce:20.1.0-java11 \
    /bin/bash -c "
                    gu install native-image; \
                    native-image --enable-url-protocols=http,https \
                      -H:ReflectionConfigurationFiles=/working/src/main/resources/reflect.json \
                      -H:+ReportUnsupportedElementsAtRuntime --no-server -jar \"/working/build/libs/S3GraalVM-all.jar\" \
                    ; \
                    cp S3GraalVM-all /working/build/graalvm/server"

mkdir -p build/graalvm
if [ ! -f "build/graalvm/server" ]; then
    echo "there was an error building graalvm image"
    exit 1
fi

Add a task to build.gradle that will build the GraalVM image automatically when the project is built:

task buildGraalVMImage {
inputs.files("${project.projectDir}/src/main", configurations.compileClasspath)
outputs.upToDateWhen {file("${buildDir}/graalvm/server").exists()}
outputs.file file("${buildDir}/graalvm/server")

doLast {
    exec {
      commandLine "bash", "-c", "./build_graalvm.sh"
    } 
  }
}

buildGraalVMImage.dependsOn shadowJar, test
build.dependsOn buildGraalVMImage

Once we have the GraalVM image, AWS requires a bootstrap file to be able to execute the Lambda function.

Create the file ./S3GraalVM/bootstrap, a script which will be bundled with the Lambda function and that AWS will call to execute the Lambda function. (Make sure you give build_graalvm.sh execute permission.)

#!/bin/sh
set -euo pipefail
./server

We are almost done, the last thing we have to do is configure AWS SAM Cli to build our custom runtime. This is done though a Makefile. The Makefile is pretty simple, it just builds the gradle project and copies the server and bootstrap files to the SAM build directory.

Create file S3GraalVM/Makefile with the following code:

CUR_DIR := $(abspath $(patsubst %/,%,$(dir $(abspath $(lastword $(MAKEFILE_LIST))))))

build-S3GraalVM:
  cd $(CUR_DIR) && ./gradlew build
  cp $(CUR_DIR)/build/graalvm/server $(ARTIFACTS_DIR)
  cp $(CUR_DIR)/bootstrap $(ARTIFACTS_DIR)

Note: if you get the error Makefile:4: *** missing separator, it's because Makefile need to use TABS and not spaces to indent.

Lastly, update the template.yaml file to include our new Lambda function:

...
  S3GraalVM:
    Type: AWS::Serverless::Function
    Properties:
      CodeUri: S3GraalVM
      Handler: helloworld.S3GraalVM::handleRequest
      Runtime: provided
      MemorySize: 512
      Policies:
      - Statement:
        - Sid: S3Access
          Effect: Allow
          Action:
          - s3:GetObject
          - s3:PutObject
          Resource: !Sub 'arn:aws:s3:::${S3Bucket}/*'
      Environment:
        Variables:
          S3Bucket: !Ref S3Bucket
...
Outputs:
  ...
  S3GraalVM:
    Description: "S3GraalVM Lambda Function ARN"
    Value: !GetAtt S3GraalVM.Arn

Build the Lambda function by running the command in a terminal window, in the directory where the template.yaml file is located (it will take a few minutes for GraalVM to build the project):

sam build

This produces the following output:

Build Succeeded

Deploy the Lambda function by running the command in the same terminal window and folder:

sam deploy

This products the following output:

CloudFormation outputs from deployed stack
-------------------------------------------------------------------
Outputs
-------------------------------------------------------------------
Key                 S3Bucket
Description         S3Bucket
Value               arn:aws:s3:::graalvm-s3-s3bucket-XXXXXXXXXXXXXXXX

Key                 S3Java
Description         S3Java Lambda Function ARN
Value               arn:aws:lambda:us-east-1:622653865277:function:graalvm-s3-S3Java-XXXXXXXXXXXXXX

Key                 S3GraalVM
Description         S3GraalVM Lambda Function ARN
Value               arn:aws:lambda:us-east-1:622653865277:function:graalvm-s3-S3GraalVM-XXXXXXXXXXXX

As with the Java Lambda above, we can use the AWS CLI to run the GraalVM Lambda function:

aws lambda invoke --function-name graalvm-s3-S3GraalVM-XXXXXXXXXXXX outfile --region us-east-1

The output of the Lambda function will be written to a file called "output", with content similar to:

Created S3 File 9a887bc5-164f-4acd-8e81-575685e8162f in bucket graalvm-s3-s3bucket-XXXXXXXXXXXXXXXXX

To view Cloudwatch logs: , you should see the following:

sam logs --name graalvm-s3-S3GraalVM-XXXXXXXXXXXX --region us-east-1

You should see something similar to this:

START RequestId: eb4f0d56-8911-445b-95c3-cc67bf25d607 Version: $LATEST
Created S3 File 9a887bc5-164f-4acd-8e81-575685e8162f in bucket graalvm-s3-s3bucket-XXXXXXXXXXXXXXXXX
END RequestId: eb4f0d56-8911-445b-95c3-cc67bf25d607
REPORT RequestId: eb4f0d56-8911-445b-95c3-cc67bf25d607  Duration: 499.11 ms Billed Duration: 700 ms Memory Size: 512 MB Max Memory Used: 80 MB  Init Duration: 190.00 ms

As you can see, switching our Java Lambda function to use GraalVM has brought the duration from a cold start of over 10 seconds to one that takes less than half a second.

Summary

We built two Lambda functions, one using standard Java 11 and a second using GraalVM. We found that using GraalVM the runtime for our Lambda function went from over 10 seconds down to less than half a second.

If you want to learn more in detail about why AWS Lambda functions written in Java are slow, you can watch this video, "Best practices for AWS Lambda and Java" from AWS Reinvent 2019:

View this Tutorial on GitHub

Why FormKiQ?

Why FormKiQ - Overview

Use Cases

For Teams

For Industries

Resources

Learning Center

Speed Up AWS Lambda using GraalVM

Introduction

What You Need for this Tutorial

Step 1: Create a New Project using SAM CLI for a our Java and GraalVM Lambdas

Step 2: Create the Java Lambda (without GraalVM)

Step 3: Build, Deploy, and Run the Java Lambda (without GraalVM)

Step 4: Create the GraalVM Java Lambda

Using a "FAT" Jar File

Step 5: Build, Deploy, and Run the GraalVM Java Lambda

Summary

Try FormKiQ Core today

Get Started with FormKiQ Essentials

FormKiQ Advanced and Enterprise