DEV Community

Cover image for AWS DynamoDB DocumentClient & Node.js - Complete Cheat Sheet
Rafal Wilinski
Rafal Wilinski

Posted on • Originally published at dynobase.dev

AWS DynamoDB DocumentClient & Node.js - Complete Cheat Sheet

DynamoDB Node.js Query Examples

This cheat sheet should help you how to perform variefy of operations starting from simple queries ending with complex transactions using AWS DynamoDB DocumentClient and Node.js.

There are two basic ways to interact with DynamoDB tables from Node.js applications:

  • Class AWS.DynamoDB from AWS SDK for JavaScript/Typescript
  • AWS.DynamoDB.DocumentClient which simplifies working with DynamoDB items by abstracting away DynamoDB Types and converting responses to native JS

This cheat sheet will mostly focus on DocumentClient but some of the operations like creating tables must be run using classical DynamoDB service.

If you're looking for similar cheat sheet but for Python, you can find it here

Bonus: AWS recently announced Node.js AWS SDK v3. I wrote a tutorial on how to use new SDK V3, especially in the DynamoDB context

Table of Contents

Setup

Setting up your Node.js application to work with DynamoDB is fairly easy. First, make sure that aws-sdk is installed, either by running yarn add aws-sdk or npm i aws-sdk --save. Then, paste following piece of code:

const AWS = require("aws-sdk") // Or use `import` syntax for Typescript and newer ES versions

const dynamoDB = new AWS.DynamoDB({
  region: "us-east-1", // If not set, will get from ~/.aws directory or environment variable
  // and rest of properties
})

// Or

const documentClient = new AWS.DynamoDB.DocumentClient({
  region: "us-east-1",
  // and rest of properties
})
Enter fullscreen mode Exit fullscreen mode

As one of the Node.js best practices is to avoid callback hell, we'll be ending up all of our SDK calls with .promise() call in order to get Promises returned from SDK. It will also allow us to use async/await syntax which makes our programs much more readable.

Keep in mind that using access and secret keys is against best security practices, and you should instead use IAM roles/policies to interact with DynamoDB. This code, if ran on Lamba function or EC2 instance, will automatically use IAM Role attached to it.

Create table

DynamoDB structures data in tables, so if you want to save some data to DynamoDB, first you need to create a table. You can do that using AWS Console, AWS CLI or using AWS-SDK for Javascript, like this:

const AWS = require("aws-sdk")
const dynamoDB = new AWS.DynamoDB({ region: "us-east-1" })

dynamoDB
  .createTable({
    AttributeDefinitions: [
      {
        AttributeName: "id",
        AttributeType: "S",
      },
    ],
    KeySchema: [
      {
        AttributeName: "id",
        KeyType: "HASH",
      },
    ],
    BillingMode: "PAY_PER_REQUEST",
    TableName: "my-table",
  })
  .promise()
  .then(data => console.log("Success!", data))
  .catch(console.error)
Enter fullscreen mode Exit fullscreen mode

After this call resolves, it does not necessarily mean that table status is ACTIVE and it's is ready for read and write operations. Before start manipulating items in it, we should check if it's in ACTIVE state first using describeTable function ran every 5 seconds until it is ready:

const backoffInterval = 5000 // 5 seconds

const waitForTable = TableName =>
  dynamoDB
    .describeTable({ TableName })
    .promise()
    .then(data => {
      if (data.Table.TableStatus !== "ACTIVE") {
        console.log(
          `Table status: ${data.Table.TableStatus}, retrying in ${backoffInterval}ms...`
        )
        return new Promise(resolve => {
          setTimeout(() => waitForTable().then(resolve), backoffInterval)
        })
      } else {
        return
      }
    })
    .catch(error => {
      console.warn(
        `Table not found! Error below. Retrying in ${backoffInterval} ms...`,
        error
      )

      return new Promise(resolve => {
        setTimeout(() => waitForTable().then(resolve), backoffInterval)
      })
    })

waitForTable("my-table").then(() => console.log(`my-table is ready!`))
Enter fullscreen mode Exit fullscreen mode

Delete table

If you changed your mind and need to remove DynamoDB table, don't worry, it's simple:

const AWS = require("aws-sdk")
const dynamoDB = new AWS.DynamoDB({ region: "us-east-1" })

dynamoDB
  .deleteTable({
    TableName: "my-table",
  })
  .promise()
  .then(() => console.log("Table has been deleted"))
  .catch(console.error)
Enter fullscreen mode Exit fullscreen mode

Keep in mind that Dynobase is capable to removing tables too.

List Tables

If you want to check what tables are available at your disposal in current region, use listTables call. Keep in mind that if selected region has more than 100 tables you'll have to paginate through them to fetch a complete list.

const AWS = require("aws-sdk")
const dynamoDB = new AWS.DynamoDB({ region: "us-east-1" })

dynamoDB
  .listTables()
  .promise()
  .then(() => console.log("Table has been deleted"))
  .catch(console.error)
Enter fullscreen mode Exit fullscreen mode

Get All Items / Scan in DynamoDB

After our table is provisioned and it's in ACTIVE state, first thing that we probably would like to do is get all items in it aka use (DynamoDB Scan operation):

const AWS = require("aws-sdk")
const dynamoDB = new AWS.DynamoDB.DocumentClient()

dynamoDB
  .scan({
    TableName: "my-table",
  })
  .promise()
  .then(data => console.log(data.Items))
  .catch(console.error)
Enter fullscreen mode Exit fullscreen mode

If you want to narrow your search results, use FilterExpressions combined with ExpressionAttributeNames object like so:

const AWS = require("aws-sdk")
const dynamoDB = new AWS.DynamoDB.DocumentClient({ region: "us-east-1" })

dynamoDB
  .scan({
    TableName: "my-table",
    FilterExpression:
      "attribute_not_exists(deletedAt) AND contains(firstName, :firstName)",
    ExpressionAttributeValues: {
      ":firstName": "John",
    },
  })
  .promise()
  .then(data => console.log(data.Items))
  .catch(console.error)
Enter fullscreen mode Exit fullscreen mode

You can find full reference how to write FilterExpressions in this post by Alex Debrie.

The snippet above will in fact return all the items in the table under one condition - you have less than 1MB of data inside it. If your table is bigger than that, you'll have to run Scan command a few times in a loop using pagination.

Get Item

If you know the exact Partition Key (and Sort Key if using composite key) of the item that you want to retrieve from the DynamoDB table, you can use get operation:

const AWS = require("aws-sdk")
AWS.config.update({ region: "us-east-1" })
const dynamoDB = new AWS.DynamoDB.DocumentClient()

dynamoDB
  .get({
    TableName: "my-table",
    Key: {
      id: "123", // id is the Partition Key, '123' is the value of it
    },
  })
  .promise()
  .then(data => console.log(data.Item))
  .catch(console.error)
Enter fullscreen mode Exit fullscreen mode

Batch Get Item

DocumentClient is also capable of running bunch of get operations in a single call to the DynamoDB service:

const AWS = require("aws-sdk")
AWS.config.update({ region: "us-east-1" })
const dynamoDB = new AWS.DynamoDB.DocumentClient()

dynamoDB
  .batchGet({
    RequestItems: {
      "my-table": {
        Keys: [
          {
            id: "123",
          },
          {
            id: "124",
          },
        ],
      },
      "other-table": {
        Keys: [
          {
            id: "abc",
          },
          {
            id: "abd",
          },
        ],
      },
    },
  })
  .promise()
  .then(data => console.log(data.Responses))
  .catch(console.error)
Enter fullscreen mode Exit fullscreen mode

As you can see, the RequestItems objects can accept multiple table names and can fetch multiple items from multiple tables in a single call. Keep in mind that number of items retrieved using batchGet is limited to 100 items or 16MB of data.

Moreover, if you exceed table capacity, this call will return UnprocessedKeys attribute containing a map of keys which weren't fetched.

Put Item aka Write

put operation creates a new item, or replaces an old item with a new item if it's using the same key(s):

const AWS = require("aws-sdk")
const dynamoDB = new AWS.DynamoDB.DocumentClient({ region: "us-east-1" })

dynamoDB
  .put({
    Item: {
      id: "12346",
      name: "John Doe",
      email: "john@doe.io",
    },
    TableName: "my-table",
  })
  .promise()
  .then(data => console.log(data.Attributes))
  .catch(console.error)
Enter fullscreen mode Exit fullscreen mode

Batch Write / Put Item

If you need to insert, update or delete multiple items in a single API call, use batchWrite operation. It bundles multiple database requests against multiple tables into a single SDK call. It decreases amount of network calls needed to be made, reduces the overall latency and makes your application faster.

Example request deleting one item with key id = 123 and putting another to the same table:

const AWS = require("aws-sdk")
const dynamoDB = new AWS.DynamoDB.DocumentClient({ region: "us-east-1" })

dynamoDB
  .batchWrite({
    RequestItems: {
      MyTable: [
        {
          DeleteRequest: {
            Key: { id: "123" },
          },
        },
        {
          PutRequest: {
            Item: {
              id: "234",
              name: "dynobase",
              email: "dynobase@dynobase.dev",
              description: "Professional DynamoDB Client",
            },
          },
        },
      ],
    },
  })
  .promise()
  .then(data => console.log(data.Attributes))
  .catch(console.error)
Enter fullscreen mode Exit fullscreen mode

If you want to write into multiple tables at once, simply specify these tables inside RequestItems object like this:

const AWS = require("aws-sdk")
const dynamoDB = new AWS.DynamoDB.DocumentClient({ region: "us-east-1" })

dynamoDB
  .batchWrite({
    RequestItems: {
      TableOne: [
        {
          DeleteRequest: {
            Key: { id: "123" },
          },
        },
      ],
      TableTwo: [
        {
          PutRequest: {
            Item: {
              id: "234",
              name: "dynobase",
              email: "dynobase@dynobase.dev",
              description: "Professional DynamoDB Client",
            },
          },
        },
      ],
    },
  })
  .promise()
  .then(data => console.log(data.Attributes))
  .catch(console.error)
Enter fullscreen mode Exit fullscreen mode

If you are curious about performance of batchWrite, head to dynamodb-performance-testing repo by Alex DeBrie.

Query for a Set of Items

If your table has composite key (which is the best practice), in order to get a collection of items sharing the same Parition Key, use Query method. It also allows to use multiple operators for SortKey such as begins_with or mathematical ones like >, =, >= and so on.

const AWS = require('aws-sdk');
const dynamoDB = new AWS.DynamoDB.DocumentClient({region:'us-east-1'});

dynamoDB
  .query({
    TableName: 'my-table',
    KeyConditionExpression: 'id = :hashKey and date > :rangeKey'
    ExpressionAttributeValues: {
      ':hashKey': '123',
      ':rangeKey': 20150101
    }
  })
  .promise()
  .then(data => console.log(data.Items))
  .catch(console.error);
Enter fullscreen mode Exit fullscreen mode

Keep in mind that Query can return up to 1MB of data and you can also use FilterExpressions here to narrow the results on non-key attributes.

If you don't know how to construct your Query and its attributes, head to our DynamoDB Query Builder which will generate code for you. You can also use Query Code Generation feature inside Dynobase.

Simple Transaction

DynamoDB also support transactions - they allow to run multiple write operations atomically meaning that either all of operations are executed succesfully or none of them. It is especially useful when dealing with applications where data integrity is essential, e.g. in e-commerce - adding an item to a cart and decrementing count of items still available to buy.

Such flow should:

  • Should happen atomically - these two operations should be treated as one, we don't want to have a single moment in time where there's a discrepancy in items count
  • Should succeed only if count of items available to buy is greated than zero

Described flow can be modelled in Javascript & DocumentClient like this:

const AWS = require("aws-sdk")
const dynamoDB = new AWS.DynamoDB.DocumentClient({ region:'us-east-1' })

await dynamoDB.transactWrite({
  TransactItems: [
    {
      Put: { // Add item to cart
        Item: {
          id: '1',
          count: '1'
        }
        TableName: "CartTable",
      },
    },
    {
      Update: { // decrement count of items available to buy only if the count is greater than zero
        ConditionExpression: "#count > 0",
        ExpressionAttributeNames: { "#count": "count" },
        ExpressionAttributeValues: {
          ":value": 1,
        },
        Key: {
          id: '123',
        }
        TableName: "ItemsTable",
        UpdateExpression: "SET #count = :count - :value",
      },
    },
  ],
}).promise();
Enter fullscreen mode Exit fullscreen mode

If you want to learn more about transactions, head to our DynamoDB Transactions Guide.

Read Transaction

Transactions can be also used for reading data atomically. Like in batchGet, you can retrieve the data from multiple tables in a single call:

const AWS = require("aws-sdk")
const dynamoDB = new AWS.DynamoDB.DocumentClient({ region: "us-east-1" })

await dynamoDB
  .transactGet({
    TransactItems: [
      {
        Get: {
          TableName: "TableOne",
          Key: {
            HashKey: "abcd",
          },
        },
      },
      {
        Get: {
          TableName: "TableTwo",
          Key: {
            HashKey: "1234",
          },
        },
      },
    ],
  })
  .promise()
Enter fullscreen mode Exit fullscreen mode

If you want to learn more about transactions, head to our DynamoDB Transactions Guide.

Query with Sorting

Unfortunately, DynamoDB offers only one way of sorting the results on the database side - using the sort key. If your table does not have one, your sorting capabilities are limited to sorting items in application code after fetching the results. However, if you need to sort DynamoDB results on sort key descending or ascending, you can use following syntax:

const AWS = require('aws-sdk');
const dynamoDB = new AWS.DynamoDB.DocumentClient({ region: 'us-east-1' });

dynamoDB
  .query({
    TableName: 'my-table',
    IndexName: 'Index', // Main one
    KeyConditionExpression: 'id = :hashKey and date > :rangeKey'
    ExpressionAttributeValues: {
      ':hashKey': '123',
      ':rangeKey': 20150101
    },
    ScanIndexForward: true // true or false to sort by "date" Sort/Range key ascending or descending
  })
  .promise()
  .then(data => console.log(data.Items))
  .catch(console.error);
Enter fullscreen mode Exit fullscreen mode

Query (and Scan) DynamoDB Pagination

Both Query and Scan operations return results with up to 1MB of items. If you need to fetch more records, you need to invoke a second call to fetch the next page of results. If LastEvaluatedKey is present in response object, this table has more items like requested and another call with ExclusiveStartKey should be sent to fetch more of them:

const getAll = async () => {
  let result, accumulated, ExclusiveStartKey;

  do {
    result = await DynamoDB.query({
      TableName: argv.table,
      ExclusiveStartKey,
      Limit: 100,
      KeyConditionExpression: 'id = :hashKey and date > :rangeKey'
      ExpressionAttributeValues: {
        ':hashKey': '123',
        ':rangeKey': 20150101
      },
    }).promise();

    ExclusiveStartKey = result.LastEvaluatedKey;
    accumulated = [...accumulated, ...result.Items];
  } while (result.Items.length || result.LastEvaluatedKey);

  return accumulated;
};

getAll()
  .then(console.log)
  .catch(console.error);
Enter fullscreen mode Exit fullscreen mode

Update Item

DynamoDB update operation in Node.js consists of two main parts:

  • Part which item to update (Key), similar to get
  • Part what in the selected item should be updated (UpdateExpression and ExpressionAttributeValues)
const AWS = require("aws-sdk")
AWS.config.update({ region: "us-east-1" })
const dynamoDB = new AWS.DynamoDB.DocumentClient()

dynamoDB
  .update({
    TableName: "my-table",
    Key: {
      id: "123",
    },
    UpdateExpression: `set firstName = :firstName`,
    ExpressionAttributeValues: {
      ":firstName": "John McNewname",
    },
  })
  .promise()
  .then(data => console.log(data.Attributes))
  .catch(console.error)
Enter fullscreen mode Exit fullscreen mode

Conditionally Update Item

Sometimes we want to update our record only if some condition is met, e.g. item is not soft-deleted (does not have deletedAt attribute set). To do that, use ConditionExpression which has similar syntax to the FilterExpression:

const AWS = require("aws-sdk")
AWS.config.update({ region: "us-east-1" })
const dynamoDB = new AWS.DynamoDB.DocumentClient()

dynamoDB
  .update({
    TableName: "my-table",
    Key: {
      id: "123",
    },
    UpdateExpression: `set firstName = :firstName`,
    ExpressionAttributeValues: {
      ":firstName": "John McNewname",
      ":company": "Apple",
    },
    ConditionExpression: `attribute_not_exists(deletedAt) and company = :company`,
  })
  .promise()
  .then(data => console.log(data.Attributes))
  .catch(console.error)
Enter fullscreen mode Exit fullscreen mode

In this example the name attribute of the record with partition key id = 123 in table my-table will be only updated if this item does not have attribute deletedAt and its attribute company has value Apple.

Increment Item Attribute

Incrementing a Number value in DynamoDB item can be achieved in two ways:

  1. Get item, update the value in the application code and send a put request back to DDB overwriting item
  2. Using update operation

While it might be tempting to use first method because Update syntax is unfriendly, I strongly recommend using second one because of the fact it's much faster (requires only one request) and atomic:

const AWS = require("aws-sdk")
AWS.config.update({ region: "us-east-1" })
const dynamoDB = new AWS.DynamoDB.DocumentClient()

dynamoDB
  .update({
    TableName: "my-table",
    Key: {
      id: "123",
    },
    UpdateExpression: `set score = :score + :value`,
    ExpressionAttributeValues: {
      ":value": 1,
    },
  })
  .promise()
  .then(data => console.log(data.Attributes))
  .catch(console.error)
Enter fullscreen mode Exit fullscreen mode

In this example the score attribute of the record with partition key id = 123 in table my-table will incremented by one. Of course, you can use other mathematical operators too.

Delete Item

Removing single item from table is very similar to Get Item operation. The parameters of the call are actually exactly the same, the only difference is that we call delete instead of get:

const AWS = require("aws-sdk")
const dynamoDB = new AWS.DynamoDB.DocumentClient({ region: "us-east-1" })

dynamoDB
  .delete({
    TableName: "my-table",
    Key: {
      id: "123", // id is the Partition Key, '123' is the value of it
    },
  })
  .promise()
  .then(data => console.log(data.Attributes))
  .catch(console.error)
Enter fullscreen mode Exit fullscreen mode

Delete All Items

Unfortunately, there's no easy way to delete all items from DynamoDB just like in SQL-based databases by using DELETE FROM my-table;. To achieve the same result in DynamoDB, you need to query/scan to get all the items in a table using pagination until all items are scanned and then perform delete operation one-by-one on each record.

const truncateTable = async () => {
  let ExclusiveStartKey, result

  do {
    result = await DynamoDB.scan({
      TableName: argv.table,
      ExclusiveStartKey,
    }).promise()

    ExclusiveStartKey = result.LastEvaluatedKey

    console.log(`Found ${result.Items.length} Items, removing...`)

    if (result.Items.length > 0) {
      await Promise.all(
        result.Items.map(async item =>
          DynamoDB.delete({
            TableName: argv.table,
            Key: {
              pk: item.pk,
              sk: item.sk,
            },
          }).promise()
        )
      )
    }
  } while (result.Items.length || result.LastEvaluatedKey)
}

truncateTable()
  .then(() => console.log("Done!"))
  .catch(console.error)
Enter fullscreen mode Exit fullscreen mode

If you don't want to run this script on your own (it might be really time consuming), truncating table is possible just with few clicks using Dynobase.

Run DynamoDB Local

If you need to use DynamoDB offline locally, you can use DynamoDB local distributed by AWS or DynamoDB from Localstack. Connecting to it is as easy as changing the endpoint parameter DynamoDB or DocumentClient constructor:

const AWS = require("aws-sdk")

const dynamoDB = new AWS.DynamoDB({
  // or DocumentClient
  endpoint: "http://localhost:8000",
})
Enter fullscreen mode Exit fullscreen mode

If you want to see your local tables and data in them, you can use Dynobase to query and modify items in offline tables.

Learn more about running DynamoDB locally.

Bonus!

If you're looking for Serverless Framework based CRUD project which uses other AWS Lambda and DynamoDB best practices with Node.js, you can find one on our Github.

Top comments (0)