Chapter Three

Databases

Eight databases spanning relational, document, key-value, in-memory, analytical, graph, and time-series models. AWS gives you a purpose-built engine for each shape of data — and the cost of picking wrong is a rewrite.

8 services

Core Terminology

A few terms recur across this chapter. Defining them here keeps the service sections focused.

OLTP vs OLAP

Transactional databases (OLTP) serve many small reads and writes; analytical databases (OLAP) scan huge ranges for aggregates. Different engines, different shapes.

Read Replica

A read-only copy that offloads read traffic from the primary. Asynchronous, so it can lag slightly behind.

Multi-AZ

A synchronous standby in another Availability Zone that takes over automatically on failure — durability and failover, not read scaling.

Partition Key

The attribute DynamoDB hashes to distribute data. Choosing it poorly creates hot partitions that throttle throughput.

Provisioned vs On-Demand

Two capacity modes: reserve throughput for predictable load and lower cost, or pay per request for spiky, unpredictable load.

Services in This Chapter

Managed relational databases — PostgreSQL, MySQL, MariaDB, Oracle, SQL Server. AWS handles patching, backups, and failover; you keep the engine you know.

DatabaseRelational

AWS's cloud-native MySQL- and PostgreSQL-compatible engine. Decoupled storage, faster failover, and read scaling beyond what stock RDS offers.

DatabaseRelational

Amazon DynamoDB

Serverless key-value and document database with single-digit-millisecond latency at any scale — provided you model your access patterns up front.

Amazon ElastiCache

Managed Redis and Memcached. The in-memory cache and ephemeral store that takes read load off your primary database.

DatabaseIn-Memory

Amazon Redshift

Petabyte-scale data warehouse. Columnar, massively parallel analytics over structured data — the OLAP counterpart to your OLTP databases.

AnalyticsWarehouse

Amazon DocumentDB

MongoDB-compatible managed document database. The choice when you have Mongo workloads and want AWS to operate them.

Managed graph database for highly connected data — social graphs, fraud rings, recommendation networks — with Gremlin and SPARQL support.

Amazon Timestream

Purpose-built time-series database for IoT and operational metrics, with tiered storage that ages old data to cheap storage automatically.

DatabaseTime-Series