Verify

RTO and RPO

Recovery Time Objective (RTO) is the maximum acceptable time between a disaster and service restoration. Recovery Point Objective (RPO) is the maximum acceptable data loss, measured backwards from the disaster: an RPO of 1 hour means up to 1 hour of recent transactions may be lost. Together they define the disaster recovery contract.
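The two definitions reduce to simple timestamp arithmetic, which the following minimal sketch illustrates. The function names and the drill timestamps are hypothetical, chosen only for illustration; "last recoverable" is assumed to mean the timestamp of the newest data that survived the restore.

```python
from datetime import datetime, timedelta


def achieved_rto(disaster_at: datetime, restored_at: datetime) -> timedelta:
    """Time between the disaster and service restoration."""
    return restored_at - disaster_at


def achieved_rpo(disaster_at: datetime, last_recoverable_at: datetime) -> timedelta:
    """Window of data lost, measured backwards from the disaster."""
    return disaster_at - last_recoverable_at


def meets_objectives(
    disaster_at: datetime,
    restored_at: datetime,
    last_recoverable_at: datetime,
    rto_target: timedelta,
    rpo_target: timedelta,
) -> bool:
    """True only if both the time objective and the data-loss objective were met."""
    return (
        achieved_rto(disaster_at, restored_at) <= rto_target
        and achieved_rpo(disaster_at, last_recoverable_at) <= rpo_target
    )


# Made-up drill: outage at 12:00, newest surviving data from 11:20,
# service back at 14:30.
disaster = datetime(2026, 1, 10, 12, 0)
last_recoverable = datetime(2026, 1, 10, 11, 20)
restored = datetime(2026, 1, 10, 14, 30)

print(achieved_rto(disaster, restored))          # 2:30:00
print(achieved_rpo(disaster, last_recoverable))  # 0:40:00
print(meets_objectives(disaster, restored, last_recoverable,
                       rto_target=timedelta(hours=4),
                       rpo_target=timedelta(hours=1)))  # True
```

In this invented drill the service would satisfy a 4-hour RTO and a 1-hour RPO, but not a 15-minute RTO.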

May 23, 2026

RTO drives the recovery architecture: a 4-hour RTO can be served by backup-and-restore, a 15-minute RTO requires a hot standby, and a sub-minute RTO requires active-active. RPO drives the replication architecture: a 24-hour RPO is met by nightly snapshots, a 1-hour RPO requires hourly snapshots or continuous WAL shipping, and a near-zero RPO requires synchronous multi-AZ replication, which adds latency to every write. The common mistake is setting a tight RTO or RPO without budgeting for the architecture and operational discipline needed to meet it. The honest exercise is to ask, 'When did we last actually achieve this in a test?' If the answer is 'never', the numbers are aspirational.
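The mapping from objectives to architecture can be sketched as a lookup. This is an illustration under stated assumptions, not a sizing tool: the tier names and data points come from the paragraph above, while the names `RECOVERY_TIERS`, `REPLICATION_TIERS`, `pick_tier`, and `plan` are hypothetical, and so is the rule of picking the cheapest tier whose stated objective is at least as tight as the target. Targets that fall between the stated data points are rounded to the stricter tier.

```python
from datetime import timedelta

# Each entry: (tightest objective the text associates with the tier, tier name),
# ordered from cheapest to most expensive. timedelta(0) stands in for
# "sub-minute" and "near-zero" (an assumption made for this sketch).
RECOVERY_TIERS = [
    (timedelta(hours=4), "backup-and-restore"),
    (timedelta(minutes=15), "hot standby"),
    (timedelta(0), "active-active"),
]
REPLICATION_TIERS = [
    (timedelta(hours=24), "nightly snapshots"),
    (timedelta(hours=1), "hourly snapshots or continuous WAL shipping"),
    (timedelta(0), "synchronous multi-AZ replication"),
]


def pick_tier(target: timedelta, tiers: list[tuple[timedelta, str]]) -> str:
    """Return the cheapest tier whose stated objective is within the target."""
    for tightest, name in tiers:
        if tightest <= target:
            return name
    raise ValueError("objective must be non-negative")


def plan(rto: timedelta, rpo: timedelta) -> dict[str, str]:
    """Map an (RTO, RPO) pair to a recovery and a replication architecture."""
    return {
        "recovery": pick_tier(rto, RECOVERY_TIERS),
        "replication": pick_tier(rpo, REPLICATION_TIERS),
    }


print(plan(rto=timedelta(hours=4), rpo=timedelta(hours=24)))
print(plan(rto=timedelta(seconds=30), rpo=timedelta(0)))
```

A lookup like this says only what a target implies on paper; whether the chosen tier actually delivers it is still a question for a recovery test.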