All glossary terms
Design

Lambda architecture

Lambda architecture is the data-processing pattern that runs two parallel pipelines: a batch layer that computes complete views over all data with high latency, and a speed layer that computes incremental views over recent data with low latency. Queries combine results from both layers to balance freshness and completeness.

Lambda was popularised by Nathan Marz (Storm, Apache projects) around 2012-2014. The trade-off: dual implementations of the same business logic (one for batch, one for streaming) double maintenance cost and create consistency risks. Kappa architecture (Jay Kreps, 2014) proposed eliminating the batch layer by handling everything via streaming with replay capability — simpler but historically required more sophisticated stream-processing tools. Modern data architectures often use unified frameworks (Apache Flink, Apache Beam, Materialize) that bridge the gap. Lambda remains relevant for legacy systems and specific cases where batch's analytical depth complements streaming's recency.

Related terms