Test management
Keep the test suite useful as it grows past 10,000 cases.
Test management is the unglamorous middle of the QA stack — not the writing of tests, not the running of them, but the discipline of keeping the system useful as it grows past 1,000 test cases, past 10,000, past 100,000. The team that ignores it ends up with a test suite that takes hours to run, a backlog of failures nobody owns, and a regression strategy that's "run everything and pray".
This hub is the answer to that middle. Five articles, each addressing one decision the team has to make explicitly:
- Test-case design — writing cases that don't go stale after the third refactor
- Traceability matrix — without spreadsheet hell
- Regression strategy — at 10k+ tests, run-everything doesn't scale
- Exploratory testing — alongside automation, not instead of it
- Defect triage — the process that prevents drowning when defect volume grows
Why this matters now
Test management is in flux for three reasons:
- AI test generation has changed the volume game. A team using AI-assisted test generation can ship 5-10x more test cases per sprint than they could write manually. Most test management systems were designed for the human-authored-test era — they handle 5,000 tests fine and start to creak at 50,000. The middle is uncomfortable.
- Shift-left has put tests in the developer's hands. The old model — a separate QA team owns the test suite, files defects back to engineering — is increasingly rare in modern product engineering. Tests live in the same repo as the code; defects are tracked in the same system as stories; the QA function is increasingly about strategy and tooling rather than test authoring.
- The traceability-and-compliance bar is rising. Regulated industries (FDA, FedRAMP, SOC 2) have always needed traceability. The 2024+ trend is that non-regulated industries are adopting similar discipline as enterprise procurement starts asking for it. A test management story that says "we have tests but can't tell you which AC they verify" loses deals.
The teams getting this right have systematic answers to all five questions in this hub. The teams not getting it right have a TestRail license and hope.
What this hub assumes
You're in a 10-50 person engineering organisation with at least a thousand active test cases — manual, automated, or some mix. You have an existing tool of record (TestRail, Zephyr, qTest, Xray, or an in-house system) or you're evaluating whether to adopt one. You're shipping at least weekly, and the test suite is starting to feel like more of a constraint than a safety net.
If you're earlier than that (a few dozen tests, single product, single team), the discipline this hub describes is overkill — start with the test pyramid glossary entry and the code-review hub instead.
The five practices in one paragraph
A mature test-management practice writes tests anchored to behaviour rather than UI steps so they survive refactors, maintains traceability via the underlying entity graph rather than spreadsheets, runs regression by risk and impact rather than by exhaustive execution, complements automation with explicit exploratory sessions that find what scripts can't, and triages defects by severity-frequency-impact with explicit non-fix criteria so the backlog doesn't drown the team. Each of the five articles unpacks one of these dimensions in detail.
How to use this hub
If you're optimising an existing test-management practice, start with defect triage — most teams find their biggest pain there. If you're building from scratch, read in order — test-case design sets the foundation, traceability + regression operationalise it, exploratory closes the AI gap, and triage handles the steady state.
Each article includes practical templates (case structure, matrix layouts, triage workflows) you can adapt directly. The Stride research linked from each article digs deeper into the underlying patterns.
All articles in this hub
Test-case design that doesn't go stale
9 minBehaviour-anchored Gherkin survives refactors that break step-anchored UI tests. The 5 components every good case has, and structural moves that age well.
ReadTraceability matrix without spreadsheet hell
9 minManual spreadsheets drift within weeks. The derived-matrix approach — auto-generated from the entity graph — stays accurate and pays back for audit-grade compliance.
ReadRegression strategy that scales past 10,000 tests
11 minAt 10k+ tests, "run everything" stops being a strategy. The 4-tier approach (smoke / affected / full nightly / pre-release) keeps iteration fast without sacrificing coverage.
ReadExploratory testing alongside automation
10 minCharters, time-boxes, observed defect rates. The structured discipline that finds the bugs automation never catches — UX issues, unexpected combinations, real-world data quirks.
ReadDefect triage that doesn't drown the team
10 minSeverity × frequency × impact, with explicit non-fix criteria and SLAs per severity tier. The process that prevents the backlog from growing to 400+ untriaged items.
Read
Long-form blog posts that go deeper on test management — defended in detail.
- Are AI-generated test cases worth shipping?Yes, with a sharp caveat — when they're tied to AC and reviewed by a human. Five categories where AI test generation is great, five anti-patterns to catch.9 min read
- Can AI write Gherkin? (yes — here's how)Yes. AI writes Gherkin well, often better than humans for surface area coverage. Five wins, five recognisable failure modes, and the prompts that work.8 min read
- The connected delivery graph: one source of truth from PRD to prodMost teams ship software with five tools that don't talk to each other. The friction isn't any individual tool — it's the missing graph between them. This is the case for one connected graph.9 min read
- What's the actual ROI of AI in software delivery?$4-$8 back for every dollar spent within 6 months, for most teams. The honest math from real data, not the deck.7 min read