This means having strategies for keeping performance good even when load increases.
Load and performance need to first be quantified to be able to discuss load where we define load parameters to do so, as well as examine response time percentiles.

Percentiles

We want to use percentiles here, instead of mathematical means, since we get a more physical meaning out of something like a p50 (half the responses are above and below this metric).