How to Test throughput and latency to API?

Question

Ravi Vishwakarma · Accepted Answer

Testing API throughput and latency is about measuring
how fast and how much your API can handle under different conditions. Below is a
practical, production-style guide with tools, metrics, and examples—especially relevant if you’re working with .NET / web APIs.
1. Key Definitions (Clear Difference)
Latency
Time taken for one request → one response
Measured in:
ms (milliseconds)P50, P90, P95, P99 (percentiles)
Example:
P50 = 120ms
P95 = 450ms
Throughput
Number of requests your API can handle per second
Measured in:
RPS (Requests Per Second)TPS (Transactions Per Second)
Example:
Throughput = 2,500 RPS
2. What You Should Measure
Metric
Why it matters
Avg latency
General performance
P95 / P99 latency
Real user experience
Throughput (RPS)
Capacity
Error rate
Stability
CPU / Memory
Bottlenecks
DB time
Backend limits
3. Simple Local Testing (Quick Checks)
Using curl (Latency only)
curl -w "
Time_Total: %{time_total}s
" -o /dev/null -s https://api.example.com/users
Good for:
One requestNetwork + server latency
4. Apache Benchmark (ab)
Install
sudo apt install apache2-utils
Test example
ab -n 10000 -c 100 https://api.example.com/users
Where:
-n = total requests-c = concurrent users
Output to focus on
Requests per second: 2300.15 [#/sec]
Time per request: 43.2 ms
5. k6 (Highly Recommended – Production Grade)
Best for throughput + latency + load patterns
Install
choco install k6       # Windows
brew install k6        # macOS
k6 Latency Test
import http from 'k6/http';
import { sleep } from 'k6';
export let options = {
  vus: 50,
  duration: '30s',
};
export default function () {
  http.get('https://api.example.com/users');
  sleep(1);
}
Run:
k6 run test.js
Metrics you get:
avg latencyp90 / p95 / p99RPSfailure %
k6 Throughput Stress Test
export let options = {
  stages: [
    { duration: '30s', target: 100 },
    { duration: '1m', target: 500 },
    { duration: '30s', target: 0 },
  ],
};
Shows:
breaking pointmax sustainable throughput
6. JMeter (GUI + Heavy Testing)
Best when:
QA teamsComplex workflowsAuth flows
Steps:
Thread Group → UsersHTTP Request → APISummary ReportAggregate Report
Key Metrics:
Throughput90% LineError %
7. Postman (Beginner-Friendly)
Collection Runner
Set iterationsSet concurrencyView avg response time
Not accurate for high throughput testing
8. Production-Like Testing (Best Practice)
Run tests from:
Same region as usersSeparate machine (not API server)
Monitor during test:
API CPUMemoryGC pauses (.NET)SQL query timeThread pool starvation
9. Example Targets (Reference Benchmarks)
API Type
Good Latency (P95)
Throughput
CRUD API
andlt; 300ms
2k–5k RPS
Auth API
andlt; 200ms
1k–3k RPS
Search API
andlt; 500ms
500–2k RPS
10. Common Mistakes to Avoid
Testing from localhostIgnoring P95/P99Not warming up APIForgetting DB limitsRunning load test on production without rate limits
11. Recommended Tool Stack (Professional)
Purpose
Tool
Quick test
curl
Load and stress
k6
Complex flows
JMeter
CI integration
k6 + GitHub Actions
Monitoring
Grafana + Prometheus
12. Want This Tailored to You?
I can:
Create k6 scripts for your specific APIHelp interpret test resultsRecommend rate limits and scaling strategyTune ASP.NET MVC / Web API performance

How to Test throughput and latency to API?

0

1 Answer

0

1. Key Definitions (Clear Difference)

Latency

Throughput

2. What You Should Measure

3. Simple Local Testing (Quick Checks)

Using `curl` (Latency only)

4. Apache Benchmark (ab)

Install

Test example

Output to focus on

5. k6 (Highly Recommended – Production Grade)

Install

k6 Latency Test

Metrics you get:

k6 Throughput Stress Test

6. JMeter (GUI + Heavy Testing)

Steps:

7. Postman (Beginner-Friendly)

Collection Runner

8. Production-Like Testing (Best Practice)

Run tests from:

Monitor during test:

9. Example Targets (Reference Benchmarks)

10. Common Mistakes to Avoid

11. Recommended Tool Stack (Professional)

12. Want This Tailored to You?

Your Answer

Metric	Why it matters
Avg latency	General performance
P95 / P99 latency	Real user experience
Throughput (RPS)	Capacity
Error rate	Stability
CPU / Memory	Bottlenecks
DB time	Backend limits

API Type	Good Latency (P95)	Throughput
CRUD API	< 300ms	2k–5k RPS
Auth API	< 200ms	1k–3k RPS
Search API	< 500ms	500–2k RPS

Purpose	Tool
Quick test	curl
Load & stress	k6
Complex flows	JMeter
CI integration	k6 + GitHub Actions
Monitoring	Grafana + Prometheus