test(load): k6 script for the unified apps/api server

The pre-launch consolidation collapsed 17+ per-product backends into the
single Hono/Bun process at apps/api. That makes apps/api the single point
of failure for every authenticated module call the unified Mana web app
makes: a missing index, a hot-path allocation in auth middleware, or
rate-limiter contention degrades all 16 modules at once. The other
scripts in load-tests/ already cover mana-auth, mana-sync, mana-llm and
the SvelteKit frontends, but apps/api itself was unmeasured. This is that
missing piece.

What it tests
-------------
A weighted mixed workload that walks the full middleware stack
(CORS → request logger → rate limit → auth → router → handler) plus a
representative range of handler shapes:

  25%  GET  /health                                (no auth, baseline)
  20%  GET  /api/v1/moodlit/presets                (auth + in-memory return)
  15%  GET  /api/v1/chat/models                    (auth + DB read)
  20%  POST /api/v1/calendar/events/expand         (auth + Zod + RRULE compute)
  12%  POST /api/v1/todo/compute/next-occurrence   (auth + Zod + rrule lib)
   8%  POST /api/v1/todo/compute/validate          (auth + Zod + validation)

Deliberately no write endpoints: those would conflate write amplification
with API-server cost. The compute routes here all run in <50ms warm; what
we're measuring is the overhead the unified server adds on top of pure
handler work.

Per-route-class p95 budgets via tags:

  health       < 100ms
  authed_get   < 300ms
  authed_post  < 500ms
  global       p95 < 500ms, p99 < 2s

Application-level error rate (4xx + 5xx + check failures) must stay under
1%. k6 exits with a non-zero code otherwise (99 for crossed thresholds,
not 1), so it's CI-gateable.

Auth setup
----------
apps/api requires JWT on every /api/* route. setup() acquires a token once
before VUs start hammering and shares it for the run. Three sources are
tried in order:

  1. $MANA_API_TOKEN (CI passes a pre-minted token)
  2. log in with $TEST_EMAIL / $TEST_PASSWORD
  3. register a fresh account on the fly

Bails with a clear error message if all three fail.

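The three-source fallback described under "Auth setup" can be sketched in isolation. This is a minimal illustration in plain JavaScript, not the code in load-tests/api.js: `acquireToken` and the `sources` array are hypothetical names, and the arrow functions stand in for the env lookup and the two HTTP calls.

```javascript
// Hypothetical helper (not part of load-tests/api.js): try each token
// source in order and return the first non-empty token.
function acquireToken(sources) {
	const attempts = [];
	for (const { name, get } of sources) {
		let token;
		try {
			token = get();
		} catch (err) {
			// Remember why this source failed, then fall through to the next.
			attempts.push(`${name}: ${err.message}`);
			continue;
		}
		if (token) {
			return { token, source: name };
		}
		attempts.push(`${name}: no token`);
	}
	// All sources exhausted: report every attempt, not only the last one.
	throw new Error(`could not acquire test token (${attempts.join('; ')})`);
}

// Stand-ins for $MANA_API_TOKEN, the login call and the register call.
const result = acquireToken([
	{ name: 'env', get: () => undefined },
	{ name: 'login', get: () => { throw new Error('status 401'); } },
	{ name: 'register', get: () => 'fake-jwt' },
]);
console.log(result.source); // prints "register"
```

Collecting one message per attempt keeps the final error useful when both the login and the register step fail for different reasons.
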
Load profile
------------
4 minutes total: 30s warmup → 2m ramp to 50 VUs → 1m ramp to a 100-VU
peak → 30s cooldown. Override with --vus / --duration as usual.

Closes item #23 in docs/REFACTORING_AUDIT_2026_04.md.

Follow-ups not in this commit:
- Wire into .github/workflows/daily-tests.yml (requires standing up the
  apps/api stack in the runner; bigger lift, separate PR)
- Per-module thresholds once we have a few real runs and know where the
  natural baseline sits

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-05-14 16:41:08 +02:00 · 2026-04-09 12:27:54 +02:00 · 2026-04-09 12:27:54 +02:00 · 7f6b41654e
commit 7f6b41654e
parent 5052926481
2 changed files with 302 additions and 0 deletions
--- a/load-tests/README.md
+++ b/load-tests/README.md
@@ -18,9 +18,15 @@ brew install k6
 # Against the local environment
 k6 run load-tests/web-apps.js
 k6 run load-tests/auth-api.js
+k6 run load-tests/api.js
 k6 run load-tests/sync-websocket.js
 k6 run load-tests/llm-ollama.js

+# api.js needs a valid JWT: either via $MANA_API_TOKEN,
+# or it logs in with the TEST_EMAIL/TEST_PASSWORD env vars
+# (default: loadtest-api@mana.test / LoadTestApi123!).
+k6 run -e MANA_API_TOKEN=eyJhbGc... load-tests/api.js
+
 # Against production (careful!)
 k6 run -e BASE_URL=https://mana.how load-tests/web-apps.js

@@ -37,9 +43,24 @@ k6 run --out json=results.json load-tests/web-apps.js
 |--------|------|-------------|-------|
 | `web-apps.js` | SvelteKit frontends (HTML responses) | 10→50→10 | 5 min |
 | `auth-api.js` | Login, Register, Token Validation | 5→20→5 | 4 min |
+| `api.js` | Unified `apps/api` Hono server (16 modules): mixed workload with auth, compute & validation | 10→50→100→0 | 4 min |
 | `sync-websocket.js` | mana-sync WebSocket Connections | 10→30→10 | 5 min |
 | `llm-ollama.js` | Ollama Chat Completions | 1→3→1 | 3 min |

+### `api.js` Thresholds
+
+The script sets its own p95 budget per route class via `tags`:
+
+| Class | Endpoints | p95 budget |
+|--------|-----------|------------|
+| `health` | `GET /health` | < 100ms |
+| `authed_get` | `GET /api/v1/moodlit/presets`, `GET /api/v1/chat/models` | < 300ms |
+| `authed_post` | `POST /api/v1/calendar/events/expand`, `POST /api/v1/todo/compute/*` | < 500ms |
+| Global | all requests aggregated | p95 < 500ms, p99 < 2s |
+
+The application-level error rate (4xx + 5xx + check failures) must stay
+below 1%; otherwise k6 exits with a non-zero code and the CI build fails.
+
 ## Interpreting metrics

 | Metric | Good | Acceptable | Bad |
--- a/load-tests/api.js
+++ b/load-tests/api.js
@@ -0,0 +1,281 @@
+/* eslint-disable no-undef */
+/**
+ * Load test for `apps/api` — the unified Hono/Bun API server that hosts
+ * all 16 product compute modules (calendar, todo, chat, picture, planta,
+ * nutriphi, news, traces, moodlit, presi, music, contacts, storage,
+ * context, guides, research) on a single port.
+ *
+ * Why this script exists
+ * ----------------------
+ * The pre-launch consolidation collapsed 17+ per-product backends into
+ * one process. That makes apps/api the single point of failure for
+ * every authenticated module call the unified Mana web app makes.
+ * If a single Drizzle query is missing an index, or the auth middleware
+ * has a hot-path allocation, or the rate limiter contends on a shared
+ * map — every module degrades together. The other load-tests in this
+ * directory cover mana-auth, mana-sync, mana-llm, and the SvelteKit
+ * frontends, but apps/api itself was unmeasured. This is that missing
+ * piece.
+ *
+ * What it tests
+ * -------------
+ * A weighted mixed workload that exercises the full middleware stack
+ * (CORS → request logger → rate limit → auth → router → handler) plus
+ * a representative range of handler shapes:
+ *
+ *   - 25%  GET /health                            (no auth, baseline)
+ *   - 20%  GET /api/v1/moodlit/presets            (auth + in-memory return)
+ *   - 15%  GET /api/v1/chat/models                (auth + DB read of models)
+ *   - 20%  POST /api/v1/calendar/events/expand    (auth + Zod + RRULE compute)
+ *   - 12%  POST /api/v1/todo/compute/next-occurrence
+ *                                                 (auth + Zod + rrule lib)
+ *   -  8%  POST /api/v1/todo/compute/validate     (auth + Zod + rrule lib)
+ *
+ * No write endpoints are exercised — those would need cleanup and would
+ * conflate write-amplification load with API-server cost. The compute
+ * routes here all run in <50ms on a warm machine; what we're measuring
+ * is the overhead the unified server adds on top of pure handler work.
+ *
+ * Authentication
+ * --------------
+ * apps/api requires JWT auth on every /api/* route. setup() acquires a
+ * token once before the VUs start hammering and shares it for the
+ * duration of the run. Three sources, in order:
+ *
+ *   1. $MANA_API_TOKEN  — provide a pre-minted token (CI-friendly)
+ *   2. log in to the test account with $TEST_EMAIL / $TEST_PASSWORD
+ *   3. register a new account on the fly with the same credentials
+ *
+ * The script bails with a clear error if none of these work.
+ *
+ * Usage
+ * -----
+ *   # local
+ *   k6 run load-tests/api.js
+ *
+ *   # against staging
+ *   k6 run -e API_URL=https://api.mana.how -e AUTH_URL=https://auth.mana.how \
+ *          -e MANA_API_TOKEN=eyJhbGc... \
+ *          load-tests/api.js
+ *
+ *   # heavier
+ *   k6 run --vus 200 --duration 5m load-tests/api.js
+ *
+ *   # JSON output for Grafana
+ *   k6 run --out json=api-load.json load-tests/api.js
+ */
+
+import http from 'k6/http';
+import { check, sleep, group, fail } from 'k6';
+import { Rate, Trend } from 'k6/metrics';
+
+const errorRate = new Rate('errors');
+const authedLatency = new Trend('authed_request_duration');
+
+const API_URL = __ENV.API_URL || 'http://localhost:3060';
+const AUTH_URL = __ENV.AUTH_URL || 'http://localhost:3001';
+const TEST_EMAIL = __ENV.TEST_EMAIL || 'loadtest-api@mana.test';
+const TEST_PASSWORD = __ENV.TEST_PASSWORD || 'LoadTestApi123!';
+
+export const options = {
+	stages: [
+		{ duration: '30s', target: 10 }, // warmup: ramp 0 → 10 VUs
+		{ duration: '2m', target: 50 }, // ramp 10 → 50 VUs
+		{ duration: '1m', target: 100 }, // ramp 50 → 100 VUs (peak)
+		{ duration: '30s', target: 0 }, // cooldown: ramp down to 0
+	],
+	thresholds: {
+		// Overall: the fast health checks pull the latency distribution
+		// down, so the global p95 should sit below 500ms.
+		http_req_duration: ['p(95)<500', 'p(99)<2000'],
+
+		// Per-route-class budgets — these are the real signal.
+		'http_req_duration{kind:health}': ['p(95)<100'],
+		'http_req_duration{kind:authed_get}': ['p(95)<300'],
+		'http_req_duration{kind:authed_post}': ['p(95)<500'],
+
+		// Application-level error rate (4xx + 5xx + check failures).
+		errors: ['rate<0.01'],
+
+		// Setup requests must succeed. The abort itself happens in setup()
+		// via fail(); this threshold also fails the run if setup calls error.
+		'http_req_failed{kind:setup}': ['rate<0.01'],
+	},
+};
+
+/**
+ * Acquire a JWT for the load run. Runs once before any VU starts.
+ */
+export function setup() {
+	const envToken = __ENV.MANA_API_TOKEN;
+	if (envToken) {
+		console.log('[setup] using $MANA_API_TOKEN from env');
+		return { token: envToken };
+	}
+
+	// Try login first — works on subsequent runs after the first
+	// register has seeded the test account.
+	let res = http.post(
+		`${AUTH_URL}/api/v1/auth/login`,
+		JSON.stringify({ email: TEST_EMAIL, password: TEST_PASSWORD }),
+		{
+			headers: { 'Content-Type': 'application/json' },
+			tags: { kind: 'setup' },
+		}
+	);
+
+	if (res.status === 200) {
+		const token = res.json('accessToken');
+		if (token) {
+			console.log(`[setup] logged in as ${TEST_EMAIL}`);
+			return { token };
+		}
+	}
+
+	// Login failed — first run, register the account.
+	res = http.post(
+		`${AUTH_URL}/api/v1/auth/register`,
+		JSON.stringify({
+			email: TEST_EMAIL,
+			password: TEST_PASSWORD,
+			name: 'API Load Test',
+		}),
+		{
+			headers: { 'Content-Type': 'application/json' },
+			tags: { kind: 'setup' },
+		}
+	);
+
+	if (res.status === 200 || res.status === 201) {
+		const token = res.json('accessToken');
+		if (token) {
+			console.log(`[setup] registered new account ${TEST_EMAIL}`);
+			return { token };
+		}
+	}
+
+	// res now holds the register response, so label the status accordingly.
+	fail(`could not acquire test token: register=${res.status} body=${String(res.body).slice(0, 200)}`);
+}
+
+export default function (data) {
+	const headers = {
+		'Content-Type': 'application/json',
+		Authorization: `Bearer ${data.token}`,
+	};
+
+	const roll = Math.random();
+
+	if (roll < 0.25) {
+		// 25%: baseline. /health has no auth, no DB, no module, so it
+		// measures pure middleware cost (CORS + request logger + routing).
+		group('health', () => {
+			const res = http.get(`${API_URL}/health`, { tags: { kind: 'health' } });
+			const ok = check(res, {
+				'health 200': (r) => r.status === 200,
+			});
+			errorRate.add(!ok);
+		});
+	} else if (roll < 0.45) {
+		// 20% — Authed GET, in-memory response. Tests auth middleware
+		// overhead + JSON serialization on the hot path.
+		group('moodlit_presets', () => {
+			const res = http.get(`${API_URL}/api/v1/moodlit/presets`, {
+				headers,
+				tags: { kind: 'authed_get' },
+			});
+			authedLatency.add(res.timings.duration);
+			const ok = check(res, {
+				'presets 200': (r) => r.status === 200,
+				'presets is non-empty array': (r) => {
+					try {
+						const body = r.json();
+						return Array.isArray(body) && body.length > 0;
+					} catch {
+						return false;
+					}
+				},
+			});
+			errorRate.add(!ok);
+		});
+	} else if (roll < 0.6) {
+		// 15% — Authed GET, DB-backed read. The chat models endpoint
+		// returns the catalogue from postgres — exercises the connection
+		// pool and a small SELECT.
+		group('chat_models', () => {
+			const res = http.get(`${API_URL}/api/v1/chat/models`, {
+				headers,
+				tags: { kind: 'authed_get' },
+			});
+			authedLatency.add(res.timings.duration);
+			const ok = check(res, {
+				'models 200': (r) => r.status === 200,
+			});
+			errorRate.add(!ok);
+		});
+	} else if (roll < 0.8) {
+		// 20% — Authed POST, Zod validation, pure compute. The expand
+		// route walks the RRULE manually and builds an array of ISO
+		// timestamps; no DB, no I/O. This is what an authenticated POST
+		// to apps/api should cost in the ideal case.
+		group('calendar_expand', () => {
+			const res = http.post(
+				`${API_URL}/api/v1/calendar/events/expand`,
+				JSON.stringify({
+					rrule: 'FREQ=WEEKLY;BYDAY=MO,WE,FR',
+					dtstart: '2026-01-01T09:00:00Z',
+					until: '2026-04-01T00:00:00Z',
+				}),
+				{ headers, tags: { kind: 'authed_post' } }
+			);
+			authedLatency.add(res.timings.duration);
+			const ok = check(res, {
+				'expand 200': (r) => r.status === 200,
+				'expand returns occurrences': (r) => {
+					try {
+						return Array.isArray(r.json('occurrences'));
+					} catch {
+						return false;
+					}
+				},
+			});
+			errorRate.add(!ok);
+		});
+	} else if (roll < 0.92) {
+		// 12% — Same shape as expand but uses the rrule library instead
+		// of the hand-rolled walker. Catches the case where the rrule
+		// dependency is the bottleneck rather than our own code.
+		group('todo_next_occurrence', () => {
+			const res = http.post(
+				`${API_URL}/api/v1/todo/compute/next-occurrence`,
+				JSON.stringify({
+					rrule: 'FREQ=DAILY;COUNT=30',
+					after: '2026-04-09T00:00:00Z',
+				}),
+				{ headers, tags: { kind: 'authed_post' } }
+			);
+			authedLatency.add(res.timings.duration);
+			const ok = check(res, {
+				'next-occurrence 200': (r) => r.status === 200,
+			});
+			errorRate.add(!ok);
+		});
+	} else {
+		// 8% — Tiny compute path that mostly exercises the validation
+		// branch and Zod schema parsing.
+		group('todo_validate_rrule', () => {
+			const res = http.post(
+				`${API_URL}/api/v1/todo/compute/validate`,
+				JSON.stringify({ rrule: 'FREQ=MONTHLY;BYMONTHDAY=15' }),
+				{ headers, tags: { kind: 'authed_post' } }
+			);
+			authedLatency.add(res.timings.duration);
+			const ok = check(res, {
+				'validate 200': (r) => r.status === 200,
+			});
+			errorRate.add(!ok);
+		});
+	}
+
+	// Sleep 0.5–2s between iterations to keep the VU count translatable
+	// to "concurrent users" rather than "max requests/sec".
+	sleep(Math.random() * 1.5 + 0.5);
+}
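
The `roll`-based if/else chain in the default function is a cumulative-weight lookup. A minimal sketch of that idea as a table, assuming the weights from the header comment; `ROUTES` and `pickRoute` are hypothetical names and not part of api.js:

```javascript
// Hypothetical weight table mirroring the documented route mix (sums to 100).
const ROUTES = [
	{ name: 'health', weight: 25 },
	{ name: 'moodlit_presets', weight: 20 },
	{ name: 'chat_models', weight: 15 },
	{ name: 'calendar_expand', weight: 20 },
	{ name: 'todo_next_occurrence', weight: 12 },
	{ name: 'todo_validate_rrule', weight: 8 },
];

// Map a roll in [0, 1) to a route name by walking the cumulative weights.
function pickRoute(roll, routes = ROUTES) {
	const total = routes.reduce((sum, r) => sum + r.weight, 0);
	let cumulative = 0;
	for (const route of routes) {
		cumulative += route.weight;
		if (roll * total < cumulative) {
			return route.name;
		}
	}
	// Only reachable if roll >= 1; fall back to the last route.
	return routes[routes.length - 1].name;
}

console.log(pickRoute(0.1)); // prints "health"
console.log(pickRoute(0.95)); // prints "todo_validate_rrule"
```

Inside a k6 iteration the roll would come from `Math.random()`, as in the script. A table like this keeps the percentages in one place, so changing the mix cannot leave the hand-written thresholds (0.25, 0.45, 0.6, ...) out of sync.
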