Dimension · score weight 0%

Response Latency & Throughput

What this dimension detects

Time to first token (TTFT) and throughput can reveal routing or cache anomalies, but both depend too heavily on the environment to be scored in the current model.

Algorithm

Collect request latency, time to first token where available, and tokens per second across probes. Compare the distribution with coarse expected ranges and display deviations as diagnostic context.
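The collection step can be sketched as follows. This is a minimal illustration, not the contents of lib/fingerprints/latency.ts: the `ProbeTiming` and `TimingSummary` shapes, the `summarizeTimings` name, the use of medians, and the choice to measure throughput over the window after the first token are all assumptions made for the example.

```typescript
// Hypothetical shape of one probe measurement; field names are illustrative.
interface ProbeTiming {
  totalMs: number;      // full request latency in milliseconds
  ttftMs?: number;      // time to first token, present only when measurable
  outputTokens: number; // tokens generated in the response
}

interface TimingSummary {
  medianTotalMs: number;
  medianTtftMs: number | null; // null when no probe reported TTFT
  medianTokensPerSec: number;
}

// Median of a list of numbers; returns NaN for an empty list.
function median(values: number[]): number {
  if (values.length === 0) return NaN;
  const sorted = values.slice().sort((a, b) => a - b);
  const mid = Math.floor(sorted.length / 2);
  return sorted.length % 2 === 1
    ? sorted[mid]!
    : (sorted[mid - 1]! + sorted[mid]!) / 2;
}

function summarizeTimings(probes: ProbeTiming[]): TimingSummary {
  // Keep only the probes where TTFT was available.
  const ttfts = probes
    .map((p) => p.ttftMs)
    .filter((v): v is number => v !== undefined);

  // Assumption: throughput is measured over the generation window,
  // i.e. total latency minus TTFT when TTFT is known.
  const tokensPerSec = probes.map((p) => {
    const generationMs = p.totalMs - (p.ttftMs ?? 0);
    return generationMs > 0 ? (p.outputTokens * 1000) / generationMs : 0;
  });

  return {
    medianTotalMs: median(probes.map((p) => p.totalMs)),
    medianTtftMs: ttfts.length > 0 ? median(ttfts) : null,
    medianTokensPerSec: median(tokensPerSec),
  };
}
```

Medians are used here because a single slow probe (a cold cache, a queueing spike) would otherwise skew a mean; a real implementation might report percentiles instead.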

Thresholds

Condition	Verdict contribution
Within coarse expected range	Diagnostic match
Large deviation or unstable distribution	Diagnostic anomaly
Any result	Score contribution remains 0
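One way to express the table above in code is sketched below. The `classifyLatency` name, the `ExpectedRange` shape, and the coefficient-of-variation cutoff `maxCv` are hypothetical; the source does not state how "large deviation" or "unstable" is measured, so the numeric limits here are placeholders only.

```typescript
type Verdict = "diagnostic-match" | "diagnostic-anomaly";

// Hypothetical coarse expected range for request latency.
interface ExpectedRange {
  minMs: number;
  maxMs: number;
}

interface LatencyDiagnostic {
  verdict: Verdict;
  scoreContribution: 0; // fixed at 0: this dimension never affects the score
}

function classifyLatency(
  samplesMs: number[],
  expected: ExpectedRange,
  maxCv = 0.5, // assumed instability cutoff, not taken from the source
): LatencyDiagnostic {
  if (samplesMs.length === 0) {
    throw new RangeError("at least one latency sample is required");
  }
  const n = samplesMs.length;
  const mean = samplesMs.reduce((sum, x) => sum + x, 0) / n;
  const variance =
    samplesMs.reduce((sum, x) => sum + (x - mean) ** 2, 0) / n;

  // Coefficient of variation as a simple measure of spread.
  const cv = mean > 0 ? Math.sqrt(variance) / mean : 0;

  const outOfRange = mean < expected.minMs || mean > expected.maxMs;
  const unstable = cv > maxCv;

  return {
    verdict: outOfRange || unstable ? "diagnostic-anomaly" : "diagnostic-match",
    scoreContribution: 0,
  };
}
```

Typing `scoreContribution` as the literal `0` makes the "Any result" row a compile-time guarantee: no code path can return a nonzero weight without changing the type.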

Limitations

Latency is dominated by geography, provider load, queueing, gateway buffering, client network conditions, and cache state. It can corroborate other evidence but should not decide model identity on its own.

References

TrueLLMs lib/fingerprints/latency.ts
