Benchmarks
This section is the public benchmark index for YAMS.
Each page should answer three things:
- what workload or dataset was measured,
- what the latest published results are,
- how to run the benchmark locally.
Benchmark Docs
- Performance Report - canonical ingestion, metadata, IPC, and multi-client baseline tables plus run commands.
- LongMemEval_S Retrieval Quality Baseline - dataset statistics, retrieval-quality baselines, and the benchmark command.
- Storage Backends Benchmark - local vs R2 CLI CRUD and multi-client benchmark results.
- Multi-Client Optimization Loop - throughput/stability runbook plus summary and regression commands.
- Retrieval Precision Optimization - ranking-quality tuning loop and summary workflow.
Scope
- Public benchmark docs focus on benchmark data, workload context, and reproducible commands.
- Internal harness plumbing and generated artifacts are intentionally omitted from public docs.
- `tested` means the path is validated in this repository.
- `supported` means the path exists but is not currently part of automated validation here.
When in doubt, treat tested paths as the default reference point.