Switch `make serve` default to Qwen3.6 UD Q6_K_XL (32 GB, hybrid
DeltaNet, near-lossless) and register it in the model catalog. Add
--jinja to the llama-server launcher so tool/function calling works —
without it clients silently ignore tool definitions advertised by the
server.
- Fix missing BATCH_ARGS in long-context commands (both benchmark scripts)
- Fix CLAUDE.md stale venv path (data/venv → .venv) and add serve/power docs
- Add -b/--batch to bin/benchmark help text
- Add --no-think flag to serve script (--reasoning-budget 0)
- Sanitize model names in eval run directories
- Simplify agentic setup to use requirements.txt
- Add serve --help test, batch flag assertions to existing tests
- Add requirements.txt for reproducible venv setup (Python 3.13)