strix-halo-optimizations

Files

Felipe Cardoso 6ab08537ca fix: address code review findings — batch args, venv path, serve flags

- Fix missing BATCH_ARGS in long-context commands (both benchmark scripts)
- Fix CLAUDE.md stale venv path (data/venv → .venv) and add serve/power docs
- Add -b/--batch to bin/benchmark help text
- Add --no-think flag to serve script (--reasoning-budget 0)
- Sanitize model names in eval run directories
- Simplify agentic setup to use requirements.txt
- Add serve --help test, batch flag assertions to existing tests
- Add requirements.txt for reproducible venv setup (Python 3.13)

2026-03-31 10:10:48 +02:00

agentic

feat: add Qwen3.5 model catalog and agentic evaluation framework

2026-03-26 00:20:23 +01:00

audit

Initial commit

2026-03-25 20:13:15 +01:00

benchmark

fix: address code review findings — batch args, venv path, serve flags

2026-03-31 10:10:48 +02:00

monitor

Initial commit

2026-03-25 20:13:15 +01:00

optimize

feat(optimize): add Phase 2 power profile and system tuning

2026-03-30 18:53:52 +02:00

serve

feat(serve): add optimized llama-server launcher with n-gram speculation

2026-03-30 21:12:30 +02:00