strix-halo-optimizations

Felipe Cardoso f92b710492 fix(benchmark): parse llama-bench output with variable column count

KV cache quantization adds type_k/type_v columns to llama-bench output,
shifting test and t/s to different indices. Parse from end of row instead
of hardcoded positions. Also fix KV suffix separator (underscore to dash)
to avoid regex ambiguity with type names like q8_0.
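
The end-of-row parsing described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: it assumes llama-bench's markdown table output, where `test` and `t/s` are the last two columns, and the function name `parse_bench_row` and the sample rows are hypothetical placeholders rather than real measurements.

```python
def parse_bench_row(line):
    """Return (test, tokens_per_sec) from one markdown table row, or None.

    Hypothetical helper: reads the last two cells so that extra columns
    (such as type_k / type_v) appearing earlier in the row do not matter.
    """
    line = line.strip()
    if not line.startswith("|"):
        return None  # not a table row
    cells = [c.strip() for c in line.strip("|").split("|")]
    if len(cells) < 2:
        return None
    test, tps = cells[-2], cells[-1]
    # Skip the header row and the "---:" separator row.
    if test == "test" or set(tps) <= set("-: "):
        return None
    try:
        # The t/s cell is assumed to look like "100.50 ± 1.20"; keep the mean.
        value = float(tps.split("±")[0].strip())
    except ValueError:
        return None
    return test, value


# Illustrative rows (made-up values), without and with KV cache type columns.
row_plain = "| example 7B Q4_0 | 3.56 GiB | 6.74 B | Vulkan | 99 | pp512 | 100.50 ± 1.20 |"
row_kv = "| example 7B Q4_0 | 3.56 GiB | 6.74 B | Vulkan | 99 | q8_0 | q8_0 | tg128 | 20.25 ± 0.10 |"
header = "| model | size | params | backend | ngl | test | t/s |"
separator = "| --- | ---: | ---: | --- | --: | ---: | ---: |"

results = [parse_bench_row(r) for r in (header, separator, row_plain, row_kv)]
```

Indexing from the end keeps the parser independent of how many option columns a given run adds, which is the property the commit message relies on.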

Add 5-phase optimization guide, optimization log for tracking results,
and research docs on llama.cpp and inference landscape optimizations.

2026-03-27 14:54:19 +01:00

agentic

feat: add Qwen3.5 model catalog and agentic evaluation framework

2026-03-26 00:20:23 +01:00

audit

Initial commit

2026-03-25 20:13:15 +01:00

benchmark

fix(benchmark): parse llama-bench output with variable column count

2026-03-27 14:54:19 +01:00

monitor

fix: address code review findings (HIGH + MEDIUM)

2026-03-25 20:19:44 +01:00

optimize

fix: address code review findings (HIGH + MEDIUM)

2026-03-25 20:19:44 +01:00