strix-halo-optimizations/scripts/benchmark/run-suite.sh at d22c062ca72a04c64468cec95cb209d0220fc70f


Felipe Cardoso cb25fa3f6f feat: add benchmark filtering (--max-size, --category, --skip-longctx)

Both run-baseline.sh and run-suite.sh now support:
- --max-size GB: skip models larger than N GB (prevents OOM)
- --category LIST: filter by catalog category (smoke,dense,moe)
- --skip-longctx: skip 32K context tests (saves time + memory)
- --reps N: configure repetition count
- --help: shows usage with examples
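
A minimal sketch of how flags like these could be parsed and applied in bash. This is not the code from run-suite.sh: the variable names, the `parse_flags` and `model_selected` helpers, the default repetition count, and the use of whole-GB integer sizes are all assumptions made for illustration.

```shell
#!/usr/bin/env bash
# Hypothetical sketch; names and defaults are assumptions, not taken from run-suite.sh.

MAX_SIZE_GB=0        # 0 means "no size limit"
CATEGORIES=""        # empty means "all categories"
SKIP_LONGCTX=false
REPS=3               # assumed default; the commit does not state the real one

parse_flags() {
    while [ "$#" -gt 0 ]; do
        case "$1" in
            --max-size|--category|--reps)
                # These flags take a value; fail early if it is missing.
                [ "$#" -ge 2 ] || { echo "missing value for $1" >&2; return 1; }
                case "$1" in
                    --max-size) MAX_SIZE_GB="$2" ;;
                    --category) CATEGORIES="$2" ;;
                    --reps)     REPS="$2" ;;
                esac
                shift 2 ;;
            --skip-longctx)
                SKIP_LONGCTX=true
                shift ;;
            --help)
                echo "usage: [--max-size GB] [--category LIST] [--skip-longctx] [--reps N]"
                return 0 ;;
            *)
                echo "unknown flag: $1" >&2
                return 1 ;;
        esac
    done
}

# Succeeds (exit status 0) when a model passes both the size and category filters.
# Arguments: model size in whole GB, catalog category.
model_selected() {
    local size_gb="$1" category="$2"
    if [ "$MAX_SIZE_GB" -gt 0 ] && [ "$size_gb" -gt "$MAX_SIZE_GB" ]; then
        return 1
    fi
    if [ -n "$CATEGORIES" ]; then
        # Wrap in commas so "moe" matches the list "smoke,moe" only as a whole entry.
        case ",$CATEGORIES," in
            *",$category,"*) ;;
            *) return 1 ;;
        esac
    fi
    return 0
}
```

With this sketch, calling `parse_flags --max-size 20 --category smoke,moe` and then `model_selected 40 smoke` would reject the model on size, while `model_selected 8 smoke` would keep it.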

Safe pre-optimization run: benchmark baseline --max-size 20 --skip-longctx
Full post-optimization run: benchmark baseline (no filters, all models + longctx)

Also: 4 new BATS tests for flag parsing (98 total, all passing)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-03-26 19:07:24 +01:00

8.1 KiB
