Go to file

Felipe Cardoso 5b81437637 docs: add README, CLAUDE.md, AGENTS.md, and full docs/ suite

- README.md: project overview, quick start, command reference, workflow
- CLAUDE.md: AI safety rules, technical details, conventions
- AGENTS.md: agent workflows, file responsibility map, dependency matrix
- docs/architecture.md: script layers, data flow, unified memory, JSON schemas
- docs/optimization.md: step-by-step optimization walkthrough
- docs/benchmarking.md: methodology, test params, result interpretation
- docs/troubleshooting.md: common issues and fixes
- docs/references.md: centralized external links (single source of truth)
- docs/bios-vram-guide.md: add back-link to optimization workflow

Cross-linked non-redundantly: each doc owns one layer, others link to it.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-03-25 20:50:00 +01:00

.idea

Initial commit

2026-03-25 20:13:15 +01:00

bin

Initial commit

2026-03-25 20:13:15 +01:00

configs

Initial commit

2026-03-25 20:13:15 +01:00

docs

docs: add README, CLAUDE.md, AGENTS.md, and full docs/ suite

2026-03-25 20:50:00 +01:00

lib

fix: address code review findings (HIGH + MEDIUM)

2026-03-25 20:19:44 +01:00

scripts

fix: address code review findings (HIGH + MEDIUM)

2026-03-25 20:19:44 +01:00

.gitignore

Initial commit

2026-03-25 20:13:15 +01:00

AGENTS.md

docs: add README, CLAUDE.md, AGENTS.md, and full docs/ suite

2026-03-25 20:50:00 +01:00

CLAUDE.md

docs: add README, CLAUDE.md, AGENTS.md, and full docs/ suite

2026-03-25 20:50:00 +01:00

Makefile

Initial commit

2026-03-25 20:13:15 +01:00

README.md

docs: add README, CLAUDE.md, AGENTS.md, and full docs/ suite

2026-03-25 20:50:00 +01:00

README.md

Strix Halo Optimization Toolkit

Audit, monitor, benchmark, and optimize AMD Strix Halo integrated GPU systems for LLM inference workloads.

Target hardware: AMD Ryzen AI MAX+ 395 / Radeon 8060S (gfx1151) with 64 GB unified memory, on Fedora 43. Tested on HP ZBook Ultra G1a.

Quick Start

make audit                # See current system status and optimization score
make monitor-install      # Install amdgpu_top + btop
make benchmark-setup      # Create toolbox containers + download test model
make benchmark-baseline   # Capture performance before optimization

System Status

make audit produces a single-screen overview:

=== Memory Allocation ===
  [!!] VRAM (dedicated)               32.0 GiB — should be 0.5 GiB in BIOS
  [!!] GTT (dynamic)                  15.5 GiB — should be ~59.0 GiB with kernel params

=== Kernel Boot Parameters ===
  [!!] iommu=pt                       MISSING
  [!!] amdgpu.gttsize                 MISSING — recommended: 60416
  [!!] ttm.pages_limit                MISSING — recommended: 15466496

=== Performance Profile ===
  [!!] Tuned profile                  throughput-performance — recommended: accelerator-performance

=== Optimization Score ===
  2 / 8 checks passing

Each [!!] is an optimization opportunity. Run make optimize to address them.

Commands

Command	Description
`make audit`	Quick system status (single screen)
`make audit-full`	Full system report (saved to data/audits/)
`make monitor`	Launch tmux monitoring dashboard
`make monitor-simple`	Launch amdgpu_top only
`make monitor-install`	Install monitoring tools (amdgpu_top, btop)
`make monitor-log`	Start background CSV metric logger
`make benchmark-setup`	Ensure toolboxes and test models are ready
`make benchmark-baseline`	Capture pre-optimization baseline
`make benchmark`	Run full benchmark suite
`make benchmark-compare`	Compare two runs (`BEFORE=dir AFTER=dir`)
`sudo make optimize`	Interactive optimization walkthrough
`sudo make optimize-kernel`	Configure kernel boot parameters
`sudo make optimize-tuned`	Switch to accelerator-performance profile
`make optimize-vram`	BIOS VRAM guidance + GTT verification
`make verify`	Post-optimization verification checklist
`sudo make rollback`	Rollback optimizations

Optimization Workflow

1. Audit          make audit
      │
2. Monitor        make monitor-install && make monitor
      │
3. Baseline       make benchmark-setup && make benchmark-baseline
      │
4. Optimize       sudo make optimize
      │               ├── tuned profile  (instant, +5-8% pp)
      │               ├── kernel params  (reboot required)
      │               └── BIOS VRAM      (reboot + BIOS access)
      │
5. Verify         make verify
      │
6. Re-benchmark   make benchmark && make benchmark-compare BEFORE=... AFTER=...

See docs/optimization.md for the full walkthrough with explanations.

Project Structure

bin/            Entry points (audit, monitor, benchmark, optimize)
lib/            Shared bash libraries (common, detect, format)
scripts/        Implementation organized by function
configs/        Reference configuration templates
data/           Runtime output: audits, benchmarks, logs, backups (gitignored)
docs/           Technical documentation

See docs/architecture.md for the full architecture, data flow, and JSON schemas.

Requirements

OS: Fedora 43 (tested), Fedora 42+ should work
Hardware: AMD Strix Halo (Ryzen AI MAX / MAX+) with RDNA 3.5 iGPU
Tools: bc, python3, tmux, podman, toolbox
Optional: amdgpu_top (installed via make monitor-install), huggingface-cli (for model downloads)

Documentation

Document	Contents
docs/architecture.md	Script layers, data flow, unified memory model, JSON schemas
docs/optimization.md	Step-by-step optimization walkthrough
docs/benchmarking.md	Benchmark methodology, test params, result interpretation
docs/bios-vram-guide.md	HP ZBook BIOS configuration for VRAM
docs/troubleshooting.md	Common issues and fixes
docs/references.md	External links: AMD docs, toolboxes, community resources

Contributing

AI assistants: see CLAUDE.md for safety rules and technical context. Agent workflows are in AGENTS.md.