syndarix/docs/architecture/IMPLEMENTATION_ROADMAP.md at 88cf4e0abc336e24c68733919059882048752b3f

Felipe Cardoso 88cf4e0abc feat: Update to production model stack and fix remaining inconsistencies

## Model Stack Updates (User's Actual Models)

Updated all documentation to reflect production models:
- Claude Opus 4.5 (primary reasoning)
- GPT 5.1 Codex max (code generation specialist)
- Gemini 3 Pro/Flash (multimodal, fast inference)
- Qwen3-235B (cost-effective, self-hostable)
- DeepSeek V3.2 (self-hosted, open weights)

### Files Updated:
- ADR-004: Full model groups, failover chains, cost tables
- ADR-007: Code example with correct model identifiers
- ADR-012: Cost tracking with new model prices
- ARCHITECTURE.md: Model groups, failover diagram
- IMPLEMENTATION_ROADMAP.md: External services list

## Architecture Diagram Updates

- Added LangGraph Runtime to orchestration layer
- Added technology labels (Type-Instance, transitions)

## Self-Hostability Table Expanded

Added entries for:
- LangGraph (MIT)
- transitions (MIT)
- DeepSeek V3.2 (MIT)
- Qwen3-235B (Apache 2.0)

## Metric Alignments

- Response time: Split into API (<200ms at P95) and Agent (<10s for simple tasks, <60s for code tasks)
- Cost per project: Adjusted to <$100/sprint to reflect Opus 4.5 pricing
- Added concurrent projects (10+) and agents (50+) metrics
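
The per-sprint cost target lends itself to a simple guard in front of each LLM call. The following is a minimal sketch, assuming costs are tracked in USD per sprint; the `SprintBudget` class, its method names, and the `choose_tier` helper are hypothetical and do not come from the roadmap or its ADRs. Only the $100 limit is taken from the metric above.

```python
from dataclasses import dataclass
from decimal import Decimal


@dataclass
class SprintBudget:
    """Hypothetical per-sprint cost tracker (illustrative names only)."""

    limit_usd: Decimal = Decimal("100")  # from the <$100/sprint target
    spent_usd: Decimal = Decimal("0")

    def record(self, cost_usd: Decimal) -> None:
        """Add the cost of a completed call to the running total."""
        if cost_usd < 0:
            raise ValueError("cost must be non-negative")
        self.spent_usd += cost_usd

    @property
    def remaining_usd(self) -> Decimal:
        return self.limit_usd - self.spent_usd

    def can_afford(self, estimated_cost_usd: Decimal) -> bool:
        """True if the estimated call still fits inside the sprint limit."""
        return estimated_cost_usd <= self.remaining_usd


def choose_tier(budget: SprintBudget, estimated_cost_usd: Decimal) -> str:
    """Assumed policy: use a hosted model while the budget allows it,
    otherwise fall back to a self-hosted one."""
    return "hosted" if budget.can_afford(estimated_cost_usd) else "self_hosted"
```

`Decimal` is used instead of `float` so that many small per-call costs sum without rounding drift.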

## Infrastructure Updates

- Celery workers: 4-8 instances (was 2-4) across 4 queues
- MCP servers: Clarified Phase 2 + Phase 5 deployment
- Sync interval: Clarified 60s fallback + 15min reconciliation
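
The two sync intervals could be expressed as periodic jobs. The sketch below uses the dictionary shape that Celery's `beat_schedule` setting accepts (`task`, `schedule`, `options`), written as plain Python so it runs without Celery installed. The entry names, task paths, and the `sync` queue name are assumptions for illustration; the source gives only the 60s and 15min intervals and does not name the four queues.

```python
from datetime import timedelta

# Hypothetical schedule entries; only the two intervals come from the roadmap.
BEAT_SCHEDULE = {
    "sync-fallback-poll": {
        "task": "app.tasks.sync.poll_changes",    # assumed task path
        "schedule": timedelta(seconds=60),        # 60s fallback
        "options": {"queue": "sync"},             # assumed queue name
    },
    "sync-full-reconciliation": {
        "task": "app.tasks.sync.reconcile_all",   # assumed task path
        "schedule": timedelta(minutes=15),        # 15min reconciliation
        "options": {"queue": "sync"},             # assumed queue name
    },
}

# Interval in seconds for each entry, handy for a quick sanity check.
INTERVALS_S = {
    name: entry["schedule"].total_seconds()
    for name, entry in BEAT_SCHEDULE.items()
}
```

In a Celery application this dictionary would typically be assigned to `app.conf.beat_schedule`.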

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

| Risk | Impact | Probability | Mitigation |
|------|--------|-------------|------------|
| LLM API outages | High | Medium | Multi-provider failover |
| Cost overruns | High | Medium | Budget enforcement, local models |
| Agent hallucinations | High | Medium | Approval gates, code review |
| Performance bottlenecks | Medium | Medium | Load testing, caching |
| Integration failures | Medium | Low | Contract testing, mocks |

| Metric | Target | Measurement |
|--------|--------|-------------|
| Agent task success rate | >90% | Completed tasks / total tasks |
| API response time (P95) | <200ms | Pure API latency (per NFR-101) |
| Agent response time | <10s simple, <60s code | End-to-end including LLM (per NFR-103) |
| Cost per project | <$100/sprint | LLM + compute costs (with Opus 4.5 pricing) |
| Time to first commit | <1 hour | From requirements to PR |
| Client satisfaction | >4/5 | Post-sprint survey |
| Concurrent projects | 10+ | Active projects in parallel |
| Concurrent agents | 50+ | Agent instances running |
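
Two of these targets reduce to small calculations. The sketch below shows one way to evaluate the task success rate and the P95 API latency against their thresholds. The function names are hypothetical, and the nearest-rank percentile method is an assumption, since the roadmap does not say how P95 is computed.

```python
from typing import Sequence


def success_rate(completed: int, total: int) -> float:
    """Completed tasks / total tasks, as defined in the metrics table."""
    if total <= 0:
        raise ValueError("total must be positive")
    return completed / total


def p95(samples_ms: Sequence[float]) -> float:
    """95th percentile using the nearest-rank method (an assumed choice)."""
    if not samples_ms:
        raise ValueError("need at least one sample")
    ordered = sorted(samples_ms)
    # ceil(0.95 * n) in integer arithmetic to avoid float rounding.
    rank = (95 * len(ordered) + 99) // 100
    return ordered[rank - 1]


def meets_targets(completed: int, total: int,
                  api_latencies_ms: Sequence[float]) -> dict[str, bool]:
    """Compare measured values with the >90% and <200ms targets."""
    return {
        "task_success_rate": success_rate(completed, total) > 0.90,
        "api_p95": p95(api_latencies_ms) < 200.0,
    }
```

With 20 latency samples the nearest-rank P95 is the 19th smallest value, so a single slow outlier does not fail the target.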


Syndarix Implementation Roadmap

Executive Summary

Phase 0: Foundation (Weeks 1-2)

0.1 Repository Setup

0.2 Core Infrastructure

Deliverables

Phase 1: Core Platform (Weeks 3-6)

1.1 Data Model

1.2 API Layer

1.3 Real-time Infrastructure

1.4 Frontend Foundation

Deliverables

Phase 2: MCP Integration (Weeks 7-10)

2.1 MCP Client Infrastructure

2.2 LLM Gateway MCP (Priority 1)

2.3 Knowledge Base MCP (Priority 2)

2.4 Git MCP (Priority 3)

2.5 Issues MCP (Priority 4)

Deliverables

Phase 3: Agent Orchestration (Weeks 11-14)

3.1 Agent Runner

3.2 Agent Orchestrator

3.3 Inter-Agent Communication

3.4 Background Task Integration

Deliverables

Phase 4: Workflow Engine (Weeks 15-18)

4.1 State Machine Foundation

4.2 Core Workflows

4.3 Approval Gates

4.4 Autonomy Levels

Deliverables

Phase 5: Advanced Features (Weeks 19-22)

5.1 Cost Management

5.2 Audit & Compliance

5.3 Human-Agent Collaboration

5.4 Additional MCP Servers

Deliverables

Phase 6: Polish & Launch (Weeks 23-24)

6.1 Performance Optimization

6.2 Security Hardening

6.3 Documentation

6.4 Deployment

Risk Register

Success Metrics

Dependencies

Resource Requirements

Development Team

Infrastructure

External Services
