forked from cardosofelipe/pragma-stack

Compare commits: d0fc7f37ff ... cd7a9ccbdf (4 commits: cd7a9ccbdf, 953af52d0e, e6e98d4ed1, ca5f5e3383)

CLAUDE.md: 31 changed lines
CLAUDE.md:

@@ -83,6 +83,37 @@ docs/
 3. **Testing Required**: All code must be tested, aim for >90% coverage
 4. **Code Review**: Must pass multi-agent review before merge
 5. **No Direct Commits**: Never commit directly to `main` or `dev`
+6. **Stack Verification**: ALWAYS run the full stack before considering work done (see below)
+
+### CRITICAL: Stack Verification Before Merge
+
+**This is NON-NEGOTIABLE. A feature with 100% test coverage that crashes on startup is WORTHLESS.**
+
+Before considering ANY issue complete:
+
+```bash
+# 1. Start the dev stack
+make dev
+
+# 2. Wait for backend to be healthy, check logs
+docker compose -f docker-compose.dev.yml logs backend --tail=100
+
+# 3. Start frontend
+cd frontend && npm run dev
+
+# 4. Verify both are running without errors
+```
+
+**The issue is NOT done if:**
+- Backend crashes on startup (import errors, missing dependencies)
+- Frontend fails to compile or render
+- Health checks fail
+- Any error appears in logs
+
+**Why this matters:**
+- Tests run in isolation and may pass despite broken imports
+- Docker builds cache layers and may hide dependency issues
+- A single `ModuleNotFoundError` renders all test coverage meaningless
+
 ### Common Commands
Dockerfile:

@@ -7,7 +7,10 @@ ENV PYTHONDONTWRITEBYTECODE=1 \
     PYTHONPATH=/app \
     UV_COMPILE_BYTECODE=1 \
     UV_LINK_MODE=copy \
-    UV_NO_CACHE=1
+    UV_NO_CACHE=1 \
+    UV_PROJECT_ENVIRONMENT=/opt/venv \
+    VIRTUAL_ENV=/opt/venv \
+    PATH="/opt/venv/bin:$PATH"
 
 # Install system dependencies and uv
 RUN apt-get update && \
@@ -20,7 +23,7 @@ RUN apt-get update && \
 # Copy dependency files
 COPY pyproject.toml uv.lock ./
 
-# Install dependencies using uv (development mode with dev dependencies)
+# Install dependencies using uv into /opt/venv (outside /app to survive bind mounts)
 RUN uv sync --extra dev --frozen
 
 # Copy application code
@@ -45,7 +48,10 @@ ENV PYTHONDONTWRITEBYTECODE=1 \
     PYTHONPATH=/app \
     UV_COMPILE_BYTECODE=1 \
     UV_LINK_MODE=copy \
-    UV_NO_CACHE=1
+    UV_NO_CACHE=1 \
+    UV_PROJECT_ENVIRONMENT=/opt/venv \
+    VIRTUAL_ENV=/opt/venv \
+    PATH="/opt/venv/bin:$PATH"
 
 # Install system dependencies and uv
 RUN apt-get update && \
@@ -58,7 +64,7 @@ RUN apt-get update && \
 # Copy dependency files
 COPY pyproject.toml uv.lock ./
 
-# Install only production dependencies using uv (no dev dependencies)
+# Install only production dependencies using uv into /opt/venv
 RUN uv sync --frozen --no-dev
 
 # Copy application code
@@ -67,7 +73,7 @@ COPY entrypoint.sh /usr/local/bin/
 RUN chmod +x /usr/local/bin/entrypoint.sh
 
 # Set ownership to non-root user
-RUN chown -R appuser:appuser /app
+RUN chown -R appuser:appuser /app /opt/venv
 
 # Switch to non-root user
 USER appuser
@@ -77,4 +83,4 @@ HEALTHCHECK --interval=30s --timeout=10s --start-period=40s --retries=3 \
     CMD curl -f http://localhost:8000/health || exit 1
 
 ENTRYPOINT ["/usr/local/bin/entrypoint.sh"]
-CMD ["uv", "run", "uvicorn", "app.main:app", "--host", "0.0.0.0", "--port", "8000"]
+CMD ["uvicorn", "app.main:app", "--host", "0.0.0.0", "--port", "8000"]
entrypoint.sh:

@@ -1,11 +1,11 @@
 #!/bin/bash
 set -e
 
-# Ensure the project's virtualenv binaries are on PATH so commands like
-# 'uvicorn' work even when not prefixed by 'uv run'. This matches how uv
-# installs the env into /app/.venv in our containers.
-if [ -d "/app/.venv/bin" ]; then
-    export PATH="/app/.venv/bin:$PATH"
+# Ensure the virtualenv binaries are on PATH. Dependencies are installed
+# to /opt/venv (not /app/.venv) to survive bind mounts in development.
+if [ -d "/opt/venv/bin" ]; then
+    export PATH="/opt/venv/bin:$PATH"
+    export VIRTUAL_ENV="/opt/venv"
 fi
 
 # Only the backend service should run migrations and init_db
docker-compose.dev.yml:

@@ -40,8 +40,7 @@ services:
     volumes:
       - ./backend:/app
       - ./uploads:/app/uploads
-      # Exclude local .venv from bind mount to use container's .venv
-      - /app/.venv
+      # Note: venv is at /opt/venv (not /app/.venv) so bind mount doesn't affect it
     ports:
       - "8000:8000"
     env_file:
@@ -76,7 +75,6 @@ services:
       target: development
     volumes:
       - ./backend:/app
-      - /app/.venv
     env_file:
       - .env
     environment:
@@ -99,7 +97,6 @@ services:
       target: development
     volumes:
       - ./backend:/app
-      - /app/.venv
    env_file:
       - .env
     environment:
@@ -122,7 +119,6 @@ services:
       target: development
     volumes:
       - ./backend:/app
-      - /app/.venv
     env_file:
       - .env
     environment:
@@ -145,7 +141,6 @@ services:
       target: development
     volumes:
       - ./backend:/app
-      - /app/.venv
     env_file:
       - .env
     environment:
CLAUDE.md:

@@ -214,9 +214,9 @@ test(frontend): add unit tests for ProjectDashboard
 
 ---
 
-## Phase 2+ Implementation Workflow
+## Rigorous Implementation Workflow
 
-**For complex infrastructure issues (Phase 2 MCP, core systems), follow this rigorous process:**
+**This workflow applies to ALL feature implementations. Follow this process rigorously:**
 
 ### 1. Branch Setup
 ```bash
@@ -257,12 +257,42 @@ Before closing an issue, perform deep review from multiple angles:
 
 **No stone unturned. No sloppy results. No unreviewed work.**
 
-### 5. Final Validation
+### 5. Stack Verification (CRITICAL - NON-NEGOTIABLE)
+
+**ALWAYS run the full stack and verify it boots correctly before considering ANY work done.**
+
+```bash
+# Start the full development stack
+make dev
+
+# Check backend logs for startup errors
+docker compose -f docker-compose.dev.yml logs backend --tail=100
+
+# Start frontend separately
+cd frontend && npm run dev
+
+# Check frontend console for errors
+```
+
+**A feature is NOT complete if:**
+- The stack doesn't boot
+- There are import errors in logs
+- Health checks fail
+- Any component crashes on startup
+
+**This rule exists because:**
+- Tests can pass but the application won't start (import errors, missing deps)
+- 90% test coverage is worthless if the app crashes on boot
+- Docker builds can mask local issues
+
+### 6. Final Validation Checklist
 - [ ] All tests pass (unit, integration, E2E)
 - [ ] Type checking passes
 - [ ] Linting passes
+- [ ] **Stack boots successfully** (backend + frontend)
+- [ ] **Logs show no errors**
+- [ ] Coverage meets threshold (>90% backend, >90% frontend)
 - [ ] Documentation updated
-- [ ] Coverage meets threshold
 - [ ] Issue checklist 100% complete
 - [ ] Multi-agent review passed
 
CollectionManager (knowledge-base collections module):

@@ -265,9 +265,10 @@ class CollectionManager:
         metadata: dict[str, Any] | None = None,
     ) -> IngestResult:
         """
-        Update a document by replacing existing chunks.
+        Update a document by atomically replacing existing chunks.
 
-        Deletes existing chunks for the source path and ingests new content.
+        Uses a database transaction to delete existing chunks and insert new ones
+        atomically, preventing race conditions during concurrent updates.
 
         Args:
             project_id: Project ID
@@ -282,26 +283,76 @@ class CollectionManager:
         Returns:
             Ingest result
         """
-        # First delete existing chunks for this source
-        await self.database.delete_by_source(
-            project_id=project_id,
-            source_path=source_path,
-            collection=collection,
-        )
+        request_metadata = metadata or {}
 
-        # Then ingest new content
-        request = IngestRequest(
-            project_id=project_id,
-            agent_id=agent_id,
+        # Chunk the content
+        chunks = self.chunker_factory.chunk_content(
             content=content,
             source_path=source_path,
-            collection=collection,
-            chunk_type=chunk_type,
             file_type=file_type,
-            metadata=metadata or {},
+            chunk_type=chunk_type,
+            metadata=request_metadata,
         )
 
-        return await self.ingest(request)
+        if not chunks:
+            # No chunks = delete existing and return empty result
+            await self.database.delete_by_source(
+                project_id=project_id,
+                source_path=source_path,
+                collection=collection,
+            )
+            return IngestResult(
+                success=True,
+                chunks_created=0,
+                embeddings_generated=0,
+                source_path=source_path,
+                collection=collection,
+                chunk_ids=[],
+            )
+
+        # Generate embeddings for new chunks
+        chunk_texts = [chunk.content for chunk in chunks]
+        embeddings_list = await self.embeddings.generate_batch(
+            texts=chunk_texts,
+            project_id=project_id,
+            agent_id=agent_id,
+        )
+
+        # Build embeddings data for transactional replace
+        embeddings_data = []
+        for chunk, embedding in zip(chunks, embeddings_list, strict=True):
+            chunk_metadata = {
+                **request_metadata,
+                **chunk.metadata,
+                "token_count": chunk.token_count,
+                "source_path": chunk.source_path or source_path,
+                "start_line": chunk.start_line,
+                "end_line": chunk.end_line,
+                "file_type": (chunk.file_type or file_type).value if (chunk.file_type or file_type) else None,
+            }
+            embeddings_data.append((
+                chunk.content,
+                embedding,
+                chunk.chunk_type,
+                chunk_metadata,
+            ))
+
+        # Atomically replace old embeddings with new ones
+        _, chunk_ids = await self.database.replace_source_embeddings(
+            project_id=project_id,
+            source_path=source_path,
+            collection=collection,
+            embeddings=embeddings_data,
+        )
+
+        return IngestResult(
+            success=True,
+            chunks_created=len(chunk_ids),
+            embeddings_generated=len(embeddings_list),
+            source_path=source_path,
+            collection=collection,
+            chunk_ids=chunk_ids,
+        )
 
     async def cleanup_expired(self) -> int:
         """
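The point of the atomic replace shows up under concurrency. A minimal sketch, assuming a wired-up manager instance and that the remaining `update_document` parameters keep their defaults (only the method name and keyword arguments come from this diff; the rest is hypothetical):

```python
import asyncio

async def race(manager) -> None:
    # With the old delete-then-ingest flow these two updates could interleave
    # as (delete A, delete B, ingest A, ingest B), leaving a mix of both
    # versions' chunks. With replace_source_embeddings the delete and inserts
    # commit in one transaction, so one writer's version wins wholesale.
    await asyncio.gather(
        manager.update_document(
            project_id="proj-123",
            agent_id="agent-456",
            source_path="/docs/guide.md",
            content="# Guide (version A)",
        ),
        manager.update_document(
            project_id="proj-123",
            agent_id="agent-456",
            source_path="/docs/guide.md",
            content="# Guide (version B)",
        ),
    )
```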
config.py (Settings):

@@ -112,6 +112,20 @@ class Settings(BaseSettings):
         description="TTL for embedding records in days (0 = no expiry)",
     )
 
+    # Content size limits (DoS prevention)
+    max_document_size: int = Field(
+        default=10 * 1024 * 1024,  # 10 MB
+        description="Maximum size of a single document in bytes",
+    )
+    max_batch_size: int = Field(
+        default=100,
+        description="Maximum number of documents in a batch operation",
+    )
+    max_batch_total_size: int = Field(
+        default=50 * 1024 * 1024,  # 50 MB
+        description="Maximum total size of all documents in a batch",
+    )
+
     model_config = {"env_prefix": "KB_", "env_file": ".env", "extra": "ignore"}
 
 
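Since `model_config` sets `env_prefix` to `KB_`, each new limit can be overridden through the environment without code changes. A small sketch (the 20 MB figure is an arbitrary example):

```python
import os

# Field name max_document_size + prefix "KB_" -> KB_MAX_DOCUMENT_SIZE.
os.environ["KB_MAX_DOCUMENT_SIZE"] = str(20 * 1024 * 1024)

from config import Settings  # Settings class from this diff

settings = Settings()
assert settings.max_document_size == 20 * 1024 * 1024
```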
DatabaseManager (knowledge-base database module):

@@ -285,38 +285,40 @@ class DatabaseManager:
 
         try:
             async with self.acquire() as conn:
+                # Wrap in transaction for all-or-nothing batch semantics
+                async with conn.transaction():
                     for project_id, collection, content, embedding, chunk_type, metadata in embeddings:
                         content_hash = self.compute_content_hash(content)
                         source_path = metadata.get("source_path")
                         start_line = metadata.get("start_line")
                         end_line = metadata.get("end_line")
                         file_type = metadata.get("file_type")
 
                         embedding_id = await conn.fetchval(
                             """
                             INSERT INTO knowledge_embeddings
                                 (project_id, collection, content, embedding, chunk_type,
                                  source_path, start_line, end_line, file_type, metadata,
                                  content_hash, expires_at)
                             VALUES ($1, $2, $3, $4, $5, $6, $7, $8, $9, $10, $11, $12)
                             ON CONFLICT DO NOTHING
                             RETURNING id
                             """,
                             project_id,
                             collection,
                             content,
                             embedding,
                             chunk_type.value,
                             source_path,
                             start_line,
                             end_line,
                             file_type,
                             metadata,
                             content_hash,
                             expires_at,
                         )
                         if embedding_id:
                             ids.append(str(embedding_id))
 
             logger.info(f"Stored {len(ids)} embeddings in batch")
             return ids
@@ -345,8 +347,9 @@ class DatabaseManager:
         """
         try:
             async with self.acquire() as conn:
-                # Build query with optional filters
-                query = """
+                # Build query with optional filters using CTE to filter by similarity
+                # We use a CTE to compute similarity once, then filter in outer query
+                inner_query = """
                     SELECT
                         id, project_id, collection, content, embedding,
                         chunk_type, source_path, start_line, end_line,
@@ -361,18 +364,21 @@ class DatabaseManager:
             param_idx = 3
 
             if collection:
-                query += f" AND collection = ${param_idx}"
+                inner_query += f" AND collection = ${param_idx}"
                 params.append(collection)
                 param_idx += 1
 
             if file_types:
                 file_type_values = [ft.value for ft in file_types]
-                query += f" AND file_type = ANY(${param_idx})"
+                inner_query += f" AND file_type = ANY(${param_idx})"
                 params.append(file_type_values)
                 param_idx += 1
 
-            query += f"""
-                HAVING 1 - (embedding <=> $1) >= ${param_idx}
+            # Wrap in CTE and filter by threshold in outer query
+            query = f"""
+                WITH scored AS ({inner_query})
+                SELECT * FROM scored
+                WHERE similarity >= ${param_idx}
                 ORDER BY similarity DESC
                 LIMIT ${param_idx + 1}
             """
@@ -531,6 +537,96 @@ class DatabaseManager:
                 cause=e,
             )
 
+    async def replace_source_embeddings(
+        self,
+        project_id: str,
+        source_path: str,
+        collection: str,
+        embeddings: list[tuple[str, list[float], ChunkType, dict[str, Any]]],
+    ) -> tuple[int, list[str]]:
+        """
+        Atomically replace all embeddings for a source path.
+
+        Deletes existing embeddings and inserts new ones in a single transaction,
+        preventing race conditions during document updates.
+
+        Args:
+            project_id: Project ID
+            source_path: Source file path being updated
+            collection: Collection name
+            embeddings: List of (content, embedding, chunk_type, metadata)
+
+        Returns:
+            Tuple of (deleted_count, new_embedding_ids)
+        """
+        expires_at = None
+        if self._settings.embedding_ttl_days > 0:
+            expires_at = datetime.now(UTC) + timedelta(
+                days=self._settings.embedding_ttl_days
+            )
+
+        try:
+            async with self.acquire() as conn:
+                # Use transaction for atomic replace
+                async with conn.transaction():
+                    # First, delete existing embeddings for this source
+                    delete_result = await conn.execute(
+                        """
+                        DELETE FROM knowledge_embeddings
+                        WHERE project_id = $1 AND source_path = $2 AND collection = $3
+                        """,
+                        project_id,
+                        source_path,
+                        collection,
+                    )
+                    deleted_count = int(delete_result.split()[-1])
+
+                    # Then insert new embeddings
+                    new_ids = []
+                    for content, embedding, chunk_type, metadata in embeddings:
+                        content_hash = self.compute_content_hash(content)
+                        start_line = metadata.get("start_line")
+                        end_line = metadata.get("end_line")
+                        file_type = metadata.get("file_type")
+
+                        embedding_id = await conn.fetchval(
+                            """
+                            INSERT INTO knowledge_embeddings
+                                (project_id, collection, content, embedding, chunk_type,
+                                 source_path, start_line, end_line, file_type, metadata,
+                                 content_hash, expires_at)
+                            VALUES ($1, $2, $3, $4, $5, $6, $7, $8, $9, $10, $11, $12)
+                            RETURNING id
+                            """,
+                            project_id,
+                            collection,
+                            content,
+                            embedding,
+                            chunk_type.value,
+                            source_path,
+                            start_line,
+                            end_line,
+                            file_type,
+                            metadata,
+                            content_hash,
+                            expires_at,
+                        )
+                        if embedding_id:
+                            new_ids.append(str(embedding_id))
+
+            logger.info(
+                f"Replaced source {source_path}: deleted {deleted_count}, "
+                f"inserted {len(new_ids)} embeddings"
+            )
+            return deleted_count, new_ids
+
+        except asyncpg.PostgresError as e:
+            logger.error(f"Replace source error: {e}")
+            raise DatabaseQueryError(
+                message=f"Failed to replace source embeddings: {e}",
+                cause=e,
+            )
+
     async def delete_collection(
         self,
         project_id: str,
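Two notes on the hunks above. First, the CTE rewrite also fixes a correctness problem: in Postgres, `HAVING` without `GROUP BY` treats the whole result set as one group, so it cannot express a per-row similarity filter; computing `similarity` once in a CTE and filtering it in an outer `WHERE` is the standard fix. Second, a minimal usage sketch of `replace_source_embeddings`, assuming an initialized manager `db`; the `ChunkType` import location and the embedding dimension are assumptions:

```python
from models import ChunkType  # assumed location of the enum

async def reindex_source(db) -> None:
    # One transaction: delete every chunk for the source, insert the new set.
    deleted, new_ids = await db.replace_source_embeddings(
        project_id="proj-123",
        source_path="/docs/guide.md",
        collection="default",
        embeddings=[
            (
                "chunk text",                        # content
                [0.0] * 1536,                        # embedding (hypothetical dimension)
                ChunkType("text"),                   # chunk_type
                {"start_line": 1, "end_line": 10},   # metadata
            ),
        ],
    )
    print(f"deleted {deleted}, inserted {len(new_ids)}")
```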
server.py (MCP knowledge-base server):

@@ -5,11 +5,13 @@ Provides RAG capabilities with pgvector for semantic search,
 intelligent chunking, and collection management.
 """
 
+import inspect
 import logging
 from contextlib import asynccontextmanager
-from typing import Any
+from typing import Any, get_type_hints
 
-from fastapi import FastAPI
+from fastapi import FastAPI, Request
+from fastapi.responses import JSONResponse
 from fastmcp import FastMCP
 from pydantic import Field
 
@@ -116,6 +118,259 @@ async def health_check() -> dict[str, Any]:
     return status
 
 
+# Tool registry for JSON-RPC
+_tool_registry: dict[str, Any] = {}
+
+
+def _python_type_to_json_schema(python_type: Any) -> dict[str, Any]:
+    """Convert Python type annotation to JSON Schema."""
+    type_name = getattr(python_type, "__name__", str(python_type))
+
+    if python_type is str or type_name == "str":
+        return {"type": "string"}
+    elif python_type is int or type_name == "int":
+        return {"type": "integer"}
+    elif python_type is float or type_name == "float":
+        return {"type": "number"}
+    elif python_type is bool or type_name == "bool":
+        return {"type": "boolean"}
+    elif type_name == "NoneType":
+        return {"type": "null"}
+    elif hasattr(python_type, "__origin__"):
+        origin = python_type.__origin__
+        args = getattr(python_type, "__args__", ())
+
+        if origin is list:
+            item_type = args[0] if args else Any
+            return {"type": "array", "items": _python_type_to_json_schema(item_type)}
+        elif origin is dict:
+            return {"type": "object"}
+        elif origin is type(None) or str(origin) == "typing.Union":
+            # Handle Optional types (Union with None)
+            non_none_args = [a for a in args if a is not type(None)]
+            if len(non_none_args) == 1:
+                schema = _python_type_to_json_schema(non_none_args[0])
+                schema["nullable"] = True
+                return schema
+            return {"type": "object"}
+    return {"type": "object"}
+
+
+def _get_tool_schema(func: Any) -> dict[str, Any]:
+    """Extract JSON Schema from a tool function."""
+    sig = inspect.signature(func)
+    hints = get_type_hints(func) if hasattr(func, "__annotations__") else {}
+
+    properties: dict[str, Any] = {}
+    required: list[str] = []
+
+    for name, param in sig.parameters.items():
+        if name in ("self", "cls"):
+            continue
+
+        prop: dict[str, Any] = {}
+
+        # Get type from hints
+        if name in hints:
+            prop = _python_type_to_json_schema(hints[name])
+
+        # Get description and constraints from Field default (FieldInfo object)
+        default_val = param.default
+        if hasattr(default_val, "description") and default_val.description:
+            prop["description"] = default_val.description
+        if hasattr(default_val, "ge") and default_val.ge is not None:
+            prop["minimum"] = default_val.ge
+        if hasattr(default_val, "le") and default_val.le is not None:
+            prop["maximum"] = default_val.le
+        # Handle Field default value (check for PydanticUndefined)
+        if hasattr(default_val, "default"):
+            field_default = default_val.default
+            # Check if it's the "required" sentinel (...)
+            if field_default is not ... and not (
+                hasattr(field_default, "__class__")
+                and "PydanticUndefined" in field_default.__class__.__name__
+            ):
+                prop["default"] = field_default
+
+        # Determine if required
+        if param.default is inspect.Parameter.empty:
+            required.append(name)
+        elif hasattr(default_val, "default"):
+            field_default = default_val.default
+            # Required if default is ellipsis or PydanticUndefined
+            if field_default is ... or (
+                hasattr(field_default, "__class__")
+                and "PydanticUndefined" in field_default.__class__.__name__
+            ):
+                required.append(name)
+
+        properties[name] = prop
+
+    return {
+        "type": "object",
+        "properties": properties,
+        "required": required,
+    }
+
+
+def _register_tool(name: str, tool_or_func: Any, description: str | None = None) -> None:
+    """Register a tool in the registry.
+
+    Handles both raw functions and FastMCP FunctionTool objects.
+    """
+    # Extract the underlying function from FastMCP FunctionTool if needed
+    if hasattr(tool_or_func, "fn"):
+        func = tool_or_func.fn
+        # Use FunctionTool's description if available
+        if not description and hasattr(tool_or_func, "description") and tool_or_func.description:
+            description = tool_or_func.description
+    else:
+        func = tool_or_func
+
+    _tool_registry[name] = {
+        "func": func,
+        "description": description or (func.__doc__ or "").strip(),
+        "schema": _get_tool_schema(func),
+    }
+
+
+@app.get("/mcp/tools")
+async def list_mcp_tools() -> dict[str, Any]:
+    """
+    Return list of available MCP tools with their schemas.
+
+    This endpoint enables tool discovery for the backend MCP client.
+    """
+    tools = []
+    for name, info in _tool_registry.items():
+        tools.append({
+            "name": name,
+            "description": info["description"],
+            "inputSchema": info["schema"],
+        })
+
+    return {"tools": tools}
+
+
+@app.post("/mcp")
+async def mcp_rpc(request: Request) -> JSONResponse:
+    """
+    JSON-RPC 2.0 endpoint for MCP tool execution.
+
+    Request format:
+    {
+        "jsonrpc": "2.0",
+        "method": "<tool_name>",
+        "params": {...},
+        "id": <request_id>
+    }
+
+    Response format:
+    {
+        "jsonrpc": "2.0",
+        "result": {...},
+        "id": <request_id>
+    }
+    """
+    try:
+        body = await request.json()
+    except Exception as e:
+        return JSONResponse(
+            status_code=400,
+            content={
+                "jsonrpc": "2.0",
+                "error": {"code": -32700, "message": f"Parse error: {e}"},
+                "id": None,
+            },
+        )
+
+    # Validate JSON-RPC structure
+    jsonrpc = body.get("jsonrpc")
+    method = body.get("method")
+    params = body.get("params", {})
+    request_id = body.get("id")
+
+    if jsonrpc != "2.0":
+        return JSONResponse(
+            status_code=400,
+            content={
+                "jsonrpc": "2.0",
+                "error": {"code": -32600, "message": "Invalid Request: jsonrpc must be '2.0'"},
+                "id": request_id,
+            },
+        )
+
+    if not method:
+        return JSONResponse(
+            status_code=400,
+            content={
+                "jsonrpc": "2.0",
+                "error": {"code": -32600, "message": "Invalid Request: method is required"},
+                "id": request_id,
+            },
+        )
+
+    # Look up tool
+    tool_info = _tool_registry.get(method)
+    if not tool_info:
+        return JSONResponse(
+            status_code=404,
+            content={
+                "jsonrpc": "2.0",
+                "error": {"code": -32601, "message": f"Method not found: {method}"},
+                "id": request_id,
+            },
+        )
+
+    # Execute tool
+    try:
+        func = tool_info["func"]
+
+        # Resolve Field defaults for missing parameters
+        sig = inspect.signature(func)
+        resolved_params = dict(params)
+        for name, param in sig.parameters.items():
+            if name not in resolved_params:
+                default_val = param.default
+                # Check if it's a FieldInfo with a default value
+                if hasattr(default_val, "default"):
+                    field_default = default_val.default
+                    # Only use if it has an actual default (not required)
+                    if field_default is not ... and not (
+                        hasattr(field_default, "__class__")
+                        and "PydanticUndefined" in field_default.__class__.__name__
+                    ):
+                        resolved_params[name] = field_default
+
+        result = await func(**resolved_params)
+        return JSONResponse(
+            content={
+                "jsonrpc": "2.0",
+                "result": result,
+                "id": request_id,
+            }
+        )
+    except TypeError as e:
+        return JSONResponse(
+            status_code=400,
+            content={
+                "jsonrpc": "2.0",
+                "error": {"code": -32602, "message": f"Invalid params: {e}"},
+                "id": request_id,
+            },
+        )
+    except Exception as e:
+        logger.error(f"Tool execution error: {e}")
+        return JSONResponse(
+            status_code=500,
+            content={
+                "jsonrpc": "2.0",
+                "error": {"code": -32000, "message": f"Server error: {e}"},
+                "id": request_id,
+            },
+        )
+
+
 # MCP Tools
 
 
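To make the converter's behavior concrete, here is what `_python_type_to_json_schema` yields for a few common annotations (traced against the logic above; the import path assumes the module is named `server`, as the tests below do):

```python
from typing import Optional

from server import _python_type_to_json_schema

print(_python_type_to_json_schema(str))             # {'type': 'string'}
print(_python_type_to_json_schema(list[int]))       # {'type': 'array', 'items': {'type': 'integer'}}
print(_python_type_to_json_schema(dict[str, int]))  # {'type': 'object'}
print(_python_type_to_json_schema(Optional[str]))   # {'type': 'string', 'nullable': True}
```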
@@ -261,6 +516,15 @@ async def ingest_content(
     the LLM Gateway, and stored in pgvector for search.
     """
     try:
+        # Validate content size to prevent DoS
+        settings = get_settings()
+        content_size = len(content.encode("utf-8"))
+        if content_size > settings.max_document_size:
+            return {
+                "success": False,
+                "error": f"Content size ({content_size} bytes) exceeds maximum allowed ({settings.max_document_size} bytes)",
+            }
+
         # Parse chunk type
         try:
             chunk_type_enum = ChunkType(chunk_type.lower())
@@ -492,6 +756,15 @@ async def update_document(
     Replaces all existing chunks for the source path with new content.
     """
     try:
+        # Validate content size to prevent DoS
+        settings = get_settings()
+        content_size = len(content.encode("utf-8"))
+        if content_size > settings.max_document_size:
+            return {
+                "success": False,
+                "error": f"Content size ({content_size} bytes) exceeds maximum allowed ({settings.max_document_size} bytes)",
+            }
+
         # Parse chunk type
         try:
             chunk_type_enum = ChunkType(chunk_type.lower())
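Note the guard measures UTF-8 bytes, not characters, so multibyte text counts at its encoded weight. The same check in isolation (limit value from the new Settings default):

```python
def exceeds_limit(content: str, max_document_size: int = 10 * 1024 * 1024) -> bool:
    # Mirrors the guard above: compare the UTF-8 encoded size to the cap.
    return len(content.encode("utf-8")) > max_document_size

assert not exceeds_limit("x" * (10 * 1024 * 1024))  # exactly at the cap passes
assert exceeds_limit("é" * (6 * 1024 * 1024))       # 2 bytes each -> 12 MiB, rejected
```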
@@ -550,6 +823,16 @@ async def update_document(
         }
 
 
+# Register tools in the JSON-RPC registry
+# This must happen after tool functions are defined
+_register_tool("search_knowledge", search_knowledge)
+_register_tool("ingest_content", ingest_content)
+_register_tool("delete_content", delete_content)
+_register_tool("list_collections", list_collections)
+_register_tool("get_collection_stats", get_collection_stats)
+_register_tool("update_document", update_document)
+
+
 def main() -> None:
     """Run the server."""
     import uvicorn
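With the six tools registered, any HTTP client can exercise the new endpoints end to end. A sketch using `httpx`; the base URL is an assumption (the Dockerfile in this compare serves uvicorn on port 8000), and the params mirror the test payloads:

```python
import httpx

BASE = "http://localhost:8000"  # assumed host/port

# Tool discovery: each entry carries "name", "description", and "inputSchema".
tools = httpx.get(f"{BASE}/mcp/tools").json()["tools"]
print([t["name"] for t in tools])  # six names, e.g. "search_knowledge"

# Invoke a tool via JSON-RPC 2.0.
resp = httpx.post(
    f"{BASE}/mcp",
    json={
        "jsonrpc": "2.0",
        "method": "search_knowledge",
        "params": {
            "project_id": "proj-123",
            "agent_id": "agent-456",
            "query": "how is content chunked?",
        },
        "id": 1,
    },
)
payload = resp.json()
# Success responses carry "result"; failures carry "error" with JSON-RPC codes
# (-32700 parse error, -32600 invalid request, -32601 method not found,
#  -32602 invalid params, -32000 server error).
print(payload.get("result") or payload.get("error"))
```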
test fixtures (mock_database):

@@ -61,6 +61,7 @@ def mock_database():
     mock_db.delete_by_source = AsyncMock(return_value=1)
     mock_db.delete_collection = AsyncMock(return_value=5)
     mock_db.delete_by_ids = AsyncMock(return_value=2)
+    mock_db.replace_source_embeddings = AsyncMock(return_value=(1, ["new-id-1"]))
     mock_db.list_collections = AsyncMock(return_value=[])
     mock_db.get_collection_stats = AsyncMock()
     mock_db.cleanup_expired = AsyncMock(return_value=0)
CollectionManager tests:

@@ -192,7 +192,7 @@ class TestCollectionManager:
 
     @pytest.mark.asyncio
     async def test_update_document(self, collection_manager):
-        """Test updating a document."""
+        """Test updating a document with atomic replace."""
         result = await collection_manager.update_document(
             project_id="proj-123",
             agent_id="agent-456",
@@ -201,9 +201,10 @@ class TestCollectionManager:
             collection="default",
         )
 
-        # Should delete first, then ingest
-        collection_manager._database.delete_by_source.assert_called_once()
+        # Should use atomic replace (delete + insert in transaction)
+        collection_manager._database.replace_source_embeddings.assert_called_once()
         assert result.success is True
+        assert len(result.chunk_ids) == 1
 
     @pytest.mark.asyncio
     async def test_cleanup_expired(self, collection_manager):
server tests:

@@ -1,9 +1,11 @@
 """Tests for server module and MCP tools."""
 
+import json
 from datetime import UTC, datetime
-from unittest.mock import AsyncMock, MagicMock
+from unittest.mock import AsyncMock, MagicMock, patch
 
 import pytest
+from fastapi.testclient import TestClient
 
 
 class TestHealthCheck:
@@ -355,3 +357,248 @@ class TestUpdateDocumentTool:
 
         assert result["success"] is False
         assert "Invalid chunk type" in result["error"]
+
+
+class TestMCPToolsEndpoint:
+    """Tests for /mcp/tools endpoint."""
+
+    def test_list_mcp_tools(self):
+        """Test listing available MCP tools."""
+        import server
+
+        client = TestClient(server.app)
+        response = client.get("/mcp/tools")
+
+        assert response.status_code == 200
+        data = response.json()
+        assert "tools" in data
+        assert len(data["tools"]) == 6  # 6 tools registered
+
+        tool_names = [t["name"] for t in data["tools"]]
+        assert "search_knowledge" in tool_names
+        assert "ingest_content" in tool_names
+        assert "delete_content" in tool_names
+        assert "list_collections" in tool_names
+        assert "get_collection_stats" in tool_names
+        assert "update_document" in tool_names
+
+    def test_tool_has_schema(self):
+        """Test that each tool has input schema."""
+        import server
+
+        client = TestClient(server.app)
+        response = client.get("/mcp/tools")
+
+        data = response.json()
+        for tool in data["tools"]:
+            assert "inputSchema" in tool
+            assert "type" in tool["inputSchema"]
+            assert tool["inputSchema"]["type"] == "object"
+
+
+class TestMCPRPCEndpoint:
+    """Tests for /mcp JSON-RPC endpoint."""
+
+    def test_valid_jsonrpc_request(self):
+        """Test valid JSON-RPC request."""
+        import server
+        from models import SearchResponse, SearchResult
+
+        mock_search = MagicMock()
+        mock_search.search = AsyncMock(
+            return_value=SearchResponse(
+                query="test",
+                search_type="hybrid",
+                results=[
+                    SearchResult(
+                        id="id-1",
+                        content="Test",
+                        score=0.9,
+                        source_path="/test.py",
+                        chunk_type="code",
+                        collection="default",
+                    )
+                ],
+                total_results=1,
+                search_time_ms=5.0,
+            )
+        )
+        server._search = mock_search
+
+        client = TestClient(server.app)
+        response = client.post(
+            "/mcp",
+            json={
+                "jsonrpc": "2.0",
+                "method": "search_knowledge",
+                "params": {
+                    "project_id": "proj-123",
+                    "agent_id": "agent-456",
+                    "query": "test",
+                },
+                "id": 1,
+            },
+        )
+
+        assert response.status_code == 200
+        data = response.json()
+        assert data["jsonrpc"] == "2.0"
+        assert data["id"] == 1
+        assert "result" in data
+        assert data["result"]["success"] is True
+
+    def test_invalid_jsonrpc_version(self):
+        """Test request with invalid JSON-RPC version."""
+        import server
+
+        client = TestClient(server.app)
+        response = client.post(
+            "/mcp",
+            json={
+                "jsonrpc": "1.0",
+                "method": "search_knowledge",
+                "params": {},
+                "id": 1,
+            },
+        )
+
+        assert response.status_code == 400
+        data = response.json()
+        assert data["error"]["code"] == -32600
+        assert "jsonrpc must be '2.0'" in data["error"]["message"]
+
+    def test_missing_method(self):
+        """Test request without method."""
+        import server
+
+        client = TestClient(server.app)
+        response = client.post(
+            "/mcp",
+            json={
+                "jsonrpc": "2.0",
+                "params": {},
+                "id": 1,
+            },
+        )
+
+        assert response.status_code == 400
+        data = response.json()
+        assert data["error"]["code"] == -32600
+        assert "method is required" in data["error"]["message"]
+
+    def test_unknown_method(self):
+        """Test request with unknown method."""
+        import server
+
+        client = TestClient(server.app)
+        response = client.post(
+            "/mcp",
+            json={
+                "jsonrpc": "2.0",
+                "method": "unknown_method",
+                "params": {},
+                "id": 1,
+            },
+        )
+
+        assert response.status_code == 404
+        data = response.json()
+        assert data["error"]["code"] == -32601
+        assert "Method not found" in data["error"]["message"]
+
+    def test_invalid_params(self):
+        """Test request with invalid params."""
+        import server
+
+        client = TestClient(server.app)
+        response = client.post(
+            "/mcp",
+            json={
+                "jsonrpc": "2.0",
+                "method": "search_knowledge",
+                "params": {"invalid_param": "value"},  # Missing required params
+                "id": 1,
+            },
+        )
+
+        assert response.status_code == 400
+        data = response.json()
+        assert data["error"]["code"] == -32602
+
+
+class TestContentSizeLimits:
+    """Tests for content size validation."""
+
+    @pytest.mark.asyncio
+    async def test_ingest_rejects_oversized_content(self):
+        """Test that ingest rejects content exceeding size limit."""
+        import server
+        from config import get_settings
+
+        settings = get_settings()
+        # Create content larger than max size
+        oversized_content = "x" * (settings.max_document_size + 1)
+
+        result = await server.ingest_content.fn(
+            project_id="proj-123",
+            agent_id="agent-456",
+            content=oversized_content,
+            chunk_type="text",
+        )
+
+        assert result["success"] is False
+        assert "exceeds maximum" in result["error"]
+
+    @pytest.mark.asyncio
+    async def test_update_rejects_oversized_content(self):
+        """Test that update rejects content exceeding size limit."""
+        import server
+        from config import get_settings
+
+        settings = get_settings()
+        oversized_content = "x" * (settings.max_document_size + 1)
+
+        result = await server.update_document.fn(
+            project_id="proj-123",
+            agent_id="agent-456",
+            source_path="/test.py",
+            content=oversized_content,
+            chunk_type="text",
+        )
+
+        assert result["success"] is False
+        assert "exceeds maximum" in result["error"]
+
+    @pytest.mark.asyncio
+    async def test_ingest_accepts_valid_size_content(self):
+        """Test that ingest accepts content within size limit."""
+        import server
+        from models import IngestResult
+
+        mock_collections = MagicMock()
+        mock_collections.ingest = AsyncMock(
+            return_value=IngestResult(
+                success=True,
+                chunks_created=1,
+                embeddings_generated=1,
+                source_path="/test.py",
+                collection="default",
+                chunk_ids=["id-1"],
+            )
+        )
+        server._collections = mock_collections
+
+        # Small content that's within limits
+        # Pass all parameters to avoid Field default resolution issues
+        result = await server.ingest_content.fn(
+            project_id="proj-123",
+            agent_id="agent-456",
+            content="def hello(): pass",
+            source_path="/test.py",
+            collection="default",
+            chunk_type="text",
+            file_type=None,
+            metadata=None,
+        )
+
+        assert result["success"] is True