docs: rename Docker deployment to self-hosting guide with comprehensive monitoring documentation

Major documentation restructuring to emphasize self-hosting capabilities and fully document the real-time monitoring system.

Changes:
- Renamed docker-deployment.md → self-hosting.md to better reflect the value proposition
- Updated mkdocs.yml navigation to "Self-Hosting Guide"
- Completely rewrote introduction emphasizing self-hosting benefits:
  * Data privacy and ownership
  * Cost control and transparency
  * Performance and security advantages
  * Full customization capabilities
- Expanded "Metrics & Monitoring" → "Real-time Monitoring & Operations" with:
  * Monitoring Dashboard section documenting the /monitor UI
  * Complete feature breakdown (system health, requests, browsers, janitor, errors)
  * Monitor API Endpoints with all REST endpoints and examples
  * WebSocket Streaming integration guide with Python examples
  * Control Actions for manual browser management
  * Production Integration patterns (Prometheus, custom dashboards, alerting)
  * Key production metrics to track
- Enhanced summary section:
  * What users learned checklist
  * Why self-hosting matters
  * Clear next steps
  * Key resources with monitoring dashboard URL

The monitoring dashboard built 2-3 weeks ago is now fully documented and discoverable. Users will understand they have complete operational visibility at http://localhost:11235/monitor with real-time updates, browser pool management, and programmatic control via REST/WebSocket APIs.

This positions Crawl4AI as an enterprise-grade self-hosting solution with DevOps-level monitoring capabilities, not just a Docker deployment.
Update gitignore
2025-11-09 13:31:52 +08:00 · 2025-11-09 10:49:42 +08:00 · 2025-10-18 12:41:29 +08:00 · 2025-10-18 12:05:49 +08:00 · 2025-10-18 11:38:25 +08:00 · 2025-10-17 22:43:06 +08:00
46 changed files with 6177 additions and 9200 deletions
--- a/.github/workflows/docker-release.yml
+++ b/.github/workflows/docker-release.yml
@@ -1,81 +0,0 @@
 name: Docker Release
 on:
  release:
    types: [published]
  push:
    tags:
      - 'docker-rebuild-v*'  # Allow manual Docker rebuilds via tags
 jobs:
  docker:
    runs-on: ubuntu-latest
    steps:
      - name: Checkout code
        uses: actions/checkout@v4
      - name: Extract version from release or tag
        id: get_version
        run: |
          if [ "${{ github.event_name }}" == "release" ]; then
            # Triggered by release event
            VERSION="${{ github.event.release.tag_name }}"
            VERSION=${VERSION#v}  # Remove 'v' prefix
          else
            # Triggered by docker-rebuild-v* tag
            VERSION=${GITHUB_REF#refs/tags/docker-rebuild-v}
          fi
          echo "VERSION=$VERSION" >> $GITHUB_OUTPUT
          echo "Building Docker images for version: $VERSION"
      - name: Extract major and minor versions
        id: versions
        run: |
          VERSION=${{ steps.get_version.outputs.VERSION }}
          MAJOR=$(echo $VERSION | cut -d. -f1)
          MINOR=$(echo $VERSION | cut -d. -f1-2)
          echo "MAJOR=$MAJOR" >> $GITHUB_OUTPUT
          echo "MINOR=$MINOR" >> $GITHUB_OUTPUT
          echo "Semantic versions - Major: $MAJOR, Minor: $MINOR"
      - name: Set up Docker Buildx
        uses: docker/setup-buildx-action@v3
      - name: Log in to Docker Hub
        uses: docker/login-action@v3
        with:
          username: ${{ secrets.DOCKER_USERNAME }}
          password: ${{ secrets.DOCKER_TOKEN }}
      - name: Build and push Docker images
        uses: docker/build-push-action@v5
        with:
          context: .
          push: true
          tags: |
            unclecode/crawl4ai:${{ steps.get_version.outputs.VERSION }}
            unclecode/crawl4ai:${{ steps.versions.outputs.MINOR }}
            unclecode/crawl4ai:${{ steps.versions.outputs.MAJOR }}
            unclecode/crawl4ai:latest
          platforms: linux/amd64,linux/arm64
          cache-from: type=gha
          cache-to: type=gha,mode=max
      - name: Summary
        run: |
          echo "## 🐳 Docker Release Complete!" >> $GITHUB_STEP_SUMMARY
          echo "" >> $GITHUB_STEP_SUMMARY
          echo "### Published Images" >> $GITHUB_STEP_SUMMARY
          echo "- \`unclecode/crawl4ai:${{ steps.get_version.outputs.VERSION }}\`" >> $GITHUB_STEP_SUMMARY
          echo "- \`unclecode/crawl4ai:${{ steps.versions.outputs.MINOR }}\`" >> $GITHUB_STEP_SUMMARY
          echo "- \`unclecode/crawl4ai:${{ steps.versions.outputs.MAJOR }}\`" >> $GITHUB_STEP_SUMMARY
          echo "- \`unclecode/crawl4ai:latest\`" >> $GITHUB_STEP_SUMMARY
          echo "" >> $GITHUB_STEP_SUMMARY
          echo "### Platforms" >> $GITHUB_STEP_SUMMARY
          echo "- linux/amd64" >> $GITHUB_STEP_SUMMARY
          echo "- linux/arm64" >> $GITHUB_STEP_SUMMARY
          echo "" >> $GITHUB_STEP_SUMMARY
          echo "### 🚀 Pull Command" >> $GITHUB_STEP_SUMMARY
          echo "\`\`\`bash" >> $GITHUB_STEP_SUMMARY
          echo "docker pull unclecode/crawl4ai:${{ steps.get_version.outputs.VERSION }}" >> $GITHUB_STEP_SUMMARY
          echo "\`\`\`" >> $GITHUB_STEP_SUMMARY
--- a/.github/workflows/docs/ARCHITECTURE.md
+++ b/.github/workflows/docs/ARCHITECTURE.md
@@ -1,917 +0,0 @@
 # Workflow Architecture Documentation
 ## Overview
 This document describes the technical architecture of the split release pipeline for Crawl4AI.
 ---
 ## Architecture Diagram
 ```
 ┌─────────────────────────────────────────────────────────────────┐
 │                         Developer                                │
 │                              │                                   │
 │                              ▼                                   │
 │                    git tag v1.2.3                               │
 │                    git push --tags                              │
 └──────────────────────────────┬──────────────────────────────────┘
                               │
                               ▼
 ┌─────────────────────────────────────────────────────────────────┐
 │                      GitHub Repository                           │
 │                                                                  │
 │  ┌────────────────────────────────────────────────────────┐   │
 │  │                  Tag Event: v1.2.3                      │   │
 │  └────────────────────────────────────────────────────────┘   │
 │                              │                                   │
 │                              ▼                                   │
 │  ┌────────────────────────────────────────────────────────┐   │
 │  │           release.yml (Release Pipeline)               │   │
 │  │  ┌──────────────────────────────────────────────┐     │   │
 │  │  │ 1. Extract Version                            │     │   │
 │  │  │    v1.2.3 → 1.2.3                            │     │   │
 │  │  └──────────────────────────────────────────────┘     │   │
 │  │  ┌──────────────────────────────────────────────┐     │   │
 │  │  │ 2. Validate Version                           │     │   │
 │  │  │    Tag == __version__.py                      │     │   │
 │  │  └──────────────────────────────────────────────┘     │   │
 │  │  ┌──────────────────────────────────────────────┐     │   │
 │  │  │ 3. Build Python Package                       │     │   │
 │  │  │    - Source dist (.tar.gz)                    │     │   │
 │  │  │    - Wheel (.whl)                             │     │   │
 │  │  └──────────────────────────────────────────────┘     │   │
 │  │  ┌──────────────────────────────────────────────┐     │   │
 │  │  │ 4. Upload to PyPI                             │     │   │
 │  │  │    - Authenticate with token                  │     │   │
 │  │  │    - Upload dist/*                            │     │   │
 │  │  └──────────────────────────────────────────────┘     │   │
 │  │  ┌──────────────────────────────────────────────┐     │   │
 │  │  │ 5. Create GitHub Release                      │     │   │
 │  │  │    - Tag: v1.2.3                              │     │   │
 │  │  │    - Body: Install instructions               │     │   │
 │  │  │    - Status: Published                        │     │   │
 │  │  └──────────────────────────────────────────────┘     │   │
 │  └────────────────────────────────────────────────────────┘   │
 │                              │                                   │
 │                              ▼                                   │
 │  ┌────────────────────────────────────────────────────────┐   │
 │  │         Release Event: published (v1.2.3)              │   │
 │  └────────────────────────────────────────────────────────┘   │
 │                              │                                   │
 │                              ▼                                   │
 │  ┌────────────────────────────────────────────────────────┐   │
 │  │         docker-release.yml (Docker Pipeline)           │   │
 │  │  ┌──────────────────────────────────────────────┐     │   │
 │  │  │ 1. Extract Version from Release               │     │   │
 │  │  │    github.event.release.tag_name → 1.2.3     │     │   │
 │  │  └──────────────────────────────────────────────┘     │   │
 │  │  ┌──────────────────────────────────────────────┐     │   │
 │  │  │ 2. Parse Semantic Versions                    │     │   │
 │  │  │    1.2.3 → Major: 1, Minor: 1.2              │     │   │
 │  │  └──────────────────────────────────────────────┘     │   │
 │  │  ┌──────────────────────────────────────────────┐     │   │
 │  │  │ 3. Setup Multi-Arch Build                     │     │   │
 │  │  │    - Docker Buildx                            │     │   │
 │  │  │    - QEMU emulation                           │     │   │
 │  │  └──────────────────────────────────────────────┘     │   │
 │  │  ┌──────────────────────────────────────────────┐     │   │
 │  │  │ 4. Authenticate Docker Hub                    │     │   │
 │  │  │    - Username: DOCKER_USERNAME                │     │   │
 │  │  │    - Token: DOCKER_TOKEN                      │     │   │
 │  │  └──────────────────────────────────────────────┘     │   │
 │  │  ┌──────────────────────────────────────────────┐     │   │
 │  │  │ 5. Build Multi-Arch Images                    │     │   │
 │  │  │    ┌────────────────┬────────────────┐       │     │   │
 │  │  │    │  linux/amd64   │  linux/arm64   │       │     │   │
 │  │  │    └────────────────┴────────────────┘       │     │   │
 │  │  │    Cache: GitHub Actions (type=gha)          │     │   │
 │  │  └──────────────────────────────────────────────┘     │   │
 │  │  ┌──────────────────────────────────────────────┐     │   │
 │  │  │ 6. Push to Docker Hub                         │     │   │
 │  │  │    Tags:                                      │     │   │
 │  │  │    - unclecode/crawl4ai:1.2.3                │     │   │
 │  │  │    - unclecode/crawl4ai:1.2                  │     │   │
 │  │  │    - unclecode/crawl4ai:1                    │     │   │
 │  │  │    - unclecode/crawl4ai:latest               │     │   │
 │  │  └──────────────────────────────────────────────┘     │   │
 │  └────────────────────────────────────────────────────────┘   │
 └─────────────────────────────────────────────────────────────────┘
                               │
                               ▼
 ┌─────────────────────────────────────────────────────────────────┐
 │                     External Services                            │
 │                                                                  │
 │  ┌──────────────┐  ┌──────────────┐  ┌──────────────┐         │
 │  │    PyPI      │  │  Docker Hub  │  │   GitHub     │         │
 │  │              │  │              │  │              │         │
 │  │  crawl4ai    │  │ unclecode/   │  │  Releases    │         │
 │  │  1.2.3       │  │ crawl4ai     │  │  v1.2.3      │         │
 │  └──────────────┘  └──────────────┘  └──────────────┘         │
 └─────────────────────────────────────────────────────────────────┘
 ```
 ---
 ## Component Details
 ### 1. Release Pipeline (release.yml)
 #### Purpose
 Fast publication of Python package and GitHub release.
 #### Input
 - **Trigger**: Git tag matching `v*` (excluding `test-v*`)
 - **Example**: `v1.2.3`
 #### Processing Stages
 ##### Stage 1: Version Extraction
 ```bash
 Input:  refs/tags/v1.2.3
 Output: VERSION=1.2.3
 ```
 **Implementation**:
 ```bash
 TAG_VERSION=${GITHUB_REF#refs/tags/v}  # Remove 'refs/tags/v' prefix
 echo "VERSION=$TAG_VERSION" >> $GITHUB_OUTPUT
 ```
 ##### Stage 2: Version Validation
 ```bash
 Input:  TAG_VERSION=1.2.3
 Check:  crawl4ai/__version__.py contains __version__ = "1.2.3"
 Output: Pass/Fail
 ```
 **Implementation**:
 ```bash
 PACKAGE_VERSION=$(python -c "from crawl4ai.__version__ import __version__; print(__version__)")
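 if [ "$TAG_VERSION" != "$PACKAGE_VERSION" ]; then
  echo "Version mismatch: tag $TAG_VERSION vs package $PACKAGE_VERSION" >&2
  exit 1  # Fail the job so nothing is built or published
 fi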
 ```
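The same check can be tried locally before pushing a tag. The sketch below is an illustration only, not part of the workflow: the function names `tag_to_version` and `versions_match` are hypothetical, and the prefix handling is assumed to mirror the `${GITHUB_REF#refs/tags/v}` expansion shown in Stage 1.

```python
def tag_to_version(github_ref: str) -> str:
    """Strip the 'refs/tags/v' prefix from a Git ref (hypothetical helper).

    Like the shell expansion ${GITHUB_REF#refs/tags/v}, the value is
    returned unchanged when the prefix is absent.
    """
    return github_ref.removeprefix("refs/tags/v")


def versions_match(github_ref: str, package_version: str) -> bool:
    """Return True when the tag version equals the packaged __version__."""
    return tag_to_version(github_ref) == package_version
```

For example, `versions_match("refs/tags/v1.2.3", "1.2.3")` passes, while a tag of `v1.2.4` against a package version of `1.2.3` fails the check.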
 ##### Stage 3: Package Build
 ```bash
 Input:  Source code + pyproject.toml
 Output: dist/crawl4ai-1.2.3.tar.gz
        dist/crawl4ai-1.2.3-py3-none-any.whl
 ```
 **Implementation**:
 ```bash
 python -m build
 # Uses build backend defined in pyproject.toml
 ```
 ##### Stage 4: PyPI Upload
 ```bash
 Input:  dist/*.{tar.gz,whl}
 Auth:   PYPI_TOKEN
 Output: Package published to PyPI
 ```
 **Implementation**:
 ```bash
 twine upload dist/*
 # Environment:
 #   TWINE_USERNAME: __token__
 #   TWINE_PASSWORD: ${{ secrets.PYPI_TOKEN }}
 ```
 ##### Stage 5: GitHub Release Creation
 ```bash
 Input:  Tag: v1.2.3
        Body: Markdown content
 Output: Published GitHub release
 ```
 **Implementation**:
 ```yaml
 uses: softprops/action-gh-release@v2
 with:
  tag_name: v1.2.3
  name: Release v1.2.3
  body: |
    Installation instructions and changelog
  draft: false
  prerelease: false
 ```
 #### Output
 - **PyPI Package**: https://pypi.org/project/crawl4ai/1.2.3/
 - **GitHub Release**: Published release on repository
 - **Event**: `release.published` (triggers Docker workflow)
 #### Timeline
 ```
 0:00 - Tag pushed
 0:01 - Checkout + Python setup
 0:02 - Version validation
 0:03 - Package build
 0:04 - PyPI upload starts
 0:06 - PyPI upload complete
 0:07 - GitHub release created
 0:08 - Workflow complete
 ```
 ---
 ### 2. Docker Release Pipeline (docker-release.yml)
 #### Purpose
 Build and publish multi-architecture Docker images.
 #### Inputs
 ##### Input 1: Release Event (Automatic)
 ```yaml
 Event: release.published
 Data:  github.event.release.tag_name = "v1.2.3"
 ```
 ##### Input 2: Docker Rebuild Tag (Manual)
 ```yaml
 Tag: docker-rebuild-v1.2.3
 ```
 #### Processing Stages
 ##### Stage 1: Version Detection
 ```bash
 # From release event:
 VERSION="${{ github.event.release.tag_name }}"
 VERSION=${VERSION#v}  # Remove 'v' prefix
 # Result: "1.2.3"
 # From rebuild tag:
 VERSION=${GITHUB_REF#refs/tags/docker-rebuild-v}
 # Result: "1.2.3"
 ```
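The two version sources can be modeled in a few lines of Python. This is a sketch under assumptions: `detect_version` is a hypothetical name, and the event names `release` and `push` are taken from the workflow triggers rather than from any documented helper.

```python
def detect_version(event_name: str, release_tag: str = "", github_ref: str = "") -> str:
    """Derive the Docker image version from whichever event fired (hypothetical helper)."""
    if event_name == "release":
        # Release event: drop a single leading 'v' from the release tag name.
        return release_tag.removeprefix("v")
    # Tag push: strip the docker-rebuild prefix from the full Git ref.
    return github_ref.removeprefix("refs/tags/docker-rebuild-v")
```

Both paths are expected to yield the same bare version string, so the later stages do not need to know which trigger was used.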
 ##### Stage 2: Semantic Version Parsing
 ```bash
 Input:  VERSION=1.2.3
 Output: MAJOR=1
        MINOR=1.2
        PATCH=3 (implicit)
 ```
 **Implementation**:
 ```bash
 MAJOR=$(echo $VERSION | cut -d. -f1)    # Extract first component
 MINOR=$(echo $VERSION | cut -d. -f1-2)  # Extract first two components
 ```
 ##### Stage 3: Multi-Architecture Setup
 ```yaml
 Setup:
  - Docker Buildx (multi-platform builder)
  - QEMU (ARM emulation on x86)
 Platforms:
  - linux/amd64 (x86_64)
  - linux/arm64 (aarch64)
 ```
 **Architecture**:
 ```
 GitHub Runner (linux/amd64)
  ├─ Buildx Builder
  │   ├─ Native: Build linux/amd64 image
  │   └─ QEMU: Emulate ARM to build linux/arm64 image
  └─ Generate manifest list (points to both images)
 ```
 ##### Stage 4: Docker Hub Authentication
 ```bash
 Input:  DOCKER_USERNAME
        DOCKER_TOKEN
 Output: Authenticated Docker client
 ```
 ##### Stage 5: Build with Cache
 ```yaml
 Cache Configuration:
  cache-from: type=gha           # Read from GitHub Actions cache
  cache-to: type=gha,mode=max    # Write all layers
 Cache Key Components:
  - Workflow file path
  - Branch name
  - Architecture (amd64/arm64)
 ```
 **Cache Hierarchy**:
 ```
 Cache Entry: main/docker-release.yml/linux-amd64
  ├─ Layer: sha256:abc123... (FROM python:3.12)
  ├─ Layer: sha256:def456... (RUN apt-get update)
  ├─ Layer: sha256:ghi789... (COPY requirements.txt)
  ├─ Layer: sha256:jkl012... (RUN pip install)
  └─ Layer: sha256:mno345... (COPY . /app)
 Cache Hit/Miss Logic:
  - If layer input unchanged → cache hit → skip build
  - If layer input changed → cache miss → rebuild that layer and all subsequent layers
 ```
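The hit/miss rule can be expressed as a small function. This is a simplified model, not how BuildKit is implemented: `layers_to_rebuild` is a hypothetical name, and each layer is identified only by the instruction text from the hierarchy above.

```python
def layers_to_rebuild(layers: list[str], changed: set[str]) -> list[str]:
    """Return the layers that miss the cache, in build order.

    The first layer whose input changed invalidates itself and every
    layer after it; layers before it are cache hits.
    """
    for index, layer in enumerate(layers):
        if layer in changed:
            return layers[index:]
    return []  # Nothing changed: every layer is a cache hit.
```

Under this model a change to the application code rebuilds only the final `COPY . /app` layer, while a change to `requirements.txt` also forces `RUN pip install` and everything after it to run again.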
 ##### Stage 6: Tag Generation
 ```bash
 Input:  VERSION=1.2.3, MAJOR=1, MINOR=1.2
 Output Tags:
  - unclecode/crawl4ai:1.2.3    (exact version)
  - unclecode/crawl4ai:1.2      (minor version)
  - unclecode/crawl4ai:1        (major version)
  - unclecode/crawl4ai:latest   (latest stable)
 ```
 **Tag Strategy**:
 - All tags point to same image SHA
 - Users can pin to desired stability level
 - Pushing new version updates `1`, `1.2`, and `latest` automatically
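As a rough illustration of the tag strategy, the following sketch derives the four tags from a version string. The function name `docker_tags` is hypothetical; the split logic is assumed to match the `cut -d. -f1` and `cut -d. -f1-2` commands from Stage 2.

```python
def docker_tags(version: str, image: str = "unclecode/crawl4ai") -> list[str]:
    """Build the exact, minor, major, and latest tags for one release."""
    parts = version.split(".")
    major = parts[0]                # Equivalent to: cut -d. -f1
    minor = ".".join(parts[:2])     # Equivalent to: cut -d. -f1-2
    return [f"{image}:{tag}" for tag in (version, minor, major, "latest")]
```

Calling `docker_tags("1.2.3")` gives the four tags listed above, in the same order.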
 ##### Stage 7: Push to Registry
 ```bash
 For each tag:
  For each platform (amd64, arm64):
    Push image to Docker Hub
 Create manifest list:
  Manifest: unclecode/crawl4ai:1.2.3
    ├─ linux/amd64: sha256:abc...
    └─ linux/arm64: sha256:def...
 Docker CLI automatically selects correct platform on pull
 ```
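Platform selection from a manifest list can be pictured as a lookup keyed by platform. The sketch below is a conceptual model only: `select_image` is a hypothetical name, and the digest strings are the same elided placeholders used in the diagram, not real digests.

```python
def select_image(manifest_list: dict[str, str], platform: str) -> str:
    """Return the image digest that a pull on the given platform would resolve to."""
    try:
        return manifest_list[platform]
    except KeyError:
        raise LookupError(f"no image published for platform {platform!r}") from None


# Placeholder digests copied from the diagram above (elided, not real values).
MANIFEST = {"linux/amd64": "sha256:abc...", "linux/arm64": "sha256:def..."}
```

A pull on an arm64 host resolves to the `linux/arm64` entry, and a platform that was never built raises an error instead of silently falling back.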
 #### Output
 - **Docker Images**: 2 platform images (amd64, arm64) behind 1 manifest list, published under 4 tags
 - **Docker Hub**: https://hub.docker.com/r/unclecode/crawl4ai/tags
 #### Timeline
 **Cold Cache (First Build)**:
 ```
 0:00 - Release event received
 0:01 - Checkout + Buildx setup
 0:02 - Docker Hub auth
 0:03 - Start build (amd64)
 0:08 - Complete amd64 build
 0:09 - Start build (arm64)
 0:14 - Complete arm64 build
 0:15 - Generate manifests
 0:16 - Push all tags
 0:17 - Workflow complete
 ```
 **Warm Cache (Code Change Only)**:
 ```
 0:00 - Release event received
 0:01 - Checkout + Buildx setup
 0:02 - Docker Hub auth
 0:03 - Start build (amd64) - cache hit for layers 1-4
 0:04 - Complete amd64 build (only layer 5 rebuilt)
 0:05 - Start build (arm64) - cache hit for layers 1-4
 0:06 - Complete arm64 build (only layer 5 rebuilt)
 0:07 - Generate manifests
 0:08 - Push all tags
 0:09 - Workflow complete
 ```
 ---
 ## Data Flow
 ### Version Information Flow
 ```
 Developer
  │
  ▼
 crawl4ai/__version__.py
  __version__ = "1.2.3"
  │
  ├─► Git Tag
  │     v1.2.3
  │       │
  │       ▼
  │     release.yml
  │       │
  │       ├─► Validation
  │       │     ✓ Match
  │       │
  │       ├─► PyPI Package
  │       │     crawl4ai==1.2.3
  │       │
  │       └─► GitHub Release
  │             v1.2.3
  │               │
  │               ▼
  │           docker-release.yml
  │               │
  │               └─► Docker Tags
  │                     1.2.3, 1.2, 1, latest
  │
  └─► Package Metadata
        pyproject.toml
          version = "1.2.3"
 ```
 ### Secrets Flow
 ```
 GitHub Secrets (Encrypted at Rest)
  │
  ├─► PYPI_TOKEN
  │     │
  │     ▼
  │   release.yml
  │     │
  │     ▼
  │   TWINE_PASSWORD env var (masked in logs)
  │     │
  │     ▼
  │   PyPI API (HTTPS)
  │
  ├─► DOCKER_USERNAME
  │     │
  │     ▼
  │   docker-release.yml
  │     │
  │     ▼
  │   docker/login-action (masked in logs)
  │     │
  │     ▼
  │   Docker Hub API (HTTPS)
  │
  └─► DOCKER_TOKEN
        │
        ▼
      docker-release.yml
        │
        ▼
      docker/login-action (masked in logs)
        │
        ▼
      Docker Hub API (HTTPS)
 ```
 ### Artifact Flow
 ```
 Source Code
  │
  ├─► release.yml
  │     │
  │     ▼
  │   python -m build
  │     │
  │     ├─► crawl4ai-1.2.3.tar.gz
  │     │     │
  │     │     ▼
  │     │   PyPI Storage
  │     │     │
  │     │     ▼
  │     │   pip install crawl4ai
  │     │
  │     └─► crawl4ai-1.2.3-py3-none-any.whl
  │           │
  │           ▼
  │         PyPI Storage
  │           │
  │           ▼
  │         pip install crawl4ai
  │
  └─► docker-release.yml
        │
        ▼
      docker build
        │
        ├─► Image: linux/amd64
        │     │
        │     └─► Docker Hub
         │           unclecode/crawl4ai:1.2.3 (linux/amd64)
        │
        └─► Image: linux/arm64
              │
              └─► Docker Hub
                     unclecode/crawl4ai:1.2.3 (linux/arm64)
 ```
 ---
 ## State Machines
 ### Release Pipeline State Machine
 ```
 ┌─────────┐
 │  START  │
 └────┬────┘
     │
     ▼
 ┌──────────────┐
 │ Extract      │
 │ Version      │
 └──────┬───────┘
       │
       ▼
 ┌──────────────┐      ┌─────────┐
 │ Validate     │─────►│ FAILED  │
 │ Version      │ No   │ (Exit 1)│
 └──────┬───────┘      └─────────┘
       │ Yes
       ▼
 ┌──────────────┐
 │ Build        │
 │ Package      │
 └──────┬───────┘
       │
       ▼
 ┌──────────────┐      ┌─────────┐
 │ Upload       │─────►│ FAILED  │
 │ to PyPI      │ Error│ (Exit 1)│
 └──────┬───────┘      └─────────┘
       │ Success
       ▼
 ┌──────────────┐
 │ Create       │
 │ GH Release   │
 └──────┬───────┘
       │
       ▼
 ┌──────────────┐
 │  SUCCESS     │
 │ (Emit Event) │
 └──────────────┘
 ```
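The release state machine amounts to running stages in order and stopping at the first failure. The sketch below assumes, for simplicity, that every stage can fail (the diagram only draws failure edges for validation and upload); `STAGES` and `run_release_pipeline` are hypothetical names.

```python
from typing import Callable, Dict

# Stage names follow the boxes in the diagram above.
STAGES = [
    "extract_version",
    "validate_version",
    "build_package",
    "upload_to_pypi",
    "create_gh_release",
]


def run_release_pipeline(steps: Dict[str, Callable[[], bool]]) -> str:
    """Run each stage in order; stop at the first one that reports failure."""
    for stage in STAGES:
        if not steps[stage]():
            return f"FAILED at {stage}"
    return "SUCCESS"
```

Because the loop returns early, a failed validation means the build, upload, and release stages never run, which matches the `FAILED (Exit 1)` branches in the diagram.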
 ### Docker Pipeline State Machine
 ```
 ┌─────────┐
 │  START  │
 │ (Event) │
 └────┬────┘
     │
     ▼
 ┌──────────────┐
 │ Detect       │
 │ Version      │
 │ Source       │
 └──────┬───────┘
       │
       ▼
 ┌──────────────┐
 │ Parse        │
 │ Semantic     │
 │ Versions     │
 └──────┬───────┘
       │
       ▼
 ┌──────────────┐      ┌─────────┐
 │ Authenticate │─────►│ FAILED  │
 │ Docker Hub   │ Error│ (Exit 1)│
 └──────┬───────┘      └─────────┘
       │ Success
       ▼
 ┌──────────────┐
 │ Build        │
 │ amd64        │
 └──────┬───────┘
       │
       ▼
 ┌──────────────┐      ┌─────────┐
 │ Build        │─────►│ FAILED  │
 │ arm64        │ Error│ (Exit 1)│
 └──────┬───────┘      └─────────┘
       │ Success
       ▼
 ┌──────────────┐
 │ Push All     │
 │ Tags         │
 └──────┬───────┘
       │
       ▼
 ┌──────────────┐
 │  SUCCESS     │
 └──────────────┘
 ```
 ---
 ## Security Architecture
 ### Threat Model
 #### Threats Mitigated
 1. **Secret Exposure**
   - Mitigation: GitHub Actions secret masking
   - Evidence: Secrets never appear in logs
 2. **Unauthorized Package Upload**
   - Mitigation: Scoped PyPI tokens
   - Evidence: Token limited to `crawl4ai` project
 3. **Man-in-the-Middle**
   - Mitigation: HTTPS for all API calls
   - Evidence: PyPI, Docker Hub, GitHub all use TLS
 4. **Supply Chain Tampering**
   - Mitigation: Immutable artifacts, content checksums
   - Evidence: PyPI stores SHA256, Docker uses content-addressable storage
 #### Trust Boundaries
 ```
 ┌─────────────────────────────────────────┐
 │         Trusted Zone                     │
 │  ┌────────────────────────────────┐    │
 │  │  GitHub Actions Runner         │    │
 │  │  - Ephemeral VM                │    │
 │  │  - Isolated environment        │    │
 │  │  - Access to secrets           │    │
 │  └────────────────────────────────┘    │
 │                │                         │
 │                │ HTTPS (TLS 1.2+)       │
 │                ▼                         │
 └─────────────────────────────────────────┘
                 │
    ┌────────────┼────────────┐
    │            │            │
    ▼            ▼            ▼
 ┌────────┐  ┌─────────┐  ┌──────────┐
 │  PyPI  │  │  Docker │  │  GitHub  │
 │  API   │  │  Hub    │  │  API     │
 └────────┘  └─────────┘  └──────────┘
 External     External     External
  Service      Service      Service
 ```
 ### Secret Management
 #### Secret Lifecycle
 ```
 Creation (Developer)
  │
  ├─► PyPI: Create API token (scoped to project)
  ├─► Docker Hub: Create access token (read/write)
  │
  ▼
 Storage (GitHub)
  │
  ├─► Encrypted at rest (AES-256)
  ├─► Access controlled (repo-scoped)
  │
  ▼
 Usage (Workflow)
  │
  ├─► Injected as env vars
  ├─► Masked in logs (GitHub redacts on output)
  ├─► Never persisted to disk (in-memory only)
  │
  ▼
 Transmission (API Call)
  │
  ├─► HTTPS only
  ├─► TLS 1.2+ with strong ciphers
  │
  ▼
 Rotation (Manual)
  │
  └─► Regenerate on PyPI/Docker Hub
      Update GitHub secret
 ```
 ---
 ## Performance Characteristics
 ### Release Pipeline Performance
 | Metric | Value | Notes |
 |--------|-------|-------|
 | Cold start | ~2-3 min | First run on new runner |
 | Warm start | ~2-3 min | Minimal caching benefit |
 | PyPI upload | ~30-60 sec | Network-bound |
 | Package build | ~30 sec | CPU-bound |
 | Parallelization | None | Sequential by design |
 ### Docker Pipeline Performance
 | Metric | Cold Cache | Warm Cache (code) | Warm Cache (deps) |
 |--------|-----------|-------------------|-------------------|
 | Total time | 10-15 min | 1-2 min | 3-5 min |
 | amd64 build | 5-7 min | 30-60 sec | 1-2 min |
 | arm64 build | 5-7 min | 30-60 sec | 1-2 min |
 | Push time | 1-2 min | 30 sec | 30 sec |
 | Cache hit rate | 0% | 85% | 60% |
 ### Cache Performance Model
 ```python
 def estimate_build_time(changes):
    base_time = 60  # seconds (setup + push)
    if "Dockerfile" in changes:
        return base_time + (10 * 60)  # Full rebuild: ~11 min
    elif "requirements.txt" in changes:
        return base_time + (3 * 60)   # Deps rebuild: ~4 min
    elif any(f.endswith(".py") for f in changes):
        return base_time + 60          # Code only: ~2 min
    else:
        return base_time               # No changes: ~1 min
 ```
 ---
 ## Scalability Considerations
 ### Current Limits
 | Resource | Limit | Impact |
 |----------|-------|--------|
 | Workflow concurrency | 20 (default) | Max 20 releases in parallel |
 | Artifact storage | 500 MB/artifact | PyPI packages small (<10 MB) |
 | Cache storage | 10 GB/repo | Docker layers fit comfortably |
 | Workflow run time | 6 hours | Plenty of headroom |
 ### Scaling Strategies
 #### Horizontal Scaling (Multiple Repos)
 ```
 crawl4ai (main)
  ├─ release.yml
  └─ docker-release.yml
 crawl4ai-plugins (separate)
  ├─ release.yml
  └─ docker-release.yml
 Each repo has independent:
  - Secrets
  - Cache (10 GB each)
  - Concurrency limits (20 each)
 ```
 #### Vertical Scaling (Larger Runners)
 ```yaml
 jobs:
  docker:
    runs-on: ubuntu-latest-8-cores  # GitHub-hosted larger runner
     # More cores can shorten CPU-bound layers; the runner label depends on your larger-runner setup
 ```
 ---
 ## Disaster Recovery
 ### Failure Scenarios
 #### Scenario 1: Release Pipeline Fails
 **Failure Point**: PyPI upload fails (network error)
 **State**:
 - ✓ Version validated
 - ✓ Package built
 - ✗ PyPI upload
 - ✗ GitHub release
 **Recovery**:
 ```bash
 # Manual upload
 twine upload dist/*
 # Retry workflow (re-run from GitHub Actions UI)
 ```
 **Prevention**: Add retry logic to PyPI upload
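One possible shape for that retry logic is exponential backoff around the upload step. This is a minimal sketch, not the project's implementation: `retry` is a hypothetical helper, the delays are arbitrary, and the `action` callable is assumed to wrap the `twine upload dist/*` command and raise on failure.

```python
import time
from typing import Callable


def retry(
    action: Callable[[], None],
    attempts: int = 3,
    base_delay: float = 2.0,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Call action until it succeeds; return the attempt number that worked."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(1, attempts + 1):
        try:
            action()
            return attempt
        except Exception:
            if attempt == attempts:
                raise  # Out of attempts: surface the last error.
            # Wait base_delay, then 2x, 4x, ... before the next try.
            sleep(base_delay * 2 ** (attempt - 1))
    raise AssertionError("unreachable")
```

The `sleep` parameter is injectable so the backoff schedule can be checked without real waiting. Retrying only helps with transient network errors; a "file exists" rejection from PyPI will fail on every attempt.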
 #### Scenario 2: Docker Pipeline Fails
 **Failure Point**: ARM build fails (dependency issue)
 **State**:
 - ✓ PyPI published
 - ✓ GitHub release created
 - ✓ amd64 image built
 - ✗ arm64 image build
 **Recovery**:
 ```bash
 # Fix Dockerfile
 git commit -am "fix: ARM build dependency"
 # Trigger rebuild
 git tag docker-rebuild-v1.2.3
 git push origin docker-rebuild-v1.2.3
 ```
 **Impact**: PyPI package available, only Docker ARM users affected
 #### Scenario 3: Partial Release
 **Failure Point**: GitHub release creation fails
 **State**:
 - ✓ PyPI published
 - ✗ GitHub release
 - ✗ Docker images
 **Recovery**:
 ```bash
 # Create release manually
 gh release create v1.2.3 \
  --title "Release v1.2.3" \
  --notes "..."
 # This triggers docker-release.yml automatically
 ```
 ---
 ## Monitoring and Observability
 ### Metrics to Track
 #### Release Pipeline
 - Success rate (target: >99%)
 - Duration (target: <3 min)
 - PyPI upload time (target: <60 sec)
 #### Docker Pipeline
 - Success rate (target: >95%)
 - Duration (target: <15 min cold, <2 min warm)
 - Cache hit rate (target: >80% for code changes)
 ### Alerting
 **Critical Alerts**:
 - Release pipeline failure (blocks release)
 - PyPI authentication failure (expired token)
 **Warning Alerts**:
 - Docker build >15 min (performance degradation)
 - Cache hit rate <50% (cache issue)
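The alert thresholds above could be encoded as a simple classifier over one workflow run. This sketch is an assumption about how such a check might look: `alerts_for_run` and its parameters are hypothetical, and the PyPI authentication alert is left out because it needs error details that are not modeled here.

```python
from typing import List, Optional


def alerts_for_run(
    pipeline: str,
    succeeded: bool,
    duration_min: float,
    cache_hit_rate: Optional[float] = None,
) -> List[str]:
    """Return the alerts a single run would raise under the thresholds above."""
    alerts = []
    if pipeline == "release" and not succeeded:
        alerts.append("critical: release pipeline failure")
    if pipeline == "docker":
        if duration_min > 15:
            alerts.append("warning: docker build over 15 min")
        # cache_hit_rate is a fraction between 0 and 1, when known.
        if cache_hit_rate is not None and cache_hit_rate < 0.50:
            alerts.append("warning: cache hit rate below 50%")
    return alerts
```

A healthy warm-cache Docker run (2 minutes, 85% hit rate) produces no alerts, while a 20-minute build with a 30% hit rate produces both warnings.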
 ### Logging
 **GitHub Actions Logs**:
 - Retention: 90 days
 - Downloadable: Yes
 - Searchable: Limited
 **Recommended External Logging**:
 ```yaml
 - name: Send logs to external service
  if: failure()
  run: |
    curl -X POST https://logs.example.com/api/v1/logs \
      -H "Content-Type: application/json" \
      -d "{\"workflow\": \"${{ github.workflow }}\", \"status\": \"failed\"}"
 ```
 ---
 ## Future Enhancements
 ### Planned Improvements
 1. **Automated Changelog Generation**
   - Use conventional commits
   - Generate CHANGELOG.md automatically
 2. **Pre-release Testing**
   - Test builds on `test-v*` tags
   - Upload to TestPyPI
 3. **Notification System**
   - Slack/Discord notifications on release
   - Email on failure
 4. **Performance Optimization**
   - Parallel Docker builds (amd64 + arm64 simultaneously)
   - Persistent runners for better caching
 5. **Enhanced Validation**
   - Smoke tests after PyPI upload
   - Container security scanning
 ---
 ## References
 - [GitHub Actions Architecture](https://docs.github.com/en/actions/learn-github-actions/understanding-github-actions)
 - [Docker Build Cache](https://docs.docker.com/build/cache/)
 - [PyPI API Documentation](https://warehouse.pypa.io/api-reference/)
 ---
 **Last Updated**: 2025-01-21
 **Version**: 2.0
--- a/.github/workflows/docs/README.md
+++ b/.github/workflows/docs/README.md
--- a/.github/workflows/docs/WORKFLOW_REFERENCE.md
+++ b/.github/workflows/docs/WORKFLOW_REFERENCE.md
@@ -1,287 +0,0 @@
 # Workflow Quick Reference
 ## Quick Commands
 ### Standard Release
 ```bash
 # 1. Update version
 vim crawl4ai/__version__.py  # Set to "1.2.3"
 # 2. Commit and tag
 git add crawl4ai/__version__.py
 git commit -m "chore: bump version to 1.2.3"
 git tag v1.2.3
 git push origin main
 git push origin v1.2.3
 # 3. Monitor
 # - PyPI: ~2-3 minutes
 # - Docker: ~1-15 minutes
 ```
 ### Docker Rebuild Only
 ```bash
 git tag docker-rebuild-v1.2.3
 git push origin docker-rebuild-v1.2.3
 ```
 ### Delete Tag (Undo Release)
 ```bash
 # Local
 git tag -d v1.2.3
 # Remote
 git push --delete origin v1.2.3
 # GitHub Release
 gh release delete v1.2.3
 ```
 ---
 ## Workflow Triggers
 ### release.yml
 | Event | Pattern | Example |
 |-------|---------|---------|
 | Tag push | `v*` | `v1.2.3` |
 | Excludes | `test-v*` | `test-v1.2.3` |
 ### docker-release.yml
 | Event | Pattern | Example |
 |-------|---------|---------|
 | Release published | `release.published` | Automatic |
 | Tag push | `docker-rebuild-v*` | `docker-rebuild-v1.2.3` |
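To check which workflow a tag name would start, the patterns from both tables can be approximated with Python's `fnmatch`. This is an approximation under assumptions: GitHub's filter syntax is not identical to `fnmatch` (for example, `*` in a GitHub pattern does not match `/`), and `workflow_for_tag` is a hypothetical name.

```python
from fnmatch import fnmatchcase
from typing import Optional


def workflow_for_tag(tag: str) -> Optional[str]:
    """Return the workflow file a pushed tag would trigger, or None."""
    if fnmatchcase(tag, "docker-rebuild-v*"):
        return "docker-release.yml"
    # release.yml matches v* and explicitly excludes test-v*.
    if fnmatchcase(tag, "v*") and not fnmatchcase(tag, "test-v*"):
        return "release.yml"
    return None
```

Note that a pushed `v*` tag starts only `release.yml` directly; `docker-release.yml` then follows from the `release.published` event rather than from the tag itself.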
 ---
 ## Environment Variables
 ### release.yml
 | Variable | Source | Example |
 |----------|--------|---------|
 | `VERSION` | Git tag | `1.2.3` |
 | `TWINE_USERNAME` | Static | `__token__` |
 | `TWINE_PASSWORD` | Secret | `pypi-Ag...` |
 | `GITHUB_TOKEN` | Auto | `ghs_...` |
 ### docker-release.yml
 | Variable | Source | Example |
 |----------|--------|---------|
 | `VERSION` | Release/Tag | `1.2.3` |
 | `MAJOR` | Computed | `1` |
 | `MINOR` | Computed | `1.2` |
 | `DOCKER_USERNAME` | Secret | `unclecode` |
 | `DOCKER_TOKEN` | Secret | `dckr_pat_...` |
 ---
 ## Docker Tags Generated
 | Version | Tags Created |
 |---------|-------------|
 | v1.0.0 | `1.0.0`, `1.0`, `1`, `latest` |
 | v1.1.0 | `1.1.0`, `1.1`, `1`, `latest` |
 | v1.2.3 | `1.2.3`, `1.2`, `1`, `latest` |
 | v2.0.0 | `2.0.0`, `2.0`, `2`, `latest` |
 ---
 ## Workflow Outputs
 ### release.yml
 | Output | Location | Time |
 |--------|----------|------|
 | PyPI Package | https://pypi.org/project/crawl4ai/ | ~2-3 min |
 | GitHub Release | Repository → Releases | ~2-3 min |
 | Workflow Summary | Actions → Run → Summary | Immediate |
 ### docker-release.yml
 | Output | Location | Time |
 |--------|----------|------|
 | Docker Images | https://hub.docker.com/r/unclecode/crawl4ai | ~1-15 min |
 | Workflow Summary | Actions → Run → Summary | Immediate |
 ---
 ## Common Issues
 | Issue | Solution |
 |-------|----------|
 | Version mismatch | Update `crawl4ai/__version__.py` to match tag |
 | PyPI 403 Forbidden | Check `PYPI_TOKEN` secret |
 | PyPI 400 File exists | Version already published, increment version |
 | Docker auth failed | Regenerate `DOCKER_TOKEN` |
 | Docker build timeout | Check Dockerfile, review build logs |
 | Cache not working | First build on branch always cold |
 ---
 ## Secrets Checklist
 - [ ] `PYPI_TOKEN` - PyPI API token (project or account scope)
 - [ ] `DOCKER_USERNAME` - Docker Hub username
 - [ ] `DOCKER_TOKEN` - Docker Hub access token (read/write)
 - [ ] `GITHUB_TOKEN` - Auto-provided (no action needed)
 ---
 ## Workflow Dependencies
 ### release.yml Dependencies
 ```yaml
 Python: 3.12
 Actions:
  - actions/checkout@v4
  - actions/setup-python@v5
  - softprops/action-gh-release@v2
 PyPI Packages:
  - build
  - twine
 ```
 ### docker-release.yml Dependencies
 ```yaml
 Actions:
  - actions/checkout@v4
  - docker/setup-buildx-action@v3
  - docker/login-action@v3
  - docker/build-push-action@v5
 Docker:
  - Buildx
  - QEMU (for multi-arch)
 ```
 ---
 ## Cache Information
 ### Type
 - GitHub Actions Cache (`type=gha`)
 ### Storage
 - **Limit**: 10GB per repository
 - **Retention**: 7 days for unused entries
 - **Cleanup**: Automatic LRU eviction
 ### Performance
 | Scenario | Cache Hit | Build Time |
 |----------|-----------|------------|
 | First build | 0% | 10-15 min |
 | Code change only | 85% | 1-2 min |
 | Dependency update | 60% | 3-5 min |
 | No changes | 100% | 30-60 sec |
 ---
 ## Build Platforms
 | Platform | Architecture | Devices |
 |----------|--------------|---------|
 | linux/amd64 | x86_64 | Intel/AMD servers, AWS EC2, GCP |
 | linux/arm64 | aarch64 | Apple Silicon, AWS Graviton, Raspberry Pi |
 ---
 ## Version Validation
 ### Pre-Tag Checklist
 ```bash
 # Check current version
 python -c "from crawl4ai.__version__ import __version__; print(__version__)"
 # Verify it matches intended tag
 # If tag is v1.2.3, version should be "1.2.3"
 ```
 ### Post-Release Verification
 ```bash
 # PyPI
 pip install crawl4ai==1.2.3
 python -c "import crawl4ai; print(crawl4ai.__version__)"
 # Docker
 docker pull unclecode/crawl4ai:1.2.3
 docker run unclecode/crawl4ai:1.2.3 python -c "import crawl4ai; print(crawl4ai.__version__)"
 ```
 ---
 ## Monitoring URLs
 | Service | URL |
 |---------|-----|
 | GitHub Actions | `https://github.com/{owner}/{repo}/actions` |
 | PyPI Project | `https://pypi.org/project/crawl4ai/` |
 | Docker Hub | `https://hub.docker.com/r/unclecode/crawl4ai` |
 | GitHub Releases | `https://github.com/{owner}/{repo}/releases` |
 ---
 ## Rollback Strategy
 ### PyPI (Cannot Delete)
 ```bash
 # Increment patch version
 git tag v1.2.4
 git push origin v1.2.4
 ```
 ### Docker (Can Overwrite)
 ```bash
 # Rebuild with fix
 git tag docker-rebuild-v1.2.3
 git push origin docker-rebuild-v1.2.3
 ```
 ### GitHub Release
 ```bash
 # Delete release
 gh release delete v1.2.3
 # Delete tag
 git push --delete origin v1.2.3
 ```
 ---
 ## Status Badge Markdown
 ```markdown
 [![Release Pipeline](https://github.com/{owner}/{repo}/actions/workflows/release.yml/badge.svg)](https://github.com/{owner}/{repo}/actions/workflows/release.yml)
 [![Docker Release](https://github.com/{owner}/{repo}/actions/workflows/docker-release.yml/badge.svg)](https://github.com/{owner}/{repo}/actions/workflows/docker-release.yml)
 ```
 ---
 ## Timeline Example
 ```
 0:00 - Push tag v1.2.3
 0:01 - release.yml starts
 0:02 - Version validation passes
 0:03 - Package built
 0:04 - PyPI upload starts
 0:06 - PyPI upload complete ✓
 0:07 - GitHub release created ✓
 0:08 - release.yml complete
 0:08 - docker-release.yml triggered
 0:10 - Docker build starts
 0:12 - amd64 image built (cache hit)
 0:14 - arm64 image built (cache hit)
 0:15 - Images pushed to Docker Hub ✓
 0:16 - docker-release.yml complete
 Total: ~16 minutes
 Critical path (PyPI + GitHub): ~8 minutes
 ```
 ---
 ## Contact
 For workflow issues:
 1. Check Actions tab for logs
 2. Review this reference
 3. See [README.md](./README.md) for detailed docs
--- a/.github/workflows/release.yml
+++ b/.github/workflows/release.yml
@@ -66,6 +66,36 @@ jobs:
          twine upload dist/*
          echo "✅ Package uploaded to https://pypi.org/project/crawl4ai/"
      - name: Set up Docker Buildx
        uses: docker/setup-buildx-action@v3
      - name: Log in to Docker Hub
        uses: docker/login-action@v3
        with:
          username: ${{ secrets.DOCKER_USERNAME }}
          password: ${{ secrets.DOCKER_TOKEN }}
      - name: Extract major and minor versions
        id: versions
        run: |
          VERSION=${{ steps.get_version.outputs.VERSION }}
          MAJOR=$(echo $VERSION | cut -d. -f1)
          MINOR=$(echo $VERSION | cut -d. -f1-2)
          echo "MAJOR=$MAJOR" >> $GITHUB_OUTPUT
          echo "MINOR=$MINOR" >> $GITHUB_OUTPUT
      - name: Build and push Docker images
        uses: docker/build-push-action@v5
        with:
          context: .
          push: true
          tags: |
            unclecode/crawl4ai:${{ steps.get_version.outputs.VERSION }}
            unclecode/crawl4ai:${{ steps.versions.outputs.MINOR }}
            unclecode/crawl4ai:${{ steps.versions.outputs.MAJOR }}
            unclecode/crawl4ai:latest
          platforms: linux/amd64,linux/arm64
      - name: Create GitHub Release
        uses: softprops/action-gh-release@v2
        with:
@@ -87,9 +117,6 @@ jobs:
            docker pull unclecode/crawl4ai:latest
            ```
            **Note:** Docker images are being built and will be available shortly.
            Check the [Docker Release workflow](https://github.com/${{ github.repository }}/actions/workflows/docker-release.yml) for build status.
            ### 📝 What's Changed
            See [CHANGELOG.md](https://github.com/${{ github.repository }}/blob/main/CHANGELOG.md) for details.
          draft: false
@@ -105,9 +132,11 @@ jobs:
          echo "- URL: https://pypi.org/project/crawl4ai/" >> $GITHUB_STEP_SUMMARY
          echo "- Install: \`pip install crawl4ai==${{ steps.get_version.outputs.VERSION }}\`" >> $GITHUB_STEP_SUMMARY
          echo "" >> $GITHUB_STEP_SUMMARY
          echo "### 📋 GitHub Release" >> $GITHUB_STEP_SUMMARY
          echo "- https://github.com/${{ github.repository }}/releases/tag/v${{ steps.get_version.outputs.VERSION }}" >> $GITHUB_STEP_SUMMARY
          echo "" >> $GITHUB_STEP_SUMMARY
          echo "### 🐳 Docker Images" >> $GITHUB_STEP_SUMMARY
-          echo "Docker images are being built in a separate workflow." >> $GITHUB_STEP_SUMMARY
+          echo "- \`unclecode/crawl4ai:${{ steps.get_version.outputs.VERSION }}\`" >> $GITHUB_STEP_SUMMARY
-          echo "Check: https://github.com/${{ github.repository }}/actions/workflows/docker-release.yml" >> $GITHUB_STEP_SUMMARY
+          echo "- \`unclecode/crawl4ai:${{ steps.versions.outputs.MINOR }}\`" >> $GITHUB_STEP_SUMMARY
          echo "- \`unclecode/crawl4ai:${{ steps.versions.outputs.MAJOR }}\`" >> $GITHUB_STEP_SUMMARY
          echo "- \`unclecode/crawl4ai:latest\`" >> $GITHUB_STEP_SUMMARY
          echo "" >> $GITHUB_STEP_SUMMARY
          echo "### 📋 GitHub Release" >> $GITHUB_STEP_SUMMARY
          echo "https://github.com/${{ github.repository }}/releases/tag/v${{ steps.get_version.outputs.VERSION }}" >> $GITHUB_STEP_SUMMARY
--- a/.github/workflows/release.yml.backup
+++ b/.github/workflows/release.yml.backup
@@ -1,142 +0,0 @@
 name: Release Pipeline
 on:
  push:
    tags:
      - 'v*'
      - '!test-v*'  # Exclude test tags
 jobs:
  release:
    runs-on: ubuntu-latest
    permissions:
      contents: write  # Required for creating releases
    steps:
      - name: Checkout code
        uses: actions/checkout@v4
      - name: Set up Python
        uses: actions/setup-python@v5
        with:
          python-version: '3.12'
      - name: Extract version from tag
        id: get_version
        run: |
          TAG_VERSION=${GITHUB_REF#refs/tags/v}
          echo "VERSION=$TAG_VERSION" >> $GITHUB_OUTPUT
          echo "Releasing version: $TAG_VERSION"
      - name: Install package dependencies
        run: |
          pip install -e .
      - name: Check version consistency
        run: |
          TAG_VERSION=${{ steps.get_version.outputs.VERSION }}
          PACKAGE_VERSION=$(python -c "from crawl4ai.__version__ import __version__; print(__version__)")
          echo "Tag version: $TAG_VERSION"
          echo "Package version: $PACKAGE_VERSION"
          if [ "$TAG_VERSION" != "$PACKAGE_VERSION" ]; then
            echo "❌ Version mismatch! Tag: $TAG_VERSION, Package: $PACKAGE_VERSION"
            echo "Please update crawl4ai/__version__.py to match the tag version"
            exit 1
          fi
          echo "✅ Version check passed: $TAG_VERSION"
      - name: Install build dependencies
        run: |
          python -m pip install --upgrade pip
          pip install build twine
      - name: Build package
        run: python -m build
      - name: Check package
        run: twine check dist/*
      - name: Upload to PyPI
        env:
          TWINE_USERNAME: __token__
          TWINE_PASSWORD: ${{ secrets.PYPI_TOKEN }}
        run: |
          echo "📦 Uploading to PyPI..."
          twine upload dist/*
          echo "✅ Package uploaded to https://pypi.org/project/crawl4ai/"
      - name: Set up Docker Buildx
        uses: docker/setup-buildx-action@v3
      - name: Log in to Docker Hub
        uses: docker/login-action@v3
        with:
          username: ${{ secrets.DOCKER_USERNAME }}
          password: ${{ secrets.DOCKER_TOKEN }}
      - name: Extract major and minor versions
        id: versions
        run: |
          VERSION=${{ steps.get_version.outputs.VERSION }}
          MAJOR=$(echo $VERSION | cut -d. -f1)
          MINOR=$(echo $VERSION | cut -d. -f1-2)
          echo "MAJOR=$MAJOR" >> $GITHUB_OUTPUT
          echo "MINOR=$MINOR" >> $GITHUB_OUTPUT
      - name: Build and push Docker images
        uses: docker/build-push-action@v5
        with:
          context: .
          push: true
          tags: |
            unclecode/crawl4ai:${{ steps.get_version.outputs.VERSION }}
            unclecode/crawl4ai:${{ steps.versions.outputs.MINOR }}
            unclecode/crawl4ai:${{ steps.versions.outputs.MAJOR }}
            unclecode/crawl4ai:latest
          platforms: linux/amd64,linux/arm64
      - name: Create GitHub Release
        uses: softprops/action-gh-release@v2
        with:
          tag_name: v${{ steps.get_version.outputs.VERSION }}
          name: Release v${{ steps.get_version.outputs.VERSION }}
          body: |
            ## 🎉 Crawl4AI v${{ steps.get_version.outputs.VERSION }} Released!
            ### 📦 Installation
            **PyPI:**
            ```bash
            pip install crawl4ai==${{ steps.get_version.outputs.VERSION }}
            ```
            **Docker:**
            ```bash
            docker pull unclecode/crawl4ai:${{ steps.get_version.outputs.VERSION }}
            docker pull unclecode/crawl4ai:latest
            ```
            ### 📝 What's Changed
            See [CHANGELOG.md](https://github.com/${{ github.repository }}/blob/main/CHANGELOG.md) for details.
          draft: false
          prerelease: false
          token: ${{ secrets.GITHUB_TOKEN }}
      - name: Summary
        run: |
          echo "## 🚀 Release Complete!" >> $GITHUB_STEP_SUMMARY
          echo "" >> $GITHUB_STEP_SUMMARY
          echo "### 📦 PyPI Package" >> $GITHUB_STEP_SUMMARY
          echo "- Version: ${{ steps.get_version.outputs.VERSION }}" >> $GITHUB_STEP_SUMMARY
          echo "- URL: https://pypi.org/project/crawl4ai/" >> $GITHUB_STEP_SUMMARY
          echo "- Install: \`pip install crawl4ai==${{ steps.get_version.outputs.VERSION }}\`" >> $GITHUB_STEP_SUMMARY
          echo "" >> $GITHUB_STEP_SUMMARY
          echo "### 🐳 Docker Images" >> $GITHUB_STEP_SUMMARY
          echo "- \`unclecode/crawl4ai:${{ steps.get_version.outputs.VERSION }}\`" >> $GITHUB_STEP_SUMMARY
          echo "- \`unclecode/crawl4ai:${{ steps.versions.outputs.MINOR }}\`" >> $GITHUB_STEP_SUMMARY
          echo "- \`unclecode/crawl4ai:${{ steps.versions.outputs.MAJOR }}\`" >> $GITHUB_STEP_SUMMARY
          echo "- \`unclecode/crawl4ai:latest\`" >> $GITHUB_STEP_SUMMARY
          echo "" >> $GITHUB_STEP_SUMMARY
          echo "### 📋 GitHub Release" >> $GITHUB_STEP_SUMMARY
          echo "https://github.com/${{ github.repository }}/releases/tag/v${{ steps.get_version.outputs.VERSION }}" >> $GITHUB_STEP_SUMMARY
--- a/.gitignore
+++ b/.gitignore
@@ -266,11 +266,11 @@ continue_config.json
 .llm.env
 .private/
 .claude/
 CLAUDE_MONITOR.md
 CLAUDE.md
 .claude/
 tests/**/test_site
 tests/**/reports
 tests/**/benchmark_reports
@@ -282,3 +282,14 @@ docs/apps/linkdin/debug*/
 docs/apps/linkdin/samples/insights/*
 scripts/
 # Database files
 *.sqlite3
 *.sqlite3-journal
 *.db-journal
 *.db-wal
 *.db-shm
 *.db
 *.rdb
 *.ldb
--- a/crawl4ai/init.py
+++ b/crawl4ai/init.py
@@ -103,8 +103,7 @@ from .browser_adapter import (
 from .utils import (
    start_colab_display_server,
-    setup_colab_environment,
+    setup_colab_environment
    hooks_to_string
 )
 __all__ = [
@@ -184,7 +183,6 @@ __all__ = [
    "ProxyConfig",
    "start_colab_display_server",
    "setup_colab_environment",
    "hooks_to_string",
    # C4A Script additions
    "c4a_compile",
    "c4a_validate", 
--- a/crawl4ai/docker_client.py
+++ b/crawl4ai/docker_client.py
@@ -1,4 +1,4 @@
-from typing import List, Optional, Union, AsyncGenerator, Dict, Any, Callable
+from typing import List, Optional, Union, AsyncGenerator, Dict, Any
 import httpx
 import json
 from urllib.parse import urljoin
@@ -7,7 +7,6 @@ import asyncio
 from .async_configs import BrowserConfig, CrawlerRunConfig
 from .models import CrawlResult
 from .async_logger import AsyncLogger, LogLevel
 from .utils import hooks_to_string
 class Crawl4aiClientError(Exception):
@@ -71,41 +70,17 @@ class Crawl4aiDockerClient:
            self.logger.error(f"Server unreachable: {str(e)}", tag="ERROR")
            raise ConnectionError(f"Cannot connect to server: {str(e)}")
-    def _prepare_request(
+    def _prepare_request(self, urls: List[str], browser_config: Optional[BrowserConfig] = None, 
-        self,
+                       crawler_config: Optional[CrawlerRunConfig] = None) -> Dict[str, Any]:
        urls: List[str],
        browser_config: Optional[BrowserConfig] = None,
        crawler_config: Optional[CrawlerRunConfig] = None,
        hooks: Optional[Union[Dict[str, Callable], Dict[str, str]]] = None,
        hooks_timeout: int = 30
    ) -> Dict[str, Any]:
        """Prepare request data from configs."""
        if self._token:
            self._http_client.headers["Authorization"] = f"Bearer {self._token}"
-
+        return {
        request_data = {
            "urls": urls,
            "browser_config": browser_config.dump() if browser_config else {},
            "crawler_config": crawler_config.dump() if crawler_config else {}
        }
        # Handle hooks if provided
        if hooks:
            # Check if hooks are already strings or need conversion
            if any(callable(v) for v in hooks.values()):
                # Convert function objects to strings
                hooks_code = hooks_to_string(hooks)
            else:
                # Already in string format
                hooks_code = hooks
            request_data["hooks"] = {
                "code": hooks_code,
                "timeout": hooks_timeout
            }
        return request_data
    async def _request(self, method: str, endpoint: str, **kwargs) -> httpx.Response:
        """Make an HTTP request with error handling."""
        url = urljoin(self.base_url, endpoint)
@@ -127,38 +102,12 @@ class Crawl4aiDockerClient:
        self,
        urls: List[str],
        browser_config: Optional[BrowserConfig] = None,
-        crawler_config: Optional[CrawlerRunConfig] = None,
+        crawler_config: Optional[CrawlerRunConfig] = None
        hooks: Optional[Union[Dict[str, Callable], Dict[str, str]]] = None,
        hooks_timeout: int = 30
    ) -> Union[CrawlResult, List[CrawlResult], AsyncGenerator[CrawlResult, None]]:
-        """
+        """Execute a crawl operation."""
        Execute a crawl operation.
        Args:
            urls: List of URLs to crawl
            browser_config: Browser configuration
            crawler_config: Crawler configuration
            hooks: Optional hooks - can be either:
                   - Dict[str, Callable]: Function objects that will be converted to strings
                   - Dict[str, str]: Already stringified hook code
            hooks_timeout: Timeout in seconds for each hook execution (1-120)
        Returns:
            Single CrawlResult, list of results, or async generator for streaming
        Example with function hooks:
            >>> async def my_hook(page, context, **kwargs):
            ...     await page.set_viewport_size({"width": 1920, "height": 1080})
            ...     return page
            >>>
            >>> result = await client.crawl(
            ...     ["https://example.com"],
            ...     hooks={"on_page_context_created": my_hook}
            ... )
        """
        await self._check_server()
-        data = self._prepare_request(urls, browser_config, crawler_config, hooks, hooks_timeout)
+        data = self._prepare_request(urls, browser_config, crawler_config)
        is_streaming = crawler_config and crawler_config.stream
        self.logger.info(f"Crawling {len(urls)} URLs {'(streaming)' if is_streaming else ''}", tag="CRAWL")
--- a/crawl4ai/utils.py
+++ b/crawl4ai/utils.py
@@ -47,7 +47,6 @@ from urllib.parse import (
    urljoin, urlparse, urlunparse,
    parse_qsl, urlencode, quote, unquote
 )
 import inspect
 # Monkey patch to fix wildcard handling in urllib.robotparser
@@ -3531,51 +3530,3 @@ def get_memory_stats() -> Tuple[float, float, float]:
    used_percent = get_true_memory_usage_percent()
    return used_percent, available_gb, total_gb
 # Hook utilities for Docker API
 def hooks_to_string(hooks: Dict[str, Callable]) -> Dict[str, str]:
    """
    Convert hook function objects to string representations for Docker API.
    This utility simplifies the process of using hooks with the Docker API by converting
    Python function objects into the string format required by the API.
    Args:
        hooks: Dictionary mapping hook point names to Python function objects.
               Functions should be async and follow hook signature requirements.
    Returns:
        Dictionary mapping hook point names to string representations of the functions.
    Example:
        >>> async def my_hook(page, context, **kwargs):
        ...     await page.set_viewport_size({"width": 1920, "height": 1080})
        ...     return page
        >>>
        >>> hooks_dict = {"on_page_context_created": my_hook}
        >>> api_hooks = hooks_to_string(hooks_dict)
        >>> # api_hooks is now ready to use with Docker API
    Raises:
        ValueError: If a hook is not callable or source cannot be extracted
    """
    result = {}
    for hook_name, hook_func in hooks.items():
        if not callable(hook_func):
            raise ValueError(f"Hook '{hook_name}' must be a callable function, got {type(hook_func)}")
        try:
            # Get the source code of the function
            source = inspect.getsource(hook_func)
            # Remove any leading indentation to get clean source
            source = textwrap.dedent(source)
            result[hook_name] = source
        except (OSError, TypeError) as e:
            raise ValueError(
                f"Cannot extract source code for hook '{hook_name}'. "
                f"Make sure the function is defined in a file (not interactively). Error: {e}"
            )
    return result
--- a/deploy/docker/ARCHITECTURE.md
+++ b/deploy/docker/ARCHITECTURE.md
--- a/deploy/docker/STRESS_TEST_PIPELINE.md
+++ b/deploy/docker/STRESS_TEST_PIPELINE.md
@@ -0,0 +1,241 @@
 # Crawl4AI Docker Memory & Pool Optimization - Implementation Log
 ## Critical Issues Identified
 ### Memory Management
 - **Host vs Container**: `psutil.virtual_memory()` reported host memory, not container limits
 - **Browser Pooling**: No pool reuse - every endpoint created new browsers
 - **Warmup Waste**: Permanent browser sat idle with mismatched config signature
 - **Idle Cleanup**: 30min TTL too long, janitor ran every 60s
 - **Endpoint Inconsistency**: 75% of endpoints bypassed pool (`/md`, `/html`, `/screenshot`, `/pdf`, `/execute_js`, `/llm`)
 ### Pool Design Flaws
 - **Config Mismatch**: Permanent browser used `config.yml` args, endpoints used empty `BrowserConfig()`
 - **Logging Level**: Pool hit markers at DEBUG, invisible with INFO logging
 ## Implementation Changes
 ### 1. Container-Aware Memory Detection (`utils.py`)
 ```python
 def get_container_memory_percent() -> float:
    """Return container memory usage in percent (cgroup v2 → v1 → psutil fallback)."""
    # Reads /sys/fs/cgroup/memory.{current,max} OR memory/memory.{usage,limit}_in_bytes
    ...
 ```
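The cgroup lookup order described above could look roughly like the sketch below. It is an assumption-laden illustration rather than the code in `utils.py`: the names `container_memory_percent` and `_read_int`, the `root` parameter, and the `1 << 60` "unlimited" threshold are all hypothetical. It returns `None` when no container limit is found, so the caller can fall back to host metrics such as `psutil`.

```python
from pathlib import Path
from typing import Optional


def _read_int(path: Path) -> Optional[int]:
    """Read an integer from a cgroup file; return None if missing or non-numeric."""
    try:
        text = path.read_text().strip()
    except OSError:
        return None
    # cgroup v2 writes the literal string "max" when no limit is set.
    return int(text) if text.isdigit() else None


def container_memory_percent(root: Path = Path("/sys/fs/cgroup")) -> Optional[float]:
    """Return memory usage as a percentage of the container limit, or None."""
    candidates = [
        # cgroup v2 layout
        (root / "memory.current", root / "memory.max"),
        # cgroup v1 layout
        (root / "memory" / "memory.usage_in_bytes",
         root / "memory" / "memory.limit_in_bytes"),
    ]
    for usage_path, limit_path in candidates:
        usage = _read_int(usage_path)
        limit = _read_int(limit_path)
        # Assumption: cgroup v1 reports a very large sentinel when unlimited,
        # so anything at or above 1 << 60 is treated as "no limit".
        if usage is not None and limit and limit < (1 << 60):
            return 100.0 * usage / limit
    return None  # no container limit found; caller falls back to host metrics
```

Taking `root` as a parameter keeps the function testable against a temporary directory instead of the real `/sys/fs/cgroup`.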
 ### 2. Smart Browser Pool (`crawler_pool.py`)
 **3-Tier System:**
 - **PERMANENT**: Always-ready default browser (never cleaned)
 - **HOT_POOL**: Configs used 3+ times (longer TTL)
 - **COLD_POOL**: New/rare configs (short TTL)
 **Key Functions:**
 - `get_crawler(cfg)`: Check permanent → hot → cold → create new
 - `init_permanent(cfg)`: Initialize permanent at startup
 - `janitor()`: Adaptive cleanup (10s/30s/60s intervals based on memory)
 - `_sig(cfg)`: SHA1 hash of config dict for pool keys
 **Logging Fix**: Changed `logger.debug()` → `logger.info()` for pool hits
 ### 3. Endpoint Unification
 **Helper Function** (`server.py`):
 ```python
 def get_default_browser_config() -> BrowserConfig:
    return BrowserConfig(
        extra_args=config["crawler"]["browser"].get("extra_args", []),
        **config["crawler"]["browser"].get("kwargs", {}),
    )
 ```
 **Migrated Endpoints:**
 - `/html`, `/screenshot`, `/pdf`, `/execute_js` → use `get_default_browser_config()`
 - `handle_llm_qa()`, `handle_markdown_request()` → same
 **Result**: All endpoints now hit permanent browser pool
 ### 4. Config Updates (`config.yml`)
 - `idle_ttl_sec: 1800` → `300` (30min → 5min base TTL)
 - `port: 11234` → `11235` (fixed mismatch with Gunicorn)
 ### 5. Lifespan Fix (`server.py`)
 ```python
 await init_permanent(BrowserConfig(
    extra_args=config["crawler"]["browser"].get("extra_args", []),
    **config["crawler"]["browser"].get("kwargs", {}),
 ))
 ```
 Permanent browser now matches endpoint config signatures
 ## Test Results
 ### Test 1: Basic Health
 - 10 requests to `/health`
 - **Result**: 100% success, avg 3ms latency
 - **Baseline**: Container starts in ~5s, 270 MB idle
 ### Test 2: Memory Monitoring
 - 20 requests with Docker stats tracking
 - **Result**: 100% success, no memory leak (-0.2 MB delta)
 - **Baseline**: 269.7 MB container overhead
 ### Test 3: Pool Validation
 - 30 requests to `/html` endpoint
 - **Result**: **100% permanent browser hits**, 0 new browsers created
 - **Memory**: 287 MB baseline → 396 MB active (+109 MB)
 - **Latency**: Avg 4s (includes network to httpbin.org)
 ### Test 4: Concurrent Load
 - Light (10) → Medium (50) → Heavy (100) concurrent
 - **Total**: 320 requests
 - **Result**: 100% success, **320/320 permanent hits**, 0 new browsers
 - **Memory**: 269 MB → peak 1533 MB → final 993 MB
 - **Latency**: P99 at 100 concurrent = 34s (expected with single browser)
 ### Test 5: Pool Stress (Mixed Configs)
 - 20 requests with 4 different viewport configs
 - **Result**: 4 new browsers, 4 cold hits, **4 promotions to hot**, 8 hot hits
 - **Reuse Rate**: 60% (12 pool hits / 20 requests)
 - **Memory**: 270 MB → 928 MB peak (+658 MB = ~165 MB per browser)
 - **Proves**: Cold → hot promotion triggers at 3 uses, as designed
 ### Test 6: Multi-Endpoint
 - 10 requests each: `/html`, `/screenshot`, `/pdf`, `/crawl`
 - **Result**: 100% success across all 4 endpoints
 - **Latency**: 5-8s avg (PDF slowest at 7.2s)
 ### Test 7: Cleanup Verification
 - 20 requests (load spike) → 90s idle
 - **Memory**: 269 MB → peak 1107 MB → final 780 MB
 - **Recovery**: 327 MB (39%) - partial cleanup
 - **Note**: Hot pool browsers persist (by design), janitor working correctly
 ## Performance Metrics
 | Metric | Before | After | Improvement |
 |--------|--------|-------|-------------|
 | Pool Reuse | 0% | 100% (default config) | ∞ |
 | Memory Leak | Unknown | 0 MB/cycle | Stable |
 | Browser Reuse | No | Yes | ~3-5s saved per request |
 | Idle Memory | 500-700 MB × N | 270-400 MB | 10x reduction |
 | Concurrent Capacity | ~20 | 100+ | 5x |
 ## Key Learnings
 1. **Config Signature Matching**: Permanent browser MUST match endpoint default config exactly (SHA1 hash)
 2. **Logging Levels**: Pool diagnostics need INFO level, not DEBUG
 3. **Memory in Docker**: Must read cgroup files, not host metrics
 4. **Janitor Timing**: 60s interval adequate, but TTLs should be short (5min) for cold pool
 5. **Hot Promotion**: 3-use threshold works well for production patterns
 6. **Memory Per Browser**: ~150-200 MB per Chromium instance with headless + text_mode
 ## Test Infrastructure
 **Location**: `deploy/docker/tests/`
 **Dependencies**: `httpx`, `docker` (Python SDK)
 **Pattern**: Sequential build - each test adds one capability
 **Files**:
 - `test_1_basic.py`: Health check + container lifecycle
 - `test_2_memory.py`: + Docker stats monitoring
 - `test_3_pool.py`: + Log analysis for pool markers
 - `test_4_concurrent.py`: + asyncio.Semaphore for concurrency control
 - `test_5_pool_stress.py`: + Config variants (viewports)
 - `test_6_multi_endpoint.py`: + Multiple endpoint testing
 - `test_7_cleanup.py`: + Time-series memory tracking for janitor
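
The concurrency control mentioned for `test_4_concurrent.py` can be sketched as follows. This is a self-contained illustration, not the test file itself: `run_limited` and `FakeEndpoint` are hypothetical names, and the fake endpoint sleeps instead of calling the server so the example runs without a container.

```python
import asyncio
from typing import Awaitable, Callable, List


async def run_limited(
    n_requests: int,
    limit: int,
    do_request: Callable[[int], Awaitable[int]],
) -> List[int]:
    """Run n_requests calls with at most `limit` in flight at any time."""
    sem = asyncio.Semaphore(limit)

    async def guarded(i: int) -> int:
        async with sem:  # blocks while `limit` requests are already running
            return await do_request(i)

    return await asyncio.gather(*(guarded(i) for i in range(n_requests)))


class FakeEndpoint:
    """Stand-in for an HTTP call; records the peak number of concurrent calls."""

    def __init__(self) -> None:
        self.active = 0
        self.peak = 0

    async def __call__(self, i: int) -> int:
        self.active += 1
        self.peak = max(self.peak, self.active)
        await asyncio.sleep(0.01)  # simulated network latency
        self.active -= 1
        return 200  # simulated status code


if __name__ == "__main__":
    endpoint = FakeEndpoint()
    statuses = asyncio.run(run_limited(10, 3, endpoint))
    print(len(statuses), "requests, peak concurrency", endpoint.peak)
```

In a real test, `do_request` would wrap an `httpx.AsyncClient` call against the container.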
 **Run Pattern**:
 ```bash
 cd deploy/docker/tests
 pip install -r requirements.txt
 # Rebuild after code changes:
 cd /path/to/repo && docker buildx build -t crawl4ai-local:latest --load .
 # Run test:
 python test_N_name.py
 ```
 ## Architecture Decisions
 **Why Permanent Browser?**
 - 90% of requests use default config → single browser serves most traffic
 - Eliminates 3-5s startup overhead per request
 **Why 3-Tier Pool?**
 - Permanent: Zero cost for common case
 - Hot: Amortized cost for frequent variants
 - Cold: Lazy allocation for rare configs
 **Why Adaptive Janitor?**
 - Memory pressure triggers aggressive cleanup
 - Low memory allows longer TTLs for better reuse
 **Why Not Close After Each Request?**
 - Browser startup: 3-5s overhead
 - Pool reuse: <100ms overhead
 - Net: 30-50x faster
 ## Future Optimizations
 1. **Request Queuing**: When at capacity, queue instead of reject
 2. **Pre-warming**: Predict common configs, pre-create browsers
 3. **Metrics Export**: Prometheus metrics for pool efficiency
 4. **Config Normalization**: Group similar viewports (e.g., 1920±50 → 1920)
 ## Critical Code Paths
 **Browser Acquisition** (`crawler_pool.py:34-78`):
 ```
 get_crawler(cfg) →
  _sig(cfg) →
  if sig == DEFAULT_CONFIG_SIG → PERMANENT
  elif sig in HOT_POOL → HOT_POOL[sig]
  elif sig in COLD_POOL → promote if count >= 3
  else → create new in COLD_POOL
 ```
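A runnable approximation of the lookup above is sketched here, with strings standing in for browser instances. It is not the code in `crawler_pool.py`: the `TieredPool` class, its field names, and the rule that every non-default request increments the usage counter are assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class TieredPool:
    """Toy model of the permanent / hot / cold lookup order."""

    default_sig: str
    hot: Dict[str, str] = field(default_factory=dict)
    cold: Dict[str, str] = field(default_factory=dict)
    usage: Dict[str, int] = field(default_factory=dict)
    promote_at: int = 3  # promotion threshold taken from the text above

    def acquire(self, sig: str) -> str:
        """Return which tier served the request for config signature `sig`."""
        if sig == self.default_sig:
            return "permanent"
        self.usage[sig] = self.usage.get(sig, 0) + 1
        if sig in self.hot:
            return "hot"
        if sig in self.cold:
            if self.usage[sig] >= self.promote_at:
                # Move the browser from the cold pool to the hot pool.
                self.hot[sig] = self.cold.pop(sig)
                return "promoted"
            return "cold"
        # Placeholder for launching a new browser for this config.
        self.cold[sig] = f"browser-{sig}"
        return "new"


if __name__ == "__main__":
    pool = TieredPool(default_sig="default")
    print([pool.acquire("viewport-a") for _ in range(5)])
```

Under these assumed counting rules, five requests for one config produce one new browser, one cold hit, one promotion and two hot hits, which is consistent with the Test 5 totals for four configs.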
 **Janitor Loop** (`crawler_pool.py:107-146`):
 ```
 while True:
  mem% = get_container_memory_percent()
  if mem% > 80: interval=10s, cold_ttl=30s
  elif mem% > 60: interval=30s, cold_ttl=60s
  else: interval=60s, cold_ttl=300s
  sleep(interval)
  close idle browsers (COLD then HOT)
 ```
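The adaptive thresholds in the loop above can be expressed as two small pure functions. This is a sketch only: `janitor_settings` and `idle_signatures` are hypothetical helper names, and the real janitor also closes the selected browsers, which is omitted here.

```python
import time
from typing import Dict, List, Optional, Tuple


def janitor_settings(mem_percent: float) -> Tuple[int, int]:
    """Map memory pressure to (sleep interval, cold-pool TTL), both in seconds."""
    if mem_percent > 80:
        return 10, 30     # high pressure: check often, evict quickly
    if mem_percent > 60:
        return 30, 60     # moderate pressure
    return 60, 300        # low pressure: favour reuse over cleanup


def idle_signatures(
    last_used: Dict[str, float],
    ttl: float,
    now: Optional[float] = None,
) -> List[str]:
    """Return the config signatures whose browsers have been idle longer than ttl."""
    now = time.time() if now is None else now
    return [sig for sig, ts in last_used.items() if now - ts > ttl]


if __name__ == "__main__":
    interval, cold_ttl = janitor_settings(72.5)
    print(interval, cold_ttl)
    print(idle_signatures({"a": 0.0, "b": 95.0}, ttl=cold_ttl, now=100.0))
```

Keeping the threshold logic separate from the `while True` loop makes it possible to unit-test the eviction policy without starting any browsers.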
 **Endpoint Pattern** (`server.py` example):
 ```python
@app.post("/html")
 async def generate_html(...):
    from crawler_pool import get_crawler
    crawler = await get_crawler(get_default_browser_config())
    results = await crawler.arun(url=body.url, config=cfg)
    # No crawler.close() - returned to pool
 ```
 ## Debugging Tips
 **Check Pool Activity**:
 ```bash
 docker logs crawl4ai-test | grep -E "(🔥|♨️|❄️|🆕|⬆️)"
 ```
 **Verify Config Signature**:
 ```python
 from crawl4ai import BrowserConfig
 import json, hashlib
 cfg = BrowserConfig(...)
 sig = hashlib.sha1(json.dumps(cfg.to_dict(), sort_keys=True, separators=(",", ":")).encode()).hexdigest()  # same separators as _sig()
 print(sig[:8])  # Compare with logs
 ```
 **Monitor Memory**:
 ```bash
 docker stats crawl4ai-test
 ```
 ## Known Limitations
 - **Mac Docker Stats**: CPU metrics unreliable, memory works
 - **PDF Generation**: Slowest endpoint (~7s), no optimization yet
 - **Hot Pool Persistence**: May hold memory longer than needed (trade-off for performance)
 - **Janitor Lag**: Up to 60s before cleanup triggers in low-memory scenarios
--- a/deploy/docker/api.py
+++ b/deploy/docker/api.py
@@ -66,6 +66,7 @@ async def handle_llm_qa(
    config: dict
 ) -> str:
    """Process QA using LLM with crawled content as context."""
    from crawler_pool import get_crawler
    try:
        if not url.startswith(('http://', 'https://')) and not url.startswith(("raw:", "raw://")):
            url = 'https://' + url
@@ -74,8 +75,14 @@ async def handle_llm_qa(
        if last_q_index != -1:
            url = url[:last_q_index]
-        # Get markdown content
+        # Get markdown content (use default config)
-        async with AsyncWebCrawler() as crawler:
+        from utils import load_config
        cfg = load_config()
        browser_cfg = BrowserConfig(
            extra_args=cfg["crawler"]["browser"].get("extra_args", []),
            **cfg["crawler"]["browser"].get("kwargs", {}),
        )
        crawler = await get_crawler(browser_cfg)
        result = await crawler.arun(url)
        if not result.success:
            raise HTTPException(
@@ -224,7 +231,14 @@ async def handle_markdown_request(
        cache_mode = CacheMode.ENABLED if cache == "1" else CacheMode.WRITE_ONLY
-        async with AsyncWebCrawler() as crawler:
+        from crawler_pool import get_crawler
        from utils import load_config as _load_config
        _cfg = _load_config()
        browser_cfg = BrowserConfig(
            extra_args=_cfg["crawler"]["browser"].get("extra_args", []),
            **_cfg["crawler"]["browser"].get("kwargs", {}),
        )
        crawler = await get_crawler(browser_cfg)
        result = await crawler.arun(
            url=decoded_url,
            config=CrawlerRunConfig(
@@ -446,6 +460,16 @@ async def handle_crawl_request(
    hooks_config: Optional[dict] = None
 ) -> dict:
    """Handle non-streaming crawl requests with optional hooks."""
    # Track request start
    request_id = f"req_{uuid4().hex[:8]}"
    try:
        from monitor import get_monitor
        await get_monitor().track_request_start(
            request_id, "/crawl", urls[0] if urls else "batch", browser_config
        )
    except:
        pass  # Monitor not critical
    start_mem_mb = _get_memory_mb() # <--- Get memory before
    start_time = time.time()
    mem_delta_mb = None
@@ -557,6 +581,15 @@ async def handle_crawl_request(
            "server_peak_memory_mb": peak_mem_mb
        }
        # Track request completion
        try:
            from monitor import get_monitor
            await get_monitor().track_request_end(
                request_id, success=True, pool_hit=True, status_code=200
            )
        except:
            pass
        # Add hooks information if hooks were used
        if hooks_config and hook_manager:
            from hook_manager import UserHookManager
@@ -585,6 +618,16 @@ async def handle_crawl_request(
    except Exception as e:
        logger.error(f"Crawl error: {str(e)}", exc_info=True)
        # Track request error
        try:
            from monitor import get_monitor
            await get_monitor().track_request_end(
                request_id, success=False, error=str(e), status_code=500
            )
        except:
            pass
        if 'crawler' in locals() and crawler.ready: # Check if crawler was initialized and started
            #  try:
            #      await crawler.close()
--- a/deploy/docker/config.yml
+++ b/deploy/docker/config.yml
@@ -3,7 +3,7 @@ app:
  title: "Crawl4AI API"
  version: "1.0.0"
  host: "0.0.0.0"
-  port: 11234
+  port: 11235
  reload: False
  workers: 1
  timeout_keep_alive: 300
@@ -61,7 +61,7 @@ crawler:
    batch_process: 300.0  # Timeout for batch processing
  pool:
    max_pages: 40                          # ← GLOBAL_SEM permits
-    idle_ttl_sec: 1800                     # ← 30 min janitor cutoff
+    idle_ttl_sec: 300                     # ← 5 min janitor cutoff
  browser:
    kwargs:
      headless: true
--- a/deploy/docker/crawler_pool.py
+++ b/deploy/docker/crawler_pool.py
@@ -1,60 +1,170 @@
-# crawler_pool.py  (new file)
+# crawler_pool.py - Smart browser pool with tiered management
-import asyncio, json, hashlib, time, psutil
+import asyncio, json, hashlib, time
 from contextlib import suppress
-from typing import Dict
+from typing import Dict, Optional
 from crawl4ai import AsyncWebCrawler, BrowserConfig
-from typing import Dict
+from utils import load_config, get_container_memory_percent
-from utils import load_config 
+import logging
 logger = logging.getLogger(__name__)
 CONFIG = load_config()
-POOL: Dict[str, AsyncWebCrawler] = {}
+# Pool tiers
 PERMANENT: Optional[AsyncWebCrawler] = None  # Always-ready default browser
 HOT_POOL: Dict[str, AsyncWebCrawler] = {}    # Frequent configs
 COLD_POOL: Dict[str, AsyncWebCrawler] = {}   # Rare configs
 LAST_USED: Dict[str, float] = {}
 USAGE_COUNT: Dict[str, int] = {}
 LOCK = asyncio.Lock()
-MEM_LIMIT  = CONFIG.get("crawler", {}).get("memory_threshold_percent", 95.0)   # % RAM – refuse new browsers above this
+# Config
-IDLE_TTL  = CONFIG.get("crawler", {}).get("pool", {}).get("idle_ttl_sec", 1800)   # close if unused for 30 min
+MEM_LIMIT = CONFIG.get("crawler", {}).get("memory_threshold_percent", 95.0)
 BASE_IDLE_TTL = CONFIG.get("crawler", {}).get("pool", {}).get("idle_ttl_sec", 300)
 DEFAULT_CONFIG_SIG = None  # Cached sig for default config
 def _sig(cfg: BrowserConfig) -> str:
    """Generate config signature."""
    payload = json.dumps(cfg.to_dict(), sort_keys=True, separators=(",",":"))
    return hashlib.sha1(payload.encode()).hexdigest()
 def _is_default_config(sig: str) -> bool:
    """Check if config matches default."""
    return sig == DEFAULT_CONFIG_SIG
 async def get_crawler(cfg: BrowserConfig) -> AsyncWebCrawler:
-    try:
+    """Get crawler from pool with tiered strategy."""
    sig = _sig(cfg)
    async with LOCK:
-            if sig in POOL:
+        # Check permanent browser for default config
-                LAST_USED[sig] = time.time();  
+        if PERMANENT and _is_default_config(sig):
-                return POOL[sig]
+            LAST_USED[sig] = time.time()
-            if psutil.virtual_memory().percent >= MEM_LIMIT:
+            USAGE_COUNT[sig] = USAGE_COUNT.get(sig, 0) + 1
-                raise MemoryError("RAM pressure – new browser denied")
+            logger.info("🔥 Using permanent browser")
            return PERMANENT
        # Check hot pool
        if sig in HOT_POOL:
            LAST_USED[sig] = time.time()
            USAGE_COUNT[sig] = USAGE_COUNT.get(sig, 0) + 1
            logger.info(f"♨️  Using hot pool browser (sig={sig[:8]})")
            return HOT_POOL[sig]
        # Check cold pool (promote to hot if used 3+ times)
        if sig in COLD_POOL:
            LAST_USED[sig] = time.time()
            USAGE_COUNT[sig] = USAGE_COUNT.get(sig, 0) + 1
            if USAGE_COUNT[sig] >= 3:
                logger.info(f"⬆️  Promoting to hot pool (sig={sig[:8]}, count={USAGE_COUNT[sig]})")
                HOT_POOL[sig] = COLD_POOL.pop(sig)
                # Track promotion in monitor
                try:
                    from monitor import get_monitor
                    await get_monitor().track_janitor_event("promote", sig, {"count": USAGE_COUNT[sig]})
                except Exception:
                    pass
                return HOT_POOL[sig]
            logger.info(f"❄️  Using cold pool browser (sig={sig[:8]})")
            return COLD_POOL[sig]
        # Memory check before creating new
        mem_pct = get_container_memory_percent()
        if mem_pct >= MEM_LIMIT:
            logger.error(f"💥 Memory pressure: {mem_pct:.1f}% >= {MEM_LIMIT}%")
            raise MemoryError(f"Memory at {mem_pct:.1f}%, refusing new browser")
        # Create new in cold pool
        logger.info(f"🆕 Creating new browser in cold pool (sig={sig[:8]}, mem={mem_pct:.1f}%)")
        crawler = AsyncWebCrawler(config=cfg, thread_safe=False)
        await crawler.start()
-            POOL[sig] = crawler; LAST_USED[sig] = time.time()
+        COLD_POOL[sig] = crawler
-            return crawler
-    except MemoryError as e:
-        raise MemoryError(f"RAM pressure – new browser denied: {e}")
-    except Exception as e:
-        raise RuntimeError(f"Failed to start browser: {e}")
-    finally:
-        if sig in POOL:
        LAST_USED[sig] = time.time()
-        else:
+        USAGE_COUNT[sig] = 1
-            # If we failed to start the browser, we should remove it from the pool
+        return crawler
-            POOL.pop(sig, None)
+
-            LAST_USED.pop(sig, None)
+async def init_permanent(cfg: BrowserConfig):
-        # If we failed to start the browser, we should remove it from the pool
+    """Initialize permanent default browser."""
-async def close_all():
+    global PERMANENT, DEFAULT_CONFIG_SIG
    async with LOCK:
-        await asyncio.gather(*(c.close() for c in POOL.values()), return_exceptions=True)
+        if PERMANENT:
-        POOL.clear(); LAST_USED.clear()
+            return
        DEFAULT_CONFIG_SIG = _sig(cfg)
        logger.info("🔥 Creating permanent default browser")
        PERMANENT = AsyncWebCrawler(config=cfg, thread_safe=False)
        await PERMANENT.start()
        LAST_USED[DEFAULT_CONFIG_SIG] = time.time()
        USAGE_COUNT[DEFAULT_CONFIG_SIG] = 0
 async def close_all():
    """Close all browsers."""
    async with LOCK:
        tasks = []
        if PERMANENT:
            tasks.append(PERMANENT.close())
        tasks.extend([c.close() for c in HOT_POOL.values()])
        tasks.extend([c.close() for c in COLD_POOL.values()])
        await asyncio.gather(*tasks, return_exceptions=True)
        HOT_POOL.clear()
        COLD_POOL.clear()
        LAST_USED.clear()
        USAGE_COUNT.clear()
 async def janitor():
    """Adaptive cleanup based on memory pressure."""
    while True:
-        await asyncio.sleep(60)
+        mem_pct = get_container_memory_percent()
        # Adaptive intervals and TTLs
        if mem_pct > 80:
            interval, cold_ttl, hot_ttl = 10, 30, 120
        elif mem_pct > 60:
            interval, cold_ttl, hot_ttl = 30, 60, 300
        else:
            interval, cold_ttl, hot_ttl = 60, BASE_IDLE_TTL, BASE_IDLE_TTL * 2
        await asyncio.sleep(interval)
        now = time.time()
        async with LOCK:
-            for sig, crawler in list(POOL.items()):
+            # Clean cold pool
-                if now - LAST_USED[sig] > IDLE_TTL:
+            for sig in list(COLD_POOL.keys()):
-                    with suppress(Exception): await crawler.close()
+                if now - LAST_USED.get(sig, now) > cold_ttl:
-                    POOL.pop(sig, None); LAST_USED.pop(sig, None)
+                    idle_time = now - LAST_USED[sig]
                    logger.info(f"🧹 Closing cold browser (sig={sig[:8]}, idle={idle_time:.0f}s)")
                    with suppress(Exception):
                        await COLD_POOL[sig].close()
                    COLD_POOL.pop(sig, None)
                    LAST_USED.pop(sig, None)
                    USAGE_COUNT.pop(sig, None)
                    # Track in monitor
                    try:
                        from monitor import get_monitor
                        await get_monitor().track_janitor_event("close_cold", sig, {"idle_seconds": int(idle_time), "ttl": cold_ttl})
                    except Exception:
                        pass
            # Clean hot pool (more conservative)
            for sig in list(HOT_POOL.keys()):
                if now - LAST_USED.get(sig, now) > hot_ttl:
                    idle_time = now - LAST_USED[sig]
                    logger.info(f"🧹 Closing hot browser (sig={sig[:8]}, idle={idle_time:.0f}s)")
                    with suppress(Exception):
                        await HOT_POOL[sig].close()
                    HOT_POOL.pop(sig, None)
                    LAST_USED.pop(sig, None)
                    USAGE_COUNT.pop(sig, None)
                    # Track in monitor
                    try:
                        from monitor import get_monitor
                        await get_monitor().track_janitor_event("close_hot", sig, {"idle_seconds": int(idle_time), "ttl": hot_ttl})
                    except Exception:
                        pass
            # Log pool stats
            if mem_pct > 60:
                logger.info(f"📊 Pool: hot={len(HOT_POOL)}, cold={len(COLD_POOL)}, mem={mem_pct:.1f}%")
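Review note: the janitor's adaptive tiers are easier to check in isolation. The sketch below pulls the threshold logic out of the loop into a pure function. The name `janitor_schedule` and its `base_idle_ttl` parameter are hypothetical and not part of this diff; the numbers mirror the ones in `janitor()` above, with 300 standing in for `BASE_IDLE_TTL`.

```python
from typing import Tuple


def janitor_schedule(mem_pct: float, base_idle_ttl: int = 300) -> Tuple[int, int, int]:
    """Return (interval, cold_ttl, hot_ttl) in seconds for a memory percentage.

    Hypothetical helper mirroring the thresholds used in janitor().
    """
    if mem_pct > 80:
        # High pressure: sweep every 10s and close idle browsers quickly
        return 10, 30, 120
    if mem_pct > 60:
        # Medium pressure: moderate sweep interval and TTLs
        return 30, 60, 300
    # Low pressure: hot browsers live twice as long as cold ones
    return 60, base_idle_ttl, base_idle_ttl * 2


if __name__ == "__main__":
    for pct in (40.0, 70.0, 85.0):
        print(pct, janitor_schedule(pct))
```

Because the comparisons are strict, exactly 80% still falls in the medium tier and exactly 60% in the low tier.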
--- a/deploy/docker/monitor.py
+++ b/deploy/docker/monitor.py
@@ -0,0 +1,382 @@
 # monitor.py - Real-time monitoring stats with Redis persistence
 import time
 import json
 import asyncio
 from typing import Dict, List, Optional
 from datetime import datetime, timezone
 from collections import deque
 from redis import asyncio as aioredis
 from utils import get_container_memory_percent
 import psutil
 import logging
 logger = logging.getLogger(__name__)
 class MonitorStats:
    """Tracks real-time server stats with Redis persistence."""
    def __init__(self, redis: aioredis.Redis):
        self.redis = redis
        self.start_time = time.time()
        # In-memory queues (fast reads, Redis backup)
        self.active_requests: Dict[str, Dict] = {}  # id -> request info
        self.completed_requests: deque = deque(maxlen=100)  # Last 100
        self.janitor_events: deque = deque(maxlen=100)
        self.errors: deque = deque(maxlen=100)
        # Endpoint stats (persisted in Redis)
        self.endpoint_stats: Dict[str, Dict] = {}  # endpoint -> {count, total_time, errors, ...}
        # Background persistence queue (max 10 pending persist requests)
        self._persist_queue: asyncio.Queue = asyncio.Queue(maxsize=10)
        self._persist_worker_task: Optional[asyncio.Task] = None
        # Timeline data (5min window, 5s resolution = 60 points)
        self.memory_timeline: deque = deque(maxlen=60)
        self.requests_timeline: deque = deque(maxlen=60)
        self.browser_timeline: deque = deque(maxlen=60)
    async def track_request_start(self, request_id: str, endpoint: str, url: str, config: Dict = None):
        """Track new request start."""
        req_info = {
            "id": request_id,
            "endpoint": endpoint,
            "url": url[:100],  # Truncate long URLs
            "start_time": time.time(),
            "config_sig": config.get("sig", "default") if config else "default",
            "mem_start": psutil.Process().memory_info().rss / (1024 * 1024)
        }
        self.active_requests[request_id] = req_info
        # Increment endpoint counter
        if endpoint not in self.endpoint_stats:
            self.endpoint_stats[endpoint] = {
                "count": 0, "total_time": 0, "errors": 0,
                "pool_hits": 0, "success": 0
            }
        self.endpoint_stats[endpoint]["count"] += 1
        # Queue persistence (handled by background worker)
        try:
            self._persist_queue.put_nowait(True)
        except asyncio.QueueFull:
            logger.warning("Persistence queue full, skipping")
    async def track_request_end(self, request_id: str, success: bool, error: str = None,
                               pool_hit: bool = True, status_code: int = 200):
        """Track request completion."""
        if request_id not in self.active_requests:
            return
        req_info = self.active_requests.pop(request_id)
        end_time = time.time()
        elapsed = end_time - req_info["start_time"]
        mem_end = psutil.Process().memory_info().rss / (1024 * 1024)
        mem_delta = mem_end - req_info["mem_start"]
        # Update stats
        endpoint = req_info["endpoint"]
        if endpoint in self.endpoint_stats:
            self.endpoint_stats[endpoint]["total_time"] += elapsed
            if success:
                self.endpoint_stats[endpoint]["success"] += 1
            else:
                self.endpoint_stats[endpoint]["errors"] += 1
            if pool_hit:
                self.endpoint_stats[endpoint]["pool_hits"] += 1
        # Add to completed queue
        completed = {
            **req_info,
            "end_time": end_time,
            "elapsed": round(elapsed, 2),
            "mem_delta": round(mem_delta, 1),
            "success": success,
            "error": error,
            "status_code": status_code,
            "pool_hit": pool_hit
        }
        self.completed_requests.append(completed)
        # Track errors
        if not success and error:
            self.errors.append({
                "timestamp": end_time,
                "endpoint": endpoint,
                "url": req_info["url"],
                "error": error,
                "request_id": request_id
            })
        await self._persist_endpoint_stats()
    async def track_janitor_event(self, event_type: str, sig: str, details: Dict):
        """Track janitor cleanup events."""
        self.janitor_events.append({
            "timestamp": time.time(),
            "type": event_type,  # "close_cold", "close_hot", "promote"
            "sig": sig[:8],
            "details": details
        })
    def _cleanup_old_entries(self, max_age_seconds: int = 300):
        """Remove entries older than max_age_seconds (default 5min)."""
        now = time.time()
        cutoff = now - max_age_seconds
        # Clean completed requests
        while self.completed_requests and self.completed_requests[0].get("end_time", 0) < cutoff:
            self.completed_requests.popleft()
        # Clean janitor events
        while self.janitor_events and self.janitor_events[0].get("timestamp", 0) < cutoff:
            self.janitor_events.popleft()
        # Clean errors
        while self.errors and self.errors[0].get("timestamp", 0) < cutoff:
            self.errors.popleft()
    async def update_timeline(self):
        """Update timeline data points (called every 5s)."""
        now = time.time()
        mem_pct = get_container_memory_percent()
        # Clean old entries (keep last 5 minutes)
        self._cleanup_old_entries(max_age_seconds=300)
        # Count requests in last 5s
        recent_reqs = sum(1 for req in self.completed_requests
                         if now - req.get("end_time", 0) < 5)
        # Browser counts (acquire lock to prevent race conditions)
        from crawler_pool import PERMANENT, HOT_POOL, COLD_POOL, LOCK
        async with LOCK:
            browser_count = {
                "permanent": 1 if PERMANENT else 0,
                "hot": len(HOT_POOL),
                "cold": len(COLD_POOL)
            }
        self.memory_timeline.append({"time": now, "value": mem_pct})
        self.requests_timeline.append({"time": now, "value": recent_reqs})
        self.browser_timeline.append({"time": now, "browsers": browser_count})
    async def _persist_endpoint_stats(self):
        """Persist endpoint stats to Redis."""
        try:
            await self.redis.set(
                "monitor:endpoint_stats",
                json.dumps(self.endpoint_stats),
                ex=86400  # 24h TTL
            )
        except Exception as e:
            logger.warning(f"Failed to persist endpoint stats: {e}")
    async def _persistence_worker(self):
        """Background worker to persist stats to Redis."""
        while True:
            try:
                await self._persist_queue.get()
                await self._persist_endpoint_stats()
                self._persist_queue.task_done()
            except asyncio.CancelledError:
                break
            except Exception as e:
                logger.error(f"Persistence worker error: {e}")
    def start_persistence_worker(self):
        """Start the background persistence worker."""
        if not self._persist_worker_task:
            self._persist_worker_task = asyncio.create_task(self._persistence_worker())
            logger.info("Started persistence worker")
    async def stop_persistence_worker(self):
        """Stop the background persistence worker."""
        if self._persist_worker_task:
            self._persist_worker_task.cancel()
            try:
                await self._persist_worker_task
            except asyncio.CancelledError:
                pass
            self._persist_worker_task = None
            logger.info("Stopped persistence worker")
    async def cleanup(self):
        """Cleanup on shutdown - persist final stats and stop workers."""
        logger.info("Monitor cleanup starting...")
        try:
            # Persist final stats before shutdown
            await self._persist_endpoint_stats()
            # Stop background worker
            await self.stop_persistence_worker()
            logger.info("Monitor cleanup completed")
        except Exception as e:
            logger.error(f"Monitor cleanup error: {e}")
    async def load_from_redis(self):
        """Load persisted stats from Redis."""
        try:
            data = await self.redis.get("monitor:endpoint_stats")
            if data:
                self.endpoint_stats = json.loads(data)
                logger.info("Loaded endpoint stats from Redis")
        except Exception as e:
            logger.warning(f"Failed to load from Redis: {e}")
    async def get_health_summary(self) -> Dict:
        """Get current system health snapshot."""
        mem_pct = get_container_memory_percent()
        cpu_pct = psutil.cpu_percent(interval=0.1)
        # Network I/O (cumulative counters since boot, not a per-call delta)
        net = psutil.net_io_counters()
        # Pool status (acquire lock to prevent race conditions)
        from crawler_pool import PERMANENT, HOT_POOL, COLD_POOL, LOCK
        async with LOCK:
            # TODO: Track actual browser process memory instead of estimates
            # These are conservative estimates based on typical Chromium usage
            permanent_mem = 270 if PERMANENT else 0  # Estimate: ~270MB for permanent browser
            hot_mem = len(HOT_POOL) * 180  # Estimate: ~180MB per hot pool browser
            cold_mem = len(COLD_POOL) * 180  # Estimate: ~180MB per cold pool browser
            permanent_active = PERMANENT is not None
            hot_count = len(HOT_POOL)
            cold_count = len(COLD_POOL)
        return {
            "container": {
                "memory_percent": round(mem_pct, 1),
                "cpu_percent": round(cpu_pct, 1),
                "network_sent_mb": round(net.bytes_sent / (1024**2), 2),
                "network_recv_mb": round(net.bytes_recv / (1024**2), 2),
                "uptime_seconds": int(time.time() - self.start_time)
            },
            "pool": {
                "permanent": {"active": permanent_active, "memory_mb": permanent_mem},
                "hot": {"count": hot_count, "memory_mb": hot_mem},
                "cold": {"count": cold_count, "memory_mb": cold_mem},
                "total_memory_mb": permanent_mem + hot_mem + cold_mem
            },
            "janitor": {
                "next_cleanup_estimate": "adaptive",  # Would need janitor state
                "memory_pressure": "LOW" if mem_pct < 60 else "MEDIUM" if mem_pct < 80 else "HIGH"
            }
        }
    def get_active_requests(self) -> List[Dict]:
        """Get list of currently active requests."""
        now = time.time()
        return [
            {
                **req,
                "elapsed": round(now - req["start_time"], 1),
                "status": "running"
            }
            for req in self.active_requests.values()
        ]
    def get_completed_requests(self, limit: int = 50, filter_status: str = "all") -> List[Dict]:
        """Get recent completed requests."""
        requests = list(self.completed_requests)[-limit:]
        if filter_status == "success":
            requests = [r for r in requests if r.get("success")]
        elif filter_status == "error":
            requests = [r for r in requests if not r.get("success")]
        return requests
    async def get_browser_list(self) -> List[Dict]:
        """Get detailed browser pool information."""
        from crawler_pool import PERMANENT, HOT_POOL, COLD_POOL, LAST_USED, USAGE_COUNT, DEFAULT_CONFIG_SIG, LOCK
        browsers = []
        now = time.time()
        # Acquire lock to prevent race conditions during iteration
        async with LOCK:
            if PERMANENT:
                browsers.append({
                    "type": "permanent",
                    "sig": DEFAULT_CONFIG_SIG[:8] if DEFAULT_CONFIG_SIG else "unknown",
                    "age_seconds": int(now - self.start_time),
                    "last_used_seconds": int(now - LAST_USED.get(DEFAULT_CONFIG_SIG, now)),
                    "memory_mb": 270,
                    "hits": USAGE_COUNT.get(DEFAULT_CONFIG_SIG, 0),
                    "killable": False
                })
            for sig, crawler in HOT_POOL.items():
                browsers.append({
                    "type": "hot",
                    "sig": sig[:8],
                    "age_seconds": int(now - self.start_time),  # Approximation
                    "last_used_seconds": int(now - LAST_USED.get(sig, now)),
                    "memory_mb": 180,  # Estimate
                    "hits": USAGE_COUNT.get(sig, 0),
                    "killable": True
                })
            for sig, crawler in COLD_POOL.items():
                browsers.append({
                    "type": "cold",
                    "sig": sig[:8],
                    "age_seconds": int(now - self.start_time),
                    "last_used_seconds": int(now - LAST_USED.get(sig, now)),
                    "memory_mb": 180,
                    "hits": USAGE_COUNT.get(sig, 0),
                    "killable": True
                })
        return browsers
    def get_endpoint_stats_summary(self) -> Dict[str, Dict]:
        """Get aggregated endpoint statistics."""
        summary = {}
        for endpoint, stats in self.endpoint_stats.items():
            count = stats["count"]
            avg_time = (stats["total_time"] / count) if count > 0 else 0
            success_rate = (stats["success"] / count * 100) if count > 0 else 0
            pool_hit_rate = (stats["pool_hits"] / count * 100) if count > 0 else 0
            summary[endpoint] = {
                "count": count,
                "avg_latency_ms": round(avg_time * 1000, 1),
                "success_rate_percent": round(success_rate, 1),
                "pool_hit_rate_percent": round(pool_hit_rate, 1),
                "errors": stats["errors"]
            }
        return summary
    def get_timeline_data(self, metric: str, window: str = "5m") -> Dict:
        """Get timeline data for charts."""
        # For now, only 5m window supported
        if metric == "memory":
            data = list(self.memory_timeline)
        elif metric == "requests":
            data = list(self.requests_timeline)
        elif metric == "browsers":
            data = list(self.browser_timeline)
        else:
            return {"timestamps": [], "values": []}
        return {
            "timestamps": [int(d["time"]) for d in data],
            "values": [d.get("value", d.get("browsers")) for d in data]
        }
    def get_janitor_log(self, limit: int = 100) -> List[Dict]:
        """Get recent janitor events."""
        return list(self.janitor_events)[-limit:]
    def get_errors_log(self, limit: int = 100) -> List[Dict]:
        """Get recent errors."""
        return list(self.errors)[-limit:]
 # Global instance (initialized in server.py)
 monitor_stats: Optional[MonitorStats] = None
 def get_monitor() -> MonitorStats:
    """Get global monitor instance."""
    if monitor_stats is None:
        raise RuntimeError("Monitor not initialized")
    return monitor_stats
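Review note: a worked example of the arithmetic in `get_endpoint_stats_summary` may help when reading the `/monitor/endpoints/stats` output. The standalone function `summarize_endpoint` below is a hypothetical stand-in that applies the same formulas to a single raw stats dict; the sample numbers are invented for illustration.

```python
from typing import Dict


def summarize_endpoint(stats: Dict[str, float]) -> Dict[str, float]:
    """Apply the same per-endpoint formulas as get_endpoint_stats_summary.

    Hypothetical helper for illustration; not part of monitor.py.
    """
    count = stats["count"]
    # Guard against division by zero for endpoints with no traffic yet
    avg_time = (stats["total_time"] / count) if count > 0 else 0
    success_rate = (stats["success"] / count * 100) if count > 0 else 0
    pool_hit_rate = (stats["pool_hits"] / count * 100) if count > 0 else 0
    return {
        "count": count,
        "avg_latency_ms": round(avg_time * 1000, 1),
        "success_rate_percent": round(success_rate, 1),
        "pool_hit_rate_percent": round(pool_hit_rate, 1),
        "errors": stats["errors"],
    }


if __name__ == "__main__":
    # Invented sample: 4 requests, 2.0s total, 3 succeeded, 2 reused a pooled browser
    raw = {"count": 4, "total_time": 2.0, "success": 3, "errors": 1, "pool_hits": 2}
    print(summarize_endpoint(raw))
```

Note that `count` is incremented in `track_request_start`, so requests still in flight lower the reported success and pool-hit rates until they complete.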
--- a/deploy/docker/monitor_routes.py
+++ b/deploy/docker/monitor_routes.py
@@ -0,0 +1,405 @@
 # monitor_routes.py - Monitor API endpoints
 from fastapi import APIRouter, HTTPException, WebSocket, WebSocketDisconnect
 from pydantic import BaseModel
 from typing import Optional
 from monitor import get_monitor
 import logging
 import asyncio
 import json
 logger = logging.getLogger(__name__)
 router = APIRouter(prefix="/monitor", tags=["monitor"])
@router.get("/health")
 async def get_health():
    """Get current system health snapshot."""
    try:
        monitor = get_monitor()
        return await monitor.get_health_summary()
    except Exception as e:
        logger.error(f"Error getting health: {e}")
        raise HTTPException(500, str(e))
@router.get("/requests")
 async def get_requests(status: str = "all", limit: int = 50):
    """Get active and completed requests.
    Args:
        status: Filter by 'active', 'completed', 'success', 'error', or 'all'
        limit: Max number of completed requests to return (default 50)
    """
    # Input validation
    if status not in ["all", "active", "completed", "success", "error"]:
        raise HTTPException(400, f"Invalid status: {status}. Must be one of: all, active, completed, success, error")
    if limit < 1 or limit > 1000:
        raise HTTPException(400, f"Invalid limit: {limit}. Must be between 1 and 1000")
    try:
        monitor = get_monitor()
        if status == "active":
            return {"active": monitor.get_active_requests(), "completed": []}
        elif status == "completed":
            return {"active": [], "completed": monitor.get_completed_requests(limit)}
        elif status in ["success", "error"]:
            return {"active": [], "completed": monitor.get_completed_requests(limit, status)}
        else:  # "all"
            return {
                "active": monitor.get_active_requests(),
                "completed": monitor.get_completed_requests(limit)
            }
    except Exception as e:
        logger.error(f"Error getting requests: {e}")
        raise HTTPException(500, str(e))
@router.get("/browsers")
 async def get_browsers():
    """Get detailed browser pool information."""
    try:
        monitor = get_monitor()
        browsers = await monitor.get_browser_list()
        # Calculate summary stats
        total_browsers = len(browsers)
        total_memory = sum(b["memory_mb"] for b in browsers)
        # Calculate reuse rate from recent requests
        recent = monitor.get_completed_requests(100)
        pool_hits = sum(1 for r in recent if r.get("pool_hit", False))
        reuse_rate = (pool_hits / len(recent) * 100) if recent else 0
        return {
            "browsers": browsers,
            "summary": {
                "total_count": total_browsers,
                "total_memory_mb": total_memory,
                "reuse_rate_percent": round(reuse_rate, 1)
            }
        }
    except Exception as e:
        logger.error(f"Error getting browsers: {e}")
        raise HTTPException(500, str(e))
@router.get("/endpoints/stats")
 async def get_endpoint_stats():
    """Get aggregated endpoint statistics."""
    try:
        monitor = get_monitor()
        return monitor.get_endpoint_stats_summary()
    except Exception as e:
        logger.error(f"Error getting endpoint stats: {e}")
        raise HTTPException(500, str(e))
@router.get("/timeline")
 async def get_timeline(metric: str = "memory", window: str = "5m"):
    """Get timeline data for charts.
    Args:
        metric: 'memory', 'requests', or 'browsers'
        window: Time window (only '5m' supported for now)
    """
    # Input validation
    if metric not in ["memory", "requests", "browsers"]:
        raise HTTPException(400, f"Invalid metric: {metric}. Must be one of: memory, requests, browsers")
    if window != "5m":
        raise HTTPException(400, f"Invalid window: {window}. Only '5m' is currently supported")
    try:
        monitor = get_monitor()
        return monitor.get_timeline_data(metric, window)
    except Exception as e:
        logger.error(f"Error getting timeline: {e}")
        raise HTTPException(500, str(e))
@router.get("/logs/janitor")
 async def get_janitor_log(limit: int = 100):
    """Get recent janitor cleanup events."""
    # Input validation
    if limit < 1 or limit > 1000:
        raise HTTPException(400, f"Invalid limit: {limit}. Must be between 1 and 1000")
    try:
        monitor = get_monitor()
        return {"events": monitor.get_janitor_log(limit)}
    except Exception as e:
        logger.error(f"Error getting janitor log: {e}")
        raise HTTPException(500, str(e))
@router.get("/logs/errors")
 async def get_errors_log(limit: int = 100):
    """Get recent errors."""
    # Input validation
    if limit < 1 or limit > 1000:
        raise HTTPException(400, f"Invalid limit: {limit}. Must be between 1 and 1000")
    try:
        monitor = get_monitor()
        return {"errors": monitor.get_errors_log(limit)}
    except Exception as e:
        logger.error(f"Error getting errors log: {e}")
        raise HTTPException(500, str(e))
 # ========== Control Actions ==========
 class KillBrowserRequest(BaseModel):
    sig: str
@router.post("/actions/cleanup")
 async def force_cleanup():
    """Force immediate cleanup: closes every cold pool browser, idle or not."""
    try:
        from crawler_pool import COLD_POOL, LAST_USED, USAGE_COUNT, LOCK
        import time
        from contextlib import suppress
        killed_count = 0
        now = time.time()
        async with LOCK:
            for sig in list(COLD_POOL.keys()):
                # Kill all cold pool browsers immediately
                logger.info(f"🧹 Force cleanup: closing cold browser (sig={sig[:8]})")
                with suppress(Exception):
                    await COLD_POOL[sig].close()
                COLD_POOL.pop(sig, None)
                LAST_USED.pop(sig, None)
                USAGE_COUNT.pop(sig, None)
                killed_count += 1
        monitor = get_monitor()
        await monitor.track_janitor_event("force_cleanup", "manual", {"killed": killed_count})
        return {"success": True, "killed_browsers": killed_count}
    except Exception as e:
        logger.error(f"Error during force cleanup: {e}")
        raise HTTPException(500, str(e))
@router.post("/actions/kill_browser")
 async def kill_browser(req: KillBrowserRequest):
    """Kill a specific browser by signature (hot or cold only).
    Args:
        sig: Browser config signature (first 8 chars)
    """
    try:
        from crawler_pool import HOT_POOL, COLD_POOL, LAST_USED, USAGE_COUNT, LOCK, DEFAULT_CONFIG_SIG
        from contextlib import suppress
        # Find full signature matching prefix
        target_sig = None
        pool_type = None
        async with LOCK:
            # Check hot pool
            for sig in HOT_POOL.keys():
                if sig.startswith(req.sig):
                    target_sig = sig
                    pool_type = "hot"
                    break
            # Check cold pool
            if not target_sig:
                for sig in COLD_POOL.keys():
                    if sig.startswith(req.sig):
                        target_sig = sig
                        pool_type = "cold"
                        break
            # Check if trying to kill permanent
            if DEFAULT_CONFIG_SIG and DEFAULT_CONFIG_SIG.startswith(req.sig):
                raise HTTPException(403, "Cannot kill permanent browser. Use restart instead.")
            if not target_sig:
                raise HTTPException(404, f"Browser with sig={req.sig} not found")
            # Warn if there are active requests (browser might be in use)
            monitor = get_monitor()
            active_count = len(monitor.get_active_requests())
            if active_count > 0:
                logger.warning(f"Killing browser {target_sig[:8]} while {active_count} requests are active - may cause failures")
            # Kill the browser
            if pool_type == "hot":
                browser = HOT_POOL.pop(target_sig)
            else:
                browser = COLD_POOL.pop(target_sig)
            with suppress(Exception):
                await browser.close()
            LAST_USED.pop(target_sig, None)
            USAGE_COUNT.pop(target_sig, None)
        logger.info(f"🔪 Killed {pool_type} browser (sig={target_sig[:8]})")
        monitor = get_monitor()
        await monitor.track_janitor_event("kill_browser", target_sig, {"pool": pool_type, "manual": True})
        return {"success": True, "killed_sig": target_sig[:8], "pool_type": pool_type}
    except HTTPException:
        raise
    except Exception as e:
        logger.error(f"Error killing browser: {e}")
        raise HTTPException(500, str(e))
@router.post("/actions/restart_browser")
 async def restart_browser(req: KillBrowserRequest):
    """Restart a browser (kill + recreate). Works for permanent too.
    Args:
        sig: Browser config signature (first 8 chars), or "permanent"
    """
    try:
        from crawler_pool import (PERMANENT, HOT_POOL, COLD_POOL, LAST_USED,
                                  USAGE_COUNT, LOCK, DEFAULT_CONFIG_SIG, init_permanent)
        from crawl4ai import AsyncWebCrawler, BrowserConfig
        from contextlib import suppress
        import time
        # Handle permanent browser restart
        if req.sig == "permanent" or (DEFAULT_CONFIG_SIG and DEFAULT_CONFIG_SIG.startswith(req.sig)):
            async with LOCK:
                if PERMANENT:
                    with suppress(Exception):
                        await PERMANENT.close()
                # Reinitialize permanent
                from utils import load_config
                config = load_config()
                await init_permanent(BrowserConfig(
                    extra_args=config["crawler"]["browser"].get("extra_args", []),
                    **config["crawler"]["browser"].get("kwargs", {}),
                ))
            logger.info("🔄 Restarted permanent browser")
            return {"success": True, "restarted": "permanent"}
        # Handle hot/cold browser restart
        target_sig = None
        pool_type = None
        browser_config = None
        async with LOCK:
            # Find browser
            for sig in HOT_POOL.keys():
                if sig.startswith(req.sig):
                    target_sig = sig
                    pool_type = "hot"
                    # Would need to reconstruct config (not stored currently)
                    break
            if not target_sig:
                for sig in COLD_POOL.keys():
                    if sig.startswith(req.sig):
                        target_sig = sig
                        pool_type = "cold"
                        break
            if not target_sig:
                raise HTTPException(404, f"Browser with sig={req.sig} not found")
            # Kill existing
            if pool_type == "hot":
                browser = HOT_POOL.pop(target_sig)
            else:
                browser = COLD_POOL.pop(target_sig)
            with suppress(Exception):
                await browser.close()
            # Note: We can't easily recreate with same config without storing it
            # For now, just kill and let new requests create fresh ones
            LAST_USED.pop(target_sig, None)
            USAGE_COUNT.pop(target_sig, None)
        logger.info(f"🔄 Restarted {pool_type} browser (sig={target_sig[:8]})")
        monitor = get_monitor()
        await monitor.track_janitor_event("restart_browser", target_sig, {"pool": pool_type})
        return {"success": True, "restarted_sig": target_sig[:8], "note": "Browser will be recreated on next request"}
    except HTTPException:
        raise
    except Exception as e:
        logger.error(f"Error restarting browser: {e}")
        raise HTTPException(500, str(e))
@router.post("/stats/reset")
 async def reset_stats():
    """Reset today's endpoint counters."""
    try:
        monitor = get_monitor()
        monitor.endpoint_stats.clear()
        await monitor._persist_endpoint_stats()
        return {"success": True, "message": "Endpoint stats reset"}
    except Exception as e:
        logger.error(f"Error resetting stats: {e}")
        raise HTTPException(500, str(e))
@router.websocket("/ws")
 async def websocket_endpoint(websocket: WebSocket):
    """WebSocket endpoint for real-time monitoring updates.
    Sends updates every 2 seconds with:
    - Health stats
    - Active/completed requests
    - Browser pool status
    - Timeline data
    """
    await websocket.accept()
    logger.info("WebSocket client connected")
    try:
        while True:
            try:
                # Gather all monitoring data
                monitor = get_monitor()
                data = {
                    "timestamp": asyncio.get_event_loop().time(),
                    "health": await monitor.get_health_summary(),
                    "requests": {
                        "active": monitor.get_active_requests(),
                        "completed": monitor.get_completed_requests(limit=10)
                    },
                    "browsers": await monitor.get_browser_list(),
                    "timeline": {
                        "memory": monitor.get_timeline_data("memory", "5m"),
                        "requests": monitor.get_timeline_data("requests", "5m"),
                        "browsers": monitor.get_timeline_data("browsers", "5m")
                    },
                    "janitor": monitor.get_janitor_log(limit=10),
                    "errors": monitor.get_errors_log(limit=10)
                }
                # Send update to client
                await websocket.send_json(data)
                # Wait 2 seconds before next update
                await asyncio.sleep(2)
            except WebSocketDisconnect:
                logger.info("WebSocket client disconnected")
                break
            except Exception as e:
                logger.error(f"WebSocket error: {e}", exc_info=True)
                await asyncio.sleep(2)  # Continue trying
    except Exception as e:
        logger.error(f"WebSocket connection error: {e}", exc_info=True)
    finally:
        logger.info("WebSocket connection closed")
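Review note: `kill_browser` and `restart_browser` both repeat the same prefix search over the hot pool and then the cold pool. A minimal sketch of how that lookup could be factored out is shown here. The helper name `find_pool_entry` and the sample pools are hypothetical and are not part of `crawler_pool`; plain dicts stand in for `HOT_POOL` and `COLD_POOL`.

```python
from typing import Mapping, Optional, Tuple


def find_pool_entry(
    prefix: str,
    hot_pool: Mapping[str, object],
    cold_pool: Mapping[str, object],
) -> Tuple[Optional[str], Optional[str]]:
    """Return (full_signature, pool_type) for the first signature starting with prefix.

    The hot pool is searched before the cold pool, matching the order used
    in the handlers. Returns (None, None) when nothing matches.
    """
    for pool_type, pool in (("hot", hot_pool), ("cold", cold_pool)):
        for sig in pool:
            if sig.startswith(prefix):
                return sig, pool_type
    return None, None


# Hypothetical pools; real entries would be browser/crawler objects.
hot = {"a1b2c3d4e5f6": "browser-1"}
cold = {"ffee00112233": "browser-2"}

print(find_pool_entry("a1b2c3d4", hot, cold))
print(find_pool_entry("ffee0011", hot, cold))
print(find_pool_entry("deadbeef", hot, cold))
```

Because the match uses `str.startswith`, an empty prefix matches the first signature it sees, so a caller would probably want to reject an empty `sig` before the lookup.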
--- a/deploy/docker/server.py
+++ b/deploy/docker/server.py
@@ -16,6 +16,7 @@ from fastapi import Request, Depends
 from fastapi.responses import FileResponse
 import base64
 import re
 import logging
 from crawl4ai import AsyncWebCrawler, BrowserConfig, CrawlerRunConfig
 from api import (
    handle_markdown_request, handle_llm_qa,
@@ -78,6 +79,14 @@ __version__ = "0.5.1-d1"
 MAX_PAGES = config["crawler"]["pool"].get("max_pages", 30)
 GLOBAL_SEM = asyncio.Semaphore(MAX_PAGES)
 # ── default browser config helper ─────────────────────────────
 def get_default_browser_config() -> BrowserConfig:
    """Get default BrowserConfig from config.yml."""
    return BrowserConfig(
        extra_args=config["crawler"]["browser"].get("extra_args", []),
        **config["crawler"]["browser"].get("kwargs", {}),
    )
 # import logging
 # page_log = logging.getLogger("page_cap")
 # orig_arun = AsyncWebCrawler.arun
@@ -103,15 +112,52 @@ AsyncWebCrawler.arun = capped_arun
@asynccontextmanager
 async def lifespan(_: FastAPI):
-    await get_crawler(BrowserConfig(
+    from crawler_pool import init_permanent
    from monitor import MonitorStats
    import monitor as monitor_module
    # Initialize monitor
    monitor_module.monitor_stats = MonitorStats(redis)
    await monitor_module.monitor_stats.load_from_redis()
    monitor_module.monitor_stats.start_persistence_worker()
    # Initialize browser pool
    await init_permanent(BrowserConfig(
        extra_args=config["crawler"]["browser"].get("extra_args", []),
        **config["crawler"]["browser"].get("kwargs", {}),
-    ))           # warm‑up
+    ))
-    app.state.janitor = asyncio.create_task(janitor())        # idle GC
+
    # Start background tasks
    app.state.janitor = asyncio.create_task(janitor())
    app.state.timeline_updater = asyncio.create_task(_timeline_updater())
    yield
    # Cleanup
    app.state.janitor.cancel()
    app.state.timeline_updater.cancel()
    # Monitor cleanup (persist stats and stop workers)
    from monitor import get_monitor
    try:
        await get_monitor().cleanup()
    except Exception as e:
        logger.error(f"Monitor cleanup failed: {e}")
    await close_all()
 async def _timeline_updater():
    """Update timeline data every 5 seconds."""
    from monitor import get_monitor
    while True:
        await asyncio.sleep(5)
        try:
            await asyncio.wait_for(get_monitor().update_timeline(), timeout=4.0)
        except asyncio.TimeoutError:
            logger.warning("Timeline update timeout after 4s")
        except Exception as e:
            logger.warning(f"Timeline update error: {e}")
 # ───────────────────── FastAPI instance ──────────────────────
 app = FastAPI(
    title=config["app"]["title"],
@@ -129,6 +175,25 @@ app.mount(
    name="play",
 )
 # ── static monitor dashboard ────────────────────────────────
 MONITOR_DIR = pathlib.Path(__file__).parent / "static" / "monitor"
 if not MONITOR_DIR.exists():
    raise RuntimeError(f"Monitor assets not found at {MONITOR_DIR}")
 app.mount(
    "/dashboard",
    StaticFiles(directory=MONITOR_DIR, html=True),
    name="monitor_ui",
 )
 # ── static assets (logo, etc) ────────────────────────────────
 ASSETS_DIR = pathlib.Path(__file__).parent / "static" / "assets"
 if ASSETS_DIR.exists():
    app.mount(
        "/static/assets",
        StaticFiles(directory=ASSETS_DIR),
        name="assets",
    )
@app.get("/")
 async def root():
@@ -212,6 +277,12 @@ def _safe_eval_config(expr: str) -> dict:
 # ── job router ──────────────────────────────────────────────
 app.include_router(init_job_router(redis, config, token_dep))
 # ── monitor router ──────────────────────────────────────────
 from monitor_routes import router as monitor_router
 app.include_router(monitor_router)
 logger = logging.getLogger(__name__)
 # ──────────────────────── Endpoints ──────────────────────────
@app.post("/token")
 async def get_token(req: TokenRequest):
@@ -266,27 +337,20 @@ async def generate_html(
    Crawls the URL, preprocesses the raw HTML for schema extraction, and returns the processed HTML.
    Use when you need sanitized HTML structures for building schemas or further processing.
    """
    from crawler_pool import get_crawler
    cfg = CrawlerRunConfig()
    try:
-        async with AsyncWebCrawler(config=BrowserConfig()) as crawler:
+        crawler = await get_crawler(get_default_browser_config())
        results = await crawler.arun(url=body.url, config=cfg)
        # Check if the crawl was successful
        if not results[0].success:
-            raise HTTPException(
-                status_code=500,
-                detail=results[0].error_message or "Crawl failed"
-            )
+            raise HTTPException(500, detail=results[0].error_message or "Crawl failed")
        raw_html = results[0].html
        from crawl4ai.utils import preprocess_html_for_schema
        processed_html = preprocess_html_for_schema(raw_html)
        return JSONResponse({"html": processed_html, "url": body.url, "success": True})
    except Exception as e:
-        # Log and raise as HTTP 500 for other exceptions
-        raise HTTPException(
-            status_code=500,
-            detail=str(e)
-        )
+        raise HTTPException(500, detail=str(e))
 # Screenshot endpoint
@@ -304,16 +368,13 @@ async def generate_screenshot(
    Use when you need an image snapshot of the rendered page. It is recommended to provide an output path to save the screenshot.
    Then in result instead of the screenshot you will get a path to the saved file.
    """
    from crawler_pool import get_crawler
    try:
-        cfg = CrawlerRunConfig(
-            screenshot=True, screenshot_wait_for=body.screenshot_wait_for)
-        async with AsyncWebCrawler(config=BrowserConfig()) as crawler:
+        cfg = CrawlerRunConfig(screenshot=True, screenshot_wait_for=body.screenshot_wait_for)
+        crawler = await get_crawler(get_default_browser_config())
        results = await crawler.arun(url=body.url, config=cfg)
        if not results[0].success:
-            raise HTTPException(
-                status_code=500,
-                detail=results[0].error_message or "Crawl failed"
-            )
+            raise HTTPException(500, detail=results[0].error_message or "Crawl failed")
        screenshot_data = results[0].screenshot
        if body.output_path:
            abs_path = os.path.abspath(body.output_path)
@@ -323,10 +384,7 @@ async def generate_screenshot(
            return {"success": True, "path": abs_path}
        return {"success": True, "screenshot": screenshot_data}
    except Exception as e:
-        raise HTTPException(
-            status_code=500,
-            detail=str(e)
-        )
+        raise HTTPException(500, detail=str(e))
 # PDF endpoint
@@ -344,15 +402,13 @@ async def generate_pdf(
    Use when you need a printable or archivable snapshot of the page. It is recommended to provide an output path to save the PDF.
    Then in result instead of the PDF you will get a path to the saved file.
    """
    from crawler_pool import get_crawler
    try:
        cfg = CrawlerRunConfig(pdf=True)
-        async with AsyncWebCrawler(config=BrowserConfig()) as crawler:
+        crawler = await get_crawler(get_default_browser_config())
        results = await crawler.arun(url=body.url, config=cfg)
        if not results[0].success:
-            raise HTTPException(
-                status_code=500,
-                detail=results[0].error_message or "Crawl failed"
-            )
+            raise HTTPException(500, detail=results[0].error_message or "Crawl failed")
        pdf_data = results[0].pdf
        if body.output_path:
            abs_path = os.path.abspath(body.output_path)
@@ -362,10 +418,7 @@ async def generate_pdf(
            return {"success": True, "path": abs_path}
        return {"success": True, "pdf": base64.b64encode(pdf_data).decode()}
    except Exception as e:
-        raise HTTPException(
-            status_code=500,
-            detail=str(e)
-        )
+        raise HTTPException(500, detail=str(e))
@app.post("/execute_js")
@@ -421,23 +474,17 @@ async def execute_js(
        ```
    """
    from crawler_pool import get_crawler
    try:
        cfg = CrawlerRunConfig(js_code=body.scripts)
-        async with AsyncWebCrawler(config=BrowserConfig()) as crawler:
+        crawler = await get_crawler(get_default_browser_config())
        results = await crawler.arun(url=body.url, config=cfg)
        if not results[0].success:
-            raise HTTPException(
-                status_code=500,
-                detail=results[0].error_message or "Crawl failed"
-            )
+            raise HTTPException(500, detail=results[0].error_message or "Crawl failed")
        # Return JSON-serializable dict of the first CrawlResult
        data = results[0].model_dump()
        return JSONResponse(data)
    except Exception as e:
-        raise HTTPException(
-            status_code=500,
-            detail=str(e)
-        )
+        raise HTTPException(500, detail=str(e))
@app.get("/llm/{url:path}")
--- a/deploy/docker/static/assets/crawl4ai-logo.jpg
+++ b/deploy/docker/static/assets/crawl4ai-logo.jpg
--- a/deploy/docker/static/assets/crawl4ai-logo.png
+++ b/deploy/docker/static/assets/crawl4ai-logo.png
--- a/deploy/docker/static/assets/logo.png
+++ b/deploy/docker/static/assets/logo.png
--- a/deploy/docker/static/monitor/index.html
+++ b/deploy/docker/static/monitor/index.html
--- a/deploy/docker/static/playground/index.html
+++ b/deploy/docker/static/playground/index.html
@@ -167,12 +167,15 @@
            </a>
        </h1>
-        <div class="ml-auto flex space-x-2">
+        <div class="ml-auto flex items-center space-x-4">
            <a href="/dashboard" class="text-xs text-secondary hover:text-primary underline">Monitor</a>
            <div class="flex space-x-2">
                <button id="play-tab"
                    class="px-3 py-1 rounded-t bg-surface border border-b-0 border-border text-primary">Playground</button>
                <button id="stress-tab" class="px-3 py-1 rounded-t border border-border hover:bg-surface">Stress
                    Test</button>
            </div>
        </div>
    </header>
    <!-- Main Playground -->
--- a/deploy/docker/test-websocket.py
+++ b/deploy/docker/test-websocket.py
@@ -0,0 +1,34 @@
 #!/usr/bin/env python3
 """
 Quick WebSocket test - Connect to monitor WebSocket and print updates
 """
 import asyncio
 import websockets
 import json
 async def test_websocket():
    uri = "ws://localhost:11235/monitor/ws"
    print(f"Connecting to {uri}...")
    try:
        async with websockets.connect(uri) as websocket:
            print("✅ Connected!")
            # Receive and print 5 updates
            for i in range(5):
                message = await websocket.recv()
                data = json.loads(message)
                print(f"\n📊 Update #{i+1}:")
                print(f"  - Health: CPU {data['health']['container']['cpu_percent']}%, Memory {data['health']['container']['memory_percent']}%")
                print(f"  - Active Requests: {len(data['requests']['active'])}")
                print(f"  - Browsers: {len(data['browsers'])}")
    except Exception as e:
        print(f"❌ Error: {e}")
        return 1
    print("\n✅ WebSocket test passed!")
    return 0
 if __name__ == "__main__":
    exit(asyncio.run(test_websocket()))
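Review note: the script above reads a handful of keys from each WebSocket message. A small sketch of that parsing step in isolation follows, so it can be exercised without a running server. The function name `summarize_update` and the sample payload are hypothetical; only the keys that `test-websocket.py` already reads are assumed to exist.

```python
import json


def summarize_update(message: str) -> dict:
    """Reduce one monitor update (a JSON string) to a few headline numbers."""
    data = json.loads(message)
    container = data["health"]["container"]
    return {
        "cpu_percent": container["cpu_percent"],
        "memory_percent": container["memory_percent"],
        "active_requests": len(data["requests"]["active"]),
        "browsers": len(data["browsers"]),
    }


# Hypothetical payload: only the keys read by test-websocket.py are filled in.
sample_message = json.dumps({
    "health": {"container": {"cpu_percent": 12.5, "memory_percent": 40.0}},
    "requests": {"active": [{"id": "req-1"}], "completed": []},
    "browsers": [{"sig": "a1b2c3d4"}, {"sig": "ffee0011"}],
})

print(summarize_update(sample_message))
```

Keeping the parsing separate from the socket loop would let the same function back a unit test and the live script.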
--- a/deploy/docker/tests/demo_monitor_dashboard.py
+++ b/deploy/docker/tests/demo_monitor_dashboard.py
@@ -0,0 +1,164 @@
 #!/usr/bin/env python3
 """
 Monitor Dashboard Demo Script
 Generates varied activity to showcase all monitoring features for video recording.
 """
 import httpx
 import asyncio
 import time
 from datetime import datetime
 BASE_URL = "http://localhost:11235"
 async def demo_dashboard():
    print("🎬 Monitor Dashboard Demo - Starting...\n")
    print(f"📊 Dashboard: {BASE_URL}/dashboard")
    print("=" * 60)
    async with httpx.AsyncClient(timeout=60.0) as client:
        # Phase 1: Simple requests (permanent browser)
        print("\n🔷 Phase 1: Testing permanent browser pool")
        print("-" * 60)
        for i in range(5):
            print(f"  {i+1}/5 Request to /crawl (default config)...")
            try:
                r = await client.post(
                    f"{BASE_URL}/crawl",
                    json={"urls": [f"https://httpbin.org/html?req={i}"], "crawler_config": {}}
                )
                print(f"     ✅ Status: {r.status_code}, Time: {r.elapsed.total_seconds():.2f}s")
            except Exception as e:
                print(f"     ❌ Error: {e}")
            await asyncio.sleep(1)  # Small delay between requests
        # Phase 2: Create variant browsers (different configs)
        print("\n🔶 Phase 2: Testing cold→hot pool promotion")
        print("-" * 60)
        viewports = [
            {"width": 1920, "height": 1080},
            {"width": 1280, "height": 720},
            {"width": 800, "height": 600}
        ]
        for idx, viewport in enumerate(viewports):
            print(f"  Viewport {viewport['width']}x{viewport['height']}:")
            for i in range(4):  # 4 requests each to trigger promotion at 3
                try:
                    r = await client.post(
                        f"{BASE_URL}/crawl",
                        json={
                            "urls": [f"https://httpbin.org/json?v={idx}&r={i}"],
                            "browser_config": {"viewport": viewport},
                            "crawler_config": {}
                        }
                    )
                    print(f"    {i+1}/4 ✅ {r.status_code} - Should see cold→hot after 3 uses")
                except Exception as e:
                    print(f"    {i+1}/4 ❌ {e}")
                await asyncio.sleep(0.5)
        # Phase 3: Concurrent burst (stress pool)
        print("\n🔷 Phase 3: Concurrent burst (10 parallel)")
        print("-" * 60)
        tasks = []
        for i in range(10):
            tasks.append(
                client.post(
                    f"{BASE_URL}/crawl",
                    json={"urls": [f"https://httpbin.org/delay/2?burst={i}"], "crawler_config": {}}
                )
            )
        print("  Sending 10 concurrent requests...")
        start = time.time()
        results = await asyncio.gather(*tasks, return_exceptions=True)
        elapsed = time.time() - start
        successes = sum(1 for r in results if not isinstance(r, Exception) and r.status_code == 200)
        print(f"  ✅ {successes}/10 succeeded in {elapsed:.2f}s")
        # Phase 4: Multi-endpoint coverage
        print("\n🔶 Phase 4: Testing multiple endpoints")
        print("-" * 60)
        endpoints = [
            ("/md", {"url": "https://httpbin.org/html", "f": "fit", "c": "0"}),
            ("/screenshot", {"url": "https://httpbin.org/html"}),
            ("/pdf", {"url": "https://httpbin.org/html"}),
        ]
        for endpoint, payload in endpoints:
            print(f"  Testing {endpoint}...")
            try:
                # Every endpoint in this phase takes the same JSON POST
                r = await client.post(f"{BASE_URL}{endpoint}", json=payload)
                print(f"    ✅ {r.status_code}")
            except Exception as e:
                print(f"    ❌ {e}")
            await asyncio.sleep(1)
        # Phase 5: Intentional error (to populate errors tab)
        print("\n🔷 Phase 5: Generating error examples")
        print("-" * 60)
        print("  Triggering invalid URL error...")
        try:
            r = await client.post(
                f"{BASE_URL}/crawl",
                json={"urls": ["invalid://bad-url"], "crawler_config": {}}
            )
            print(f"    Response: {r.status_code}")
        except Exception as e:
            print(f"    ✅ Error captured: {type(e).__name__}")
        # Phase 6: Wait for janitor activity
        print("\n🔶 Phase 6: Waiting for janitor cleanup...")
        print("-" * 60)
        print("  Idle for 40s to allow janitor to clean cold pool browsers...")
        for i in range(40, 0, -10):
            print(f"    {i}s remaining... (Check dashboard for cleanup events)")
            await asyncio.sleep(10)
        # Phase 7: Final stats check
        print("\n🔷 Phase 7: Final dashboard state")
        print("-" * 60)
        r = await client.get(f"{BASE_URL}/monitor/health")
        health = r.json()
        print(f"  Memory: {health['container']['memory_percent']:.1f}%")
        print(f"  Browsers: Perm={health['pool']['permanent']['active']}, "
              f"Hot={health['pool']['hot']['count']}, Cold={health['pool']['cold']['count']}")
        r = await client.get(f"{BASE_URL}/monitor/endpoints/stats")
        stats = r.json()
        print(f"\n  Endpoint Stats:")
        for endpoint, data in stats.items():
            print(f"    {endpoint}: {data['count']} req, "
                  f"{data['avg_latency_ms']:.0f}ms avg, "
                  f"{data['success_rate_percent']:.1f}% success")
        r = await client.get(f"{BASE_URL}/monitor/browsers")
        browsers = r.json()
        print(f"\n  Pool Efficiency:")
        print(f"    Total browsers: {browsers['summary']['total_count']}")
        print(f"    Memory usage: {browsers['summary']['total_memory_mb']} MB")
        print(f"    Reuse rate: {browsers['summary']['reuse_rate_percent']:.1f}%")
    print("\n" + "=" * 60)
    print("✅ Demo complete! Dashboard is now populated with rich data.")
    print(f"\n📹 Recording tip: Refresh {BASE_URL}/dashboard")
    print("   You should see:")
    print("   • Active & completed requests")
    print("   • Browser pool (permanent + hot/cold)")
    print("   • Janitor cleanup events")
    print("   • Endpoint analytics")
    print("   • Memory timeline")
 if __name__ == "__main__":
    try:
        asyncio.run(demo_dashboard())
    except KeyboardInterrupt:
        print("\n\n⚠️  Demo interrupted by user")
    except Exception as e:
        print(f"\n\n❌ Demo failed: {e}")
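Review note: Phase 3 of the demo relies on `asyncio.gather(..., return_exceptions=True)` and then counts only the responses that are not exceptions and have status 200. The sketch below isolates that counting pattern with stand-in coroutines. `FakeResponse`, `fake_post`, and `run_burst` are hypothetical placeholders for the `httpx` calls and make no network requests.

```python
import asyncio


class FakeResponse:
    """Stand-in for an HTTP response; only status_code is modelled."""

    def __init__(self, status_code: int):
        self.status_code = status_code


async def fake_post(i: int) -> FakeResponse:
    """Pretend request: multiples of 5 raise, odd ids succeed, the rest return 500."""
    await asyncio.sleep(0)
    if i % 5 == 0:
        raise RuntimeError(f"simulated failure for request {i}")
    return FakeResponse(200 if i % 2 else 500)


def count_successes(results) -> int:
    """Count entries that are real responses with status 200."""
    return sum(
        1 for r in results
        if not isinstance(r, Exception) and r.status_code == 200
    )


async def run_burst(n: int) -> int:
    # return_exceptions=True keeps one failure from cancelling the whole burst;
    # raised exceptions come back as items in the results list instead.
    results = await asyncio.gather(
        *(fake_post(i) for i in range(n)), return_exceptions=True
    )
    return count_successes(results)


if __name__ == "__main__":
    print(asyncio.run(run_burst(10)))
```

Without `return_exceptions=True`, the first raised exception would propagate out of `gather` and the remaining results would be lost to the caller.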
--- a/deploy/docker/tests/requirements.txt
+++ b/deploy/docker/tests/requirements.txt
@@ -0,0 +1,2 @@
 httpx>=0.25.0
 docker>=7.0.0
--- a/deploy/docker/tests/test_1_basic.py
+++ b/deploy/docker/tests/test_1_basic.py
@@ -0,0 +1,138 @@
 #!/usr/bin/env python3
 """
 Test 1: Basic Container Health + Single Endpoint
 - Starts container
 - Hits /health endpoint 10 times
 - Reports success rate and basic latency
 """
 import asyncio
 import time
 import docker
 import httpx
 # Config
 IMAGE = "crawl4ai-local:latest"
 CONTAINER_NAME = "crawl4ai-test"
 PORT = 11235
 REQUESTS = 10
 async def test_endpoint(url: str, count: int):
    """Hit endpoint multiple times, return stats."""
    results = []
    async with httpx.AsyncClient(timeout=30.0) as client:
        for i in range(count):
            start = time.time()
            try:
                resp = await client.get(url)
                elapsed = (time.time() - start) * 1000  # ms
                results.append({
                    "success": resp.status_code == 200,
                    "latency_ms": elapsed,
                    "status": resp.status_code
                })
                print(f"  [{i+1}/{count}] ✓ {resp.status_code} - {elapsed:.0f}ms")
            except Exception as e:
                results.append({
                    "success": False,
                    "latency_ms": None,
                    "error": str(e)
                })
                print(f"  [{i+1}/{count}] ✗ Error: {e}")
    return results
 def start_container(client, image: str, name: str, port: int):
    """Start container, return container object."""
    # Clean up existing
    try:
        old = client.containers.get(name)
        print(f"🧹 Stopping existing container '{name}'...")
        old.stop()
        old.remove()
    except docker.errors.NotFound:
        pass
    print(f"🚀 Starting container '{name}' from image '{image}'...")
    container = client.containers.run(
        image,
        name=name,
        ports={f"{port}/tcp": port},
        detach=True,
        shm_size="1g",
        environment={"PYTHON_ENV": "production"}
    )
    # Wait for health
    print(f"⏳ Waiting for container to be healthy...")
    for _ in range(30):  # 30s timeout
        time.sleep(1)
        container.reload()
        if container.status == "running":
            try:
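                # Quick health check (httpx is already imported; requests is not in requirements.txt)
                resp = httpx.get(f"http://localhost:{port}/health", timeout=2)
                if resp.status_code == 200:
                    print("✅ Container healthy!")
                    return container
            except Exception:
                # Server is not accepting connections yet; retry on the next tick
                pass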
    raise TimeoutError("Container failed to start")
 def stop_container(container):
    """Stop and remove container."""
    print(f"🛑 Stopping container...")
    container.stop()
    container.remove()
    print(f"✅ Container removed")
 async def main():
    print("="*60)
    print("TEST 1: Basic Container Health + Single Endpoint")
    print("="*60)
    client = docker.from_env()
    container = None
    try:
        # Start container
        container = start_container(client, IMAGE, CONTAINER_NAME, PORT)
        # Test /health endpoint
        print(f"\n📊 Testing /health endpoint ({REQUESTS} requests)...")
        url = f"http://localhost:{PORT}/health"
        results = await test_endpoint(url, REQUESTS)
        # Calculate stats
        successes = sum(1 for r in results if r["success"])
        success_rate = (successes / len(results)) * 100
        latencies = [r["latency_ms"] for r in results if r["latency_ms"] is not None]
        avg_latency = sum(latencies) / len(latencies) if latencies else 0
        # Print results
        print(f"\n{'='*60}")
        print(f"RESULTS:")
        print(f"  Success Rate: {success_rate:.1f}% ({successes}/{len(results)})")
        print(f"  Avg Latency:  {avg_latency:.0f}ms")
        if latencies:
            print(f"  Min Latency:  {min(latencies):.0f}ms")
            print(f"  Max Latency:  {max(latencies):.0f}ms")
        print(f"{'='*60}")
        # Pass/Fail
        if success_rate >= 100:
            print(f"✅ TEST PASSED")
            return 0
        else:
            print(f"❌ TEST FAILED (expected 100% success rate)")
            return 1
    except Exception as e:
        print(f"\n❌ TEST ERROR: {e}")
        return 1
    finally:
        if container:
            stop_container(container)
 if __name__ == "__main__":
    exit_code = asyncio.run(main())
    exit(exit_code)
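Review note: the statistics computed in `main()` above (success rate, average, minimum, and maximum latency) could live in one helper that also tolerates an empty result list. This is a sketch under that assumption. `summarize_results` is a hypothetical name, and the sample results are made up for illustration.

```python
from typing import Dict, List, Optional


def summarize_results(results: List[Dict]) -> Dict[str, Optional[float]]:
    """Compute success rate and latency stats from per-request result dicts."""
    total = len(results)
    successes = sum(1 for r in results if r.get("success"))
    # Failed requests record latency_ms as None (or omit it), so filter those out.
    latencies = [r["latency_ms"] for r in results if r.get("latency_ms") is not None]
    return {
        "success_rate": (successes / total * 100) if total else 0.0,
        "avg_latency_ms": (sum(latencies) / len(latencies)) if latencies else 0.0,
        "min_latency_ms": min(latencies) if latencies else None,
        "max_latency_ms": max(latencies) if latencies else None,
    }


# Made-up results in the same shape that test_endpoint() returns.
sample_results = [
    {"success": True, "latency_ms": 10.0, "status": 200},
    {"success": True, "latency_ms": 30.0, "status": 200},
    {"success": False, "latency_ms": None, "error": "timeout"},
]

print(summarize_results(sample_results))
```

The guard on `total` matters only if the request count is ever set to zero; with `REQUESTS = 10` the original division is safe.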
--- a/deploy/docker/tests/test_2_memory.py
+++ b/deploy/docker/tests/test_2_memory.py
@@ -0,0 +1,205 @@
 #!/usr/bin/env python3
 """
 Test 2: Docker Stats Monitoring
 - Extends Test 1 with real-time container stats
 - Monitors memory % and CPU during requests
 - Reports baseline, peak, and final memory
 """
 import asyncio
 import time
 import docker
 import httpx
 from threading import Thread, Event
 # Config
 IMAGE = "crawl4ai-local:latest"
 CONTAINER_NAME = "crawl4ai-test"
 PORT = 11235
 REQUESTS = 20  # More requests to see memory usage
 # Stats tracking
 stats_history = []
 stop_monitoring = Event()
 def monitor_stats(container):
    """Background thread to collect container stats."""
    for stat in container.stats(decode=True, stream=True):
        if stop_monitoring.is_set():
            break
        try:
            # Extract memory stats
            mem_usage = stat['memory_stats'].get('usage', 0) / (1024 * 1024)  # MB
            mem_limit = stat['memory_stats'].get('limit', 1) / (1024 * 1024)
            mem_percent = (mem_usage / mem_limit * 100) if mem_limit > 0 else 0
            # Extract CPU stats (handle missing fields on Mac)
            cpu_percent = 0
            try:
                cpu_delta = stat['cpu_stats']['cpu_usage']['total_usage'] - \
                           stat['precpu_stats']['cpu_usage']['total_usage']
                system_delta = stat['cpu_stats'].get('system_cpu_usage', 0) - \
                              stat['precpu_stats'].get('system_cpu_usage', 0)
                if system_delta > 0:
                    num_cpus = stat['cpu_stats'].get('online_cpus', 1)
                    cpu_percent = (cpu_delta / system_delta * num_cpus * 100.0)
            except (KeyError, ZeroDivisionError):
                pass
            stats_history.append({
                'timestamp': time.time(),
                'memory_mb': mem_usage,
                'memory_percent': mem_percent,
                'cpu_percent': cpu_percent
            })
        except Exception as e:
            # Skip malformed stats
            pass
        time.sleep(0.5)  # Sample every 500ms
 async def test_endpoint(url: str, count: int):
    """Hit endpoint, return stats."""
    results = []
    async with httpx.AsyncClient(timeout=30.0) as client:
        for i in range(count):
            start = time.time()
            try:
                resp = await client.get(url)
                elapsed = (time.time() - start) * 1000
                results.append({
                    "success": resp.status_code == 200,
                    "latency_ms": elapsed,
                })
                if (i + 1) % 5 == 0:  # Print every 5 requests
                    print(f"  [{i+1}/{count}] ✓ {resp.status_code} - {elapsed:.0f}ms")
            except Exception as e:
                results.append({"success": False, "error": str(e)})
                print(f"  [{i+1}/{count}] ✗ Error: {e}")
    return results
 def start_container(client, image: str, name: str, port: int):
    """Start container."""
    try:
        old = client.containers.get(name)
        print(f"🧹 Stopping existing container '{name}'...")
        old.stop()
        old.remove()
    except docker.errors.NotFound:
        pass
    print(f"🚀 Starting container '{name}'...")
    container = client.containers.run(
        image,
        name=name,
        ports={f"{port}/tcp": port},
        detach=True,
        shm_size="1g",
        mem_limit="4g",  # Set explicit memory limit
    )
    print(f"⏳ Waiting for health...")
    for _ in range(30):
        time.sleep(1)
        container.reload()
        if container.status == "running":
            try:
                import requests
                resp = requests.get(f"http://localhost:{port}/health", timeout=2)
                if resp.status_code == 200:
                    print(f"✅ Container healthy!")
                    return container
            except Exception:  # Not ready yet; keep polling without swallowing Ctrl-C
                pass
    raise TimeoutError("Container failed to start")
 def stop_container(container):
    """Stop container."""
    print(f"🛑 Stopping container...")
    container.stop()
    container.remove()
 async def main():
    print("="*60)
    print("TEST 2: Docker Stats Monitoring")
    print("="*60)
    client = docker.from_env()
    container = None
    monitor_thread = None
    try:
        # Start container
        container = start_container(client, IMAGE, CONTAINER_NAME, PORT)
        # Start stats monitoring in background
        print(f"\n📊 Starting stats monitor...")
        stop_monitoring.clear()
        stats_history.clear()
        monitor_thread = Thread(target=monitor_stats, args=(container,), daemon=True)
        monitor_thread.start()
        # Wait a bit for baseline
        await asyncio.sleep(2)
        baseline_mem = stats_history[-1]['memory_mb'] if stats_history else 0
        print(f"📏 Baseline memory: {baseline_mem:.1f} MB")
        # Test /health endpoint
        print(f"\n🔄 Running {REQUESTS} requests to /health...")
        url = f"http://localhost:{PORT}/health"
        results = await test_endpoint(url, REQUESTS)
        # Wait a bit to capture peak
        await asyncio.sleep(1)
        # Stop monitoring
        stop_monitoring.set()
        if monitor_thread:
            monitor_thread.join(timeout=2)
        # Calculate stats
        successes = sum(1 for r in results if r.get("success"))
        success_rate = (successes / len(results)) * 100
        latencies = [r["latency_ms"] for r in results if "latency_ms" in r]
        avg_latency = sum(latencies) / len(latencies) if latencies else 0
        # Memory stats
        memory_samples = [s['memory_mb'] for s in stats_history]
        peak_mem = max(memory_samples) if memory_samples else 0
        final_mem = memory_samples[-1] if memory_samples else 0
        mem_delta = final_mem - baseline_mem
        # Print results
        print(f"\n{'='*60}")
        print(f"RESULTS:")
        print(f"  Success Rate: {success_rate:.1f}% ({successes}/{len(results)})")
        print(f"  Avg Latency:  {avg_latency:.0f}ms")
        print(f"\n  Memory Stats:")
        print(f"    Baseline: {baseline_mem:.1f} MB")
        print(f"    Peak:     {peak_mem:.1f} MB")
        print(f"    Final:    {final_mem:.1f} MB")
        print(f"    Delta:    {mem_delta:+.1f} MB")
        print(f"{'='*60}")
        # Pass/Fail
        if success_rate >= 100 and mem_delta < 100:  # No significant memory growth
            print(f"✅ TEST PASSED")
            return 0
        else:
            if success_rate < 100:
                print(f"❌ TEST FAILED (success rate < 100%)")
            if mem_delta >= 100:
                print(f"❌ TEST FAILED (memory grew by {mem_delta:.1f} MB, limit is 100 MB)")
            return 1
    except Exception as e:
        print(f"\n❌ TEST ERROR: {e}")
        return 1
    finally:
        stop_monitoring.set()
        if container:
            stop_container(container)
 if __name__ == "__main__":
    exit_code = asyncio.run(main())
    raise SystemExit(exit_code)
--- a/deploy/docker/tests/test_3_pool.py
+++ b/deploy/docker/tests/test_3_pool.py
@@ -0,0 +1,229 @@
 #!/usr/bin/env python3
 """
 Test 3: Pool Validation - Permanent Browser Reuse
 - Tests /html endpoint (should use permanent browser)
 - Monitors container logs for pool hit markers
 - Validates browser reuse rate
 - Checks memory after browser creation
 """
 import asyncio
 import time
 import docker
 import httpx
 from threading import Thread, Event
 # Config
 IMAGE = "crawl4ai-local:latest"
 CONTAINER_NAME = "crawl4ai-test"
 PORT = 11235
 REQUESTS = 30
 # Stats tracking
 stats_history = []
 stop_monitoring = Event()
 def monitor_stats(container):
    """Background stats collector."""
    for stat in container.stats(decode=True, stream=True):
        if stop_monitoring.is_set():
            break
        try:
            mem_usage = stat['memory_stats'].get('usage', 0) / (1024 * 1024)
            stats_history.append({
                'timestamp': time.time(),
                'memory_mb': mem_usage,
            })
        except Exception:  # Skip malformed stats samples
            pass
        time.sleep(0.5)
 def count_log_markers(container):
    """Extract pool usage markers from logs."""
    logs = container.logs().decode('utf-8')
    permanent_hits = logs.count("🔥 Using permanent browser")
    hot_hits = logs.count("♨️  Using hot pool browser")
    cold_hits = logs.count("❄️  Using cold pool browser")
    new_created = logs.count("🆕 Creating new browser")
    return {
        'permanent_hits': permanent_hits,
        'hot_hits': hot_hits,
        'cold_hits': cold_hits,
        'new_created': new_created,
        'total_hits': permanent_hits + hot_hits + cold_hits
    }
 async def test_endpoint(url: str, count: int):
    """Hit endpoint multiple times."""
    results = []
    async with httpx.AsyncClient(timeout=60.0) as client:
        for i in range(count):
            start = time.time()
            try:
                resp = await client.post(url, json={"url": "https://httpbin.org/html"})
                elapsed = (time.time() - start) * 1000
                results.append({
                    "success": resp.status_code == 200,
                    "latency_ms": elapsed,
                })
                if (i + 1) % 10 == 0:
                    print(f"  [{i+1}/{count}] ✓ {resp.status_code} - {elapsed:.0f}ms")
            except Exception as e:
                results.append({"success": False, "error": str(e)})
                print(f"  [{i+1}/{count}] ✗ Error: {e}")
    return results
 def start_container(client, image: str, name: str, port: int):
    """Start container."""
    try:
        old = client.containers.get(name)
        print(f"🧹 Stopping existing container...")
        old.stop()
        old.remove()
    except docker.errors.NotFound:
        pass
    print(f"🚀 Starting container...")
    container = client.containers.run(
        image,
        name=name,
        ports={f"{port}/tcp": port},
        detach=True,
        shm_size="1g",
        mem_limit="4g",
    )
    print(f"⏳ Waiting for health...")
    for _ in range(30):
        time.sleep(1)
        container.reload()
        if container.status == "running":
            try:
                import requests
                resp = requests.get(f"http://localhost:{port}/health", timeout=2)
                if resp.status_code == 200:
                    print(f"✅ Container healthy!")
                    return container
            except Exception:  # Not ready yet; keep polling without swallowing Ctrl-C
                pass
    raise TimeoutError("Container failed to start")
 def stop_container(container):
    """Stop container."""
    print(f"🛑 Stopping container...")
    container.stop()
    container.remove()
 async def main():
    print("="*60)
    print("TEST 3: Pool Validation - Permanent Browser Reuse")
    print("="*60)
    client = docker.from_env()
    container = None
    monitor_thread = None
    try:
        # Start container
        container = start_container(client, IMAGE, CONTAINER_NAME, PORT)
        # Wait for permanent browser initialization
        print(f"\n⏳ Waiting for permanent browser init (3s)...")
        await asyncio.sleep(3)
        # Start stats monitoring
        print(f"📊 Starting stats monitor...")
        stop_monitoring.clear()
        stats_history.clear()
        monitor_thread = Thread(target=monitor_stats, args=(container,), daemon=True)
        monitor_thread.start()
        await asyncio.sleep(1)
        baseline_mem = stats_history[-1]['memory_mb'] if stats_history else 0
        print(f"📏 Baseline (with permanent browser): {baseline_mem:.1f} MB")
        # Test /html endpoint (uses permanent browser for default config)
        print(f"\n🔄 Running {REQUESTS} requests to /html...")
        url = f"http://localhost:{PORT}/html"
        results = await test_endpoint(url, REQUESTS)
        # Wait a bit
        await asyncio.sleep(1)
        # Stop monitoring
        stop_monitoring.set()
        if monitor_thread:
            monitor_thread.join(timeout=2)
        # Analyze logs for pool markers
        print(f"\n📋 Analyzing pool usage...")
        pool_stats = count_log_markers(container)
        # Calculate request stats
        successes = sum(1 for r in results if r.get("success"))
        success_rate = (successes / len(results)) * 100
        latencies = [r["latency_ms"] for r in results if "latency_ms" in r]
        avg_latency = sum(latencies) / len(latencies) if latencies else 0
        # Memory stats
        memory_samples = [s['memory_mb'] for s in stats_history]
        peak_mem = max(memory_samples) if memory_samples else 0
        final_mem = memory_samples[-1] if memory_samples else 0
        mem_delta = final_mem - baseline_mem
        # Calculate reuse rate
        total_requests = len(results)
        total_pool_hits = pool_stats['total_hits']
        reuse_rate = (total_pool_hits / total_requests * 100) if total_requests > 0 else 0
        # Print results
        print(f"\n{'='*60}")
        print(f"RESULTS:")
        print(f"  Success Rate: {success_rate:.1f}% ({successes}/{len(results)})")
        print(f"  Avg Latency:  {avg_latency:.0f}ms")
        print(f"\n  Pool Stats:")
        print(f"    🔥 Permanent Hits: {pool_stats['permanent_hits']}")
        print(f"    ♨️  Hot Pool Hits:   {pool_stats['hot_hits']}")
        print(f"    ❄️  Cold Pool Hits:  {pool_stats['cold_hits']}")
        print(f"    🆕 New Created:    {pool_stats['new_created']}")
        print(f"    📊 Reuse Rate:     {reuse_rate:.1f}%")
        print(f"\n  Memory Stats:")
        print(f"    Baseline: {baseline_mem:.1f} MB")
        print(f"    Peak:     {peak_mem:.1f} MB")
        print(f"    Final:    {final_mem:.1f} MB")
        print(f"    Delta:    {mem_delta:+.1f} MB")
        print(f"{'='*60}")
        # Pass/Fail
        passed = True
        if success_rate < 100:
            print(f"❌ FAIL: Success rate {success_rate:.1f}% < 100%")
            passed = False
        if reuse_rate < 80:
            print(f"❌ FAIL: Reuse rate {reuse_rate:.1f}% < 80% (expected high permanent browser usage)")
            passed = False
        if pool_stats['permanent_hits'] < (total_requests * 0.8):
            print(f"⚠️  WARNING: Only {pool_stats['permanent_hits']} permanent hits out of {total_requests} requests")
        if mem_delta > 200:
            print(f"⚠️  WARNING: Memory grew by {mem_delta:.1f} MB (possible browser leak)")
        if passed:
            print(f"✅ TEST PASSED")
            return 0
        else:
            return 1
    except Exception as e:
        print(f"\n❌ TEST ERROR: {e}")
        import traceback
        traceback.print_exc()
        return 1
    finally:
        stop_monitoring.set()
        if container:
            stop_container(container)
 if __name__ == "__main__":
    exit_code = asyncio.run(main())
    raise SystemExit(exit_code)
--- a/deploy/docker/tests/test_4_concurrent.py
+++ b/deploy/docker/tests/test_4_concurrent.py
@@ -0,0 +1,236 @@
 #!/usr/bin/env python3
 """
 Test 4: Concurrent Load Testing
 - Tests pool under concurrent load
 - Escalates: 10 → 50 → 100 concurrent requests
 - Validates latency distribution (P50, P95, P99)
 - Monitors memory stability
 """
 import asyncio
 import time
 import docker
 import httpx
 from threading import Thread, Event
 from collections import defaultdict
 # Config
 IMAGE = "crawl4ai-local:latest"
 CONTAINER_NAME = "crawl4ai-test"
 PORT = 11235
 LOAD_LEVELS = [
    {"name": "Light", "concurrent": 10, "requests": 20},
    {"name": "Medium", "concurrent": 50, "requests": 100},
    {"name": "Heavy", "concurrent": 100, "requests": 200},
 ]
 # Stats
 stats_history = []
 stop_monitoring = Event()
 def monitor_stats(container):
    """Background stats collector."""
    for stat in container.stats(decode=True, stream=True):
        if stop_monitoring.is_set():
            break
        try:
            mem_usage = stat['memory_stats'].get('usage', 0) / (1024 * 1024)
            stats_history.append({'timestamp': time.time(), 'memory_mb': mem_usage})
        except Exception:  # Skip malformed stats samples
            pass
        time.sleep(0.5)
 def count_log_markers(container):
    """Extract pool markers."""
    logs = container.logs().decode('utf-8')
    return {
        'permanent': logs.count("🔥 Using permanent browser"),
        'hot': logs.count("♨️  Using hot pool browser"),
        'cold': logs.count("❄️  Using cold pool browser"),
        'new': logs.count("🆕 Creating new browser"),
    }
 async def hit_endpoint(client, url, payload, semaphore):
    """Single request with concurrency control."""
    async with semaphore:
        start = time.time()
        try:
            resp = await client.post(url, json=payload, timeout=60.0)
            elapsed = (time.time() - start) * 1000
            return {"success": resp.status_code == 200, "latency_ms": elapsed}
        except Exception as e:
            return {"success": False, "error": str(e)}
 async def run_concurrent_test(url, payload, concurrent, total_requests):
    """Run concurrent requests."""
    semaphore = asyncio.Semaphore(concurrent)
    async with httpx.AsyncClient() as client:
        tasks = [hit_endpoint(client, url, payload, semaphore) for _ in range(total_requests)]
        results = await asyncio.gather(*tasks)
    return results
 def calculate_percentiles(latencies):
    """Calculate P50, P95, P99 by nearest-rank index into the sorted latencies (no interpolation)."""
    if not latencies:
        return 0, 0, 0
    sorted_lat = sorted(latencies)
    n = len(sorted_lat)
    return (
        sorted_lat[int(n * 0.50)],
        sorted_lat[int(n * 0.95)],
        sorted_lat[int(n * 0.99)],
    )
 def start_container(client, image, name, port):
    """Start container."""
    try:
        old = client.containers.get(name)
        print(f"🧹 Stopping existing container...")
        old.stop()
        old.remove()
    except docker.errors.NotFound:
        pass
    print(f"🚀 Starting container...")
    container = client.containers.run(
        image, name=name, ports={f"{port}/tcp": port},
        detach=True, shm_size="1g", mem_limit="4g",
    )
    print(f"⏳ Waiting for health...")
    for _ in range(30):
        time.sleep(1)
        container.reload()
        if container.status == "running":
            try:
                import requests
                if requests.get(f"http://localhost:{port}/health", timeout=2).status_code == 200:
                    print(f"✅ Container healthy!")
                    return container
            except Exception:  # Not ready yet; keep polling without swallowing Ctrl-C
                pass
    raise TimeoutError("Container failed to start")
 async def main():
    print("="*60)
    print("TEST 4: Concurrent Load Testing")
    print("="*60)
    client = docker.from_env()
    container = None
    monitor_thread = None
    try:
        container = start_container(client, IMAGE, CONTAINER_NAME, PORT)
        print(f"\n⏳ Waiting for permanent browser init (3s)...")
        await asyncio.sleep(3)
        # Start monitoring
        stop_monitoring.clear()
        stats_history.clear()
        monitor_thread = Thread(target=monitor_stats, args=(container,), daemon=True)
        monitor_thread.start()
        await asyncio.sleep(1)
        baseline_mem = stats_history[-1]['memory_mb'] if stats_history else 0
        print(f"📏 Baseline: {baseline_mem:.1f} MB\n")
        url = f"http://localhost:{PORT}/html"
        payload = {"url": "https://httpbin.org/html"}
        all_results = []
        level_stats = []
        # Run load levels
        for level in LOAD_LEVELS:
            print(f"{'='*60}")
            print(f"🔄 {level['name']} Load: {level['concurrent']} concurrent, {level['requests']} total")
            print(f"{'='*60}")
            start_time = time.time()
            results = await run_concurrent_test(url, payload, level['concurrent'], level['requests'])
            duration = time.time() - start_time
            successes = sum(1 for r in results if r.get("success"))
            success_rate = (successes / len(results)) * 100
            latencies = [r["latency_ms"] for r in results if "latency_ms" in r]
            p50, p95, p99 = calculate_percentiles(latencies)
            avg_lat = sum(latencies) / len(latencies) if latencies else 0
            print(f"  Duration:     {duration:.1f}s")
            print(f"  Success:      {success_rate:.1f}% ({successes}/{len(results)})")
            print(f"  Avg Latency:  {avg_lat:.0f}ms")
            print(f"  P50/P95/P99:  {p50:.0f}ms / {p95:.0f}ms / {p99:.0f}ms")
            level_stats.append({
                'name': level['name'],
                'concurrent': level['concurrent'],
                'success_rate': success_rate,
                'avg_latency': avg_lat,
                'p50': p50, 'p95': p95, 'p99': p99,
            })
            all_results.extend(results)
            await asyncio.sleep(2)  # Cool down between levels
        # Stop monitoring
        await asyncio.sleep(1)
        stop_monitoring.set()
        if monitor_thread:
            monitor_thread.join(timeout=2)
        # Final stats
        pool_stats = count_log_markers(container)
        memory_samples = [s['memory_mb'] for s in stats_history]
        peak_mem = max(memory_samples) if memory_samples else 0
        final_mem = memory_samples[-1] if memory_samples else 0
        print(f"\n{'='*60}")
        print(f"FINAL RESULTS:")
        print(f"{'='*60}")
        print(f"  Total Requests: {len(all_results)}")
        print(f"\n  Pool Utilization:")
        print(f"    🔥 Permanent: {pool_stats['permanent']}")
        print(f"    ♨️  Hot:       {pool_stats['hot']}")
        print(f"    ❄️  Cold:      {pool_stats['cold']}")
        print(f"    🆕 New:       {pool_stats['new']}")
        print(f"\n  Memory:")
        print(f"    Baseline: {baseline_mem:.1f} MB")
        print(f"    Peak:     {peak_mem:.1f} MB")
        print(f"    Final:    {final_mem:.1f} MB")
        print(f"    Delta:    {final_mem - baseline_mem:+.1f} MB")
        print(f"{'='*60}")
        # Pass/Fail
        passed = True
        for ls in level_stats:
            if ls['success_rate'] < 99:
                print(f"❌ FAIL: {ls['name']} success rate {ls['success_rate']:.1f}% < 99%")
                passed = False
            if ls['p99'] > 10000:  # 10s threshold
                print(f"⚠️  WARNING: {ls['name']} P99 latency {ls['p99']:.0f}ms very high")
        if final_mem - baseline_mem > 300:
            print(f"⚠️  WARNING: Memory grew {final_mem - baseline_mem:.1f} MB")
        if passed:
            print(f"✅ TEST PASSED")
            return 0
        else:
            return 1
    except Exception as e:
        print(f"\n❌ TEST ERROR: {e}")
        import traceback
        traceback.print_exc()
        return 1
    finally:
        stop_monitoring.set()
        if container:
            print(f"🛑 Stopping container...")
            container.stop()
            container.remove()
 if __name__ == "__main__":
    exit_code = asyncio.run(main())
    raise SystemExit(exit_code)
--- a/deploy/docker/tests/test_5_pool_stress.py
+++ b/deploy/docker/tests/test_5_pool_stress.py
@@ -0,0 +1,267 @@
 #!/usr/bin/env python3
 """
 Test 5: Pool Stress - Mixed Configs
 - Tests hot/cold pool with different browser configs
 - Uses different viewports to create config variants
 - Validates cold → hot promotion after 3 uses
 - Monitors pool tier distribution
 """
 import asyncio
 import time
 import docker
 import httpx
 from threading import Thread, Event
 import random
 # Config
 IMAGE = "crawl4ai-local:latest"
 CONTAINER_NAME = "crawl4ai-test"
 PORT = 11235
 REQUESTS_PER_CONFIG = 5  # 5 requests per config variant
 # Different viewport configs to test pool tiers
 VIEWPORT_CONFIGS = [
    None,  # Default (permanent browser)
    {"width": 1920, "height": 1080},  # Desktop
    {"width": 1024, "height": 768},   # Tablet
    {"width": 375, "height": 667},    # Mobile
 ]
 # Stats
 stats_history = []
 stop_monitoring = Event()
 def monitor_stats(container):
    """Background stats collector."""
    for stat in container.stats(decode=True, stream=True):
        if stop_monitoring.is_set():
            break
        try:
            mem_usage = stat['memory_stats'].get('usage', 0) / (1024 * 1024)
            stats_history.append({'timestamp': time.time(), 'memory_mb': mem_usage})
        except Exception:  # Skip malformed stats samples
            pass
        time.sleep(0.5)
 def analyze_pool_logs(container):
    """Extract detailed pool stats from logs."""
    logs = container.logs().decode('utf-8')
    permanent = logs.count("🔥 Using permanent browser")
    hot = logs.count("♨️  Using hot pool browser")
    cold = logs.count("❄️  Using cold pool browser")
    new = logs.count("🆕 Creating new browser")
    promotions = logs.count("⬆️  Promoting to hot pool")
    return {
        'permanent': permanent,
        'hot': hot,
        'cold': cold,
        'new': new,
        'promotions': promotions,
        'total': permanent + hot + cold
    }
 async def crawl_with_viewport(client, url, viewport):
    """Single request with specific viewport."""
    payload = {
        "urls": ["https://httpbin.org/html"],
        "browser_config": {},
        "crawler_config": {}
    }
    # Add viewport if specified
    if viewport:
        payload["browser_config"] = {
            "type": "BrowserConfig",
            "params": {
                "viewport": {"type": "dict", "value": viewport},
                "headless": True,
                "text_mode": True,
                "extra_args": [
                    "--no-sandbox",
                    "--disable-dev-shm-usage",
                    "--disable-gpu",
                    "--disable-software-rasterizer",
                    "--disable-web-security",
                    "--allow-insecure-localhost",
                    "--ignore-certificate-errors"
                ]
            }
        }
    start = time.time()
    try:
        resp = await client.post(url, json=payload, timeout=60.0)
        elapsed = (time.time() - start) * 1000
        return {"success": resp.status_code == 200, "latency_ms": elapsed, "viewport": viewport}
    except Exception as e:
        return {"success": False, "error": str(e), "viewport": viewport}
 def start_container(client, image, name, port):
    """Start container."""
    try:
        old = client.containers.get(name)
        print(f"🧹 Stopping existing container...")
        old.stop()
        old.remove()
    except docker.errors.NotFound:
        pass
    print(f"🚀 Starting container...")
    container = client.containers.run(
        image, name=name, ports={f"{port}/tcp": port},
        detach=True, shm_size="1g", mem_limit="4g",
    )
    print(f"⏳ Waiting for health...")
    for _ in range(30):
        time.sleep(1)
        container.reload()
        if container.status == "running":
            try:
                import requests
                if requests.get(f"http://localhost:{port}/health", timeout=2).status_code == 200:
                    print(f"✅ Container healthy!")
                    return container
            except Exception:  # Not ready yet; keep polling without swallowing Ctrl-C
                pass
    raise TimeoutError("Container failed to start")
 async def main():
    print("="*60)
    print("TEST 5: Pool Stress - Mixed Configs")
    print("="*60)
    client = docker.from_env()
    container = None
    monitor_thread = None
    try:
        container = start_container(client, IMAGE, CONTAINER_NAME, PORT)
        print(f"\n⏳ Waiting for permanent browser init (3s)...")
        await asyncio.sleep(3)
        # Start monitoring
        stop_monitoring.clear()
        stats_history.clear()
        monitor_thread = Thread(target=monitor_stats, args=(container,), daemon=True)
        monitor_thread.start()
        await asyncio.sleep(1)
        baseline_mem = stats_history[-1]['memory_mb'] if stats_history else 0
        print(f"📏 Baseline: {baseline_mem:.1f} MB\n")
        url = f"http://localhost:{PORT}/crawl"
        print(f"Testing {len(VIEWPORT_CONFIGS)} different configs:")
        for i, vp in enumerate(VIEWPORT_CONFIGS):
            vp_str = "Default" if vp is None else f"{vp['width']}x{vp['height']}"
            print(f"  {i+1}. {vp_str}")
        print()
        # Run requests: repeat each config REQUESTS_PER_CONFIG times
        all_results = []
        config_sequence = []
        for _ in range(REQUESTS_PER_CONFIG):
            for viewport in VIEWPORT_CONFIGS:
                config_sequence.append(viewport)
        # Shuffle to mix configs
        random.shuffle(config_sequence)
        print(f"🔄 Running {len(config_sequence)} requests with mixed configs...")
        async with httpx.AsyncClient() as http_client:
            for i, viewport in enumerate(config_sequence):
                result = await crawl_with_viewport(http_client, url, viewport)
                all_results.append(result)
                if (i + 1) % 5 == 0:
                    vp_str = "default" if result['viewport'] is None else f"{result['viewport']['width']}x{result['viewport']['height']}"
                    status = "✓" if result.get('success') else "✗"
                    lat = f"{result.get('latency_ms', 0):.0f}ms" if 'latency_ms' in result else "error"
                    print(f"  [{i+1}/{len(config_sequence)}] {status} {vp_str} - {lat}")
        # Stop monitoring
        await asyncio.sleep(2)
        stop_monitoring.set()
        if monitor_thread:
            monitor_thread.join(timeout=2)
        # Analyze results
        pool_stats = analyze_pool_logs(container)
        successes = sum(1 for r in all_results if r.get("success"))
        success_rate = (successes / len(all_results)) * 100
        latencies = [r["latency_ms"] for r in all_results if "latency_ms" in r]
        avg_lat = sum(latencies) / len(latencies) if latencies else 0
        memory_samples = [s['memory_mb'] for s in stats_history]
        peak_mem = max(memory_samples) if memory_samples else 0
        final_mem = memory_samples[-1] if memory_samples else 0
        print(f"\n{'='*60}")
        print(f"RESULTS:")
        print(f"{'='*60}")
        print(f"  Requests:     {len(all_results)}")
        print(f"  Success Rate: {success_rate:.1f}% ({successes}/{len(all_results)})")
        print(f"  Avg Latency:  {avg_lat:.0f}ms")
        print(f"\n  Pool Statistics:")
        print(f"    🔥 Permanent: {pool_stats['permanent']}")
        print(f"    ♨️  Hot:       {pool_stats['hot']}")
        print(f"    ❄️  Cold:      {pool_stats['cold']}")
        print(f"    🆕 New:       {pool_stats['new']}")
        print(f"    ⬆️  Promotions: {pool_stats['promotions']}")
        print(f"    📊 Reuse:     {(pool_stats['total'] / len(all_results) * 100):.1f}%")
        print(f"\n  Memory:")
        print(f"    Baseline: {baseline_mem:.1f} MB")
        print(f"    Peak:     {peak_mem:.1f} MB")
        print(f"    Final:    {final_mem:.1f} MB")
        print(f"    Delta:    {final_mem - baseline_mem:+.1f} MB")
        print(f"{'='*60}")
        # Pass/Fail
        passed = True
        if success_rate < 99:
            print(f"❌ FAIL: Success rate {success_rate:.1f}% < 99%")
            passed = False
        # Should see promotions since we repeat each config 5 times
        if pool_stats['promotions'] < (len(VIEWPORT_CONFIGS) - 1):  # -1 for default
            print(f"⚠️  WARNING: Only {pool_stats['promotions']} promotions (expected ~{len(VIEWPORT_CONFIGS)-1})")
        # Should have created some browsers for different configs
        if pool_stats['new'] == 0:
            print(f"⚠️  NOTE: No new browsers created (all used default?)")
        if pool_stats['permanent'] == len(all_results):
            print(f"⚠️  NOTE: All requests used permanent browser (configs not varying enough?)")
        if final_mem - baseline_mem > 500:
            print(f"⚠️  WARNING: Memory grew {final_mem - baseline_mem:.1f} MB")
        if passed:
            print(f"✅ TEST PASSED")
            return 0
        else:
            return 1
    except Exception as e:
        print(f"\n❌ TEST ERROR: {e}")
        import traceback
        traceback.print_exc()
        return 1
    finally:
        stop_monitoring.set()
        if container:
            print(f"🛑 Stopping container...")
            container.stop()
            container.remove()
 if __name__ == "__main__":
    exit_code = asyncio.run(main())
    raise SystemExit(exit_code)
--- a/deploy/docker/tests/test_6_multi_endpoint.py
+++ b/deploy/docker/tests/test_6_multi_endpoint.py
@@ -0,0 +1,234 @@
 #!/usr/bin/env python3
 """
 Test 6: Multi-Endpoint Testing
 - Tests multiple endpoints together: /html, /screenshot, /pdf, /crawl
 - Validates each endpoint works correctly
 - Monitors success rates per endpoint
 """
 import asyncio
 import time
 import docker
 import httpx
 from threading import Thread, Event
 # Config
 IMAGE = "crawl4ai-local:latest"
 CONTAINER_NAME = "crawl4ai-test"
 PORT = 11235
 REQUESTS_PER_ENDPOINT = 10
 # Stats
 stats_history = []
 stop_monitoring = Event()
 def monitor_stats(container):
    """Background stats collector."""
    for stat in container.stats(decode=True, stream=True):
        if stop_monitoring.is_set():
            break
        try:
            mem_usage = stat['memory_stats'].get('usage', 0) / (1024 * 1024)
            stats_history.append({'timestamp': time.time(), 'memory_mb': mem_usage})
        except Exception:  # Skip malformed stats samples
            pass
        time.sleep(0.5)
 async def test_html(client, base_url, count):
    """Test /html endpoint."""
    url = f"{base_url}/html"
    results = []
    for _ in range(count):
        start = time.time()
        try:
            resp = await client.post(url, json={"url": "https://httpbin.org/html"}, timeout=30.0)
            elapsed = (time.time() - start) * 1000
            results.append({"success": resp.status_code == 200, "latency_ms": elapsed})
        except Exception as e:
            results.append({"success": False, "error": str(e)})
    return results
 async def test_screenshot(client, base_url, count):
    """Test /screenshot endpoint."""
    url = f"{base_url}/screenshot"
    results = []
    for _ in range(count):
        start = time.time()
        try:
            resp = await client.post(url, json={"url": "https://httpbin.org/html"}, timeout=30.0)
            elapsed = (time.time() - start) * 1000
            results.append({"success": resp.status_code == 200, "latency_ms": elapsed})
        except Exception as e:
            results.append({"success": False, "error": str(e)})
    return results
 async def test_pdf(client, base_url, count):
    """Test /pdf endpoint."""
    url = f"{base_url}/pdf"
    results = []
    for _ in range(count):
        start = time.time()
        try:
            resp = await client.post(url, json={"url": "https://httpbin.org/html"}, timeout=30.0)
            elapsed = (time.time() - start) * 1000
            results.append({"success": resp.status_code == 200, "latency_ms": elapsed})
        except Exception as e:
            results.append({"success": False, "error": str(e)})
    return results
 async def test_crawl(client, base_url, count):
    """Test /crawl endpoint."""
    url = f"{base_url}/crawl"
    results = []
    payload = {
        "urls": ["https://httpbin.org/html"],
        "browser_config": {},
        "crawler_config": {}
    }
    for _ in range(count):
        start = time.time()
        try:
            resp = await client.post(url, json=payload, timeout=30.0)
            elapsed = (time.time() - start) * 1000
            results.append({"success": resp.status_code == 200, "latency_ms": elapsed})
        except Exception as e:
            results.append({"success": False, "error": str(e)})
    return results
 def start_container(client, image, name, port):
    """Start container."""
    try:
        old = client.containers.get(name)
        print(f"🧹 Stopping existing container...")
        old.stop()
        old.remove()
    except docker.errors.NotFound:
        pass
    print(f"🚀 Starting container...")
    container = client.containers.run(
        image, name=name, ports={f"{port}/tcp": port},
        detach=True, shm_size="1g", mem_limit="4g",
    )
    print(f"⏳ Waiting for health...")
    for _ in range(30):
        time.sleep(1)
        container.reload()
        if container.status == "running":
            try:
                import requests
                if requests.get(f"http://localhost:{port}/health", timeout=2).status_code == 200:
                    print(f"✅ Container healthy!")
                    return container
            except Exception:  # server not accepting connections yet; retry
                pass
    raise TimeoutError("Container failed to start")
 async def main():
    print("="*60)
    print("TEST 6: Multi-Endpoint Testing")
    print("="*60)
    client = docker.from_env()
    container = None
    monitor_thread = None
    try:
        container = start_container(client, IMAGE, CONTAINER_NAME, PORT)
        print(f"\n⏳ Waiting for permanent browser init (3s)...")
        await asyncio.sleep(3)
        # Start monitoring
        stop_monitoring.clear()
        stats_history.clear()
        monitor_thread = Thread(target=monitor_stats, args=(container,), daemon=True)
        monitor_thread.start()
        await asyncio.sleep(1)
        baseline_mem = stats_history[-1]['memory_mb'] if stats_history else 0
        print(f"📏 Baseline: {baseline_mem:.1f} MB\n")
        base_url = f"http://localhost:{PORT}"
        # Test each endpoint
        endpoints = {
            "/html": test_html,
            "/screenshot": test_screenshot,
            "/pdf": test_pdf,
            "/crawl": test_crawl,
        }
        all_endpoint_stats = {}
        async with httpx.AsyncClient() as http_client:
            for endpoint_name, test_func in endpoints.items():
                print(f"🔄 Testing {endpoint_name} ({REQUESTS_PER_ENDPOINT} requests)...")
                results = await test_func(http_client, base_url, REQUESTS_PER_ENDPOINT)
                successes = sum(1 for r in results if r.get("success"))
                success_rate = (successes / len(results)) * 100
                latencies = [r["latency_ms"] for r in results if "latency_ms" in r]
                avg_lat = sum(latencies) / len(latencies) if latencies else 0
                all_endpoint_stats[endpoint_name] = {
                    'success_rate': success_rate,
                    'avg_latency': avg_lat,
                    'total': len(results),
                    'successes': successes
                }
                print(f"  ✓ Success: {success_rate:.1f}% ({successes}/{len(results)}), Avg: {avg_lat:.0f}ms")
        # Stop monitoring
        await asyncio.sleep(1)
        stop_monitoring.set()
        if monitor_thread:
            monitor_thread.join(timeout=2)
        # Final stats
        memory_samples = [s['memory_mb'] for s in stats_history]
        peak_mem = max(memory_samples) if memory_samples else 0
        final_mem = memory_samples[-1] if memory_samples else 0
        print(f"\n{'='*60}")
        print(f"RESULTS:")
        print(f"{'='*60}")
        for endpoint, stats in all_endpoint_stats.items():
            print(f"  {endpoint:12} Success: {stats['success_rate']:5.1f}%  Avg: {stats['avg_latency']:6.0f}ms")
        print(f"\n  Memory:")
        print(f"    Baseline: {baseline_mem:.1f} MB")
        print(f"    Peak:     {peak_mem:.1f} MB")
        print(f"    Final:    {final_mem:.1f} MB")
        print(f"    Delta:    {final_mem - baseline_mem:+.1f} MB")
        print(f"{'='*60}")
        # Pass/Fail
        passed = True
        for endpoint, stats in all_endpoint_stats.items():
            if stats['success_rate'] < 100:
                print(f"❌ FAIL: {endpoint} success rate {stats['success_rate']:.1f}% < 100%")
                passed = False
        if passed:
            print(f"✅ TEST PASSED")
            return 0
        else:
            return 1
    except Exception as e:
        print(f"\n❌ TEST ERROR: {e}")
        import traceback
        traceback.print_exc()
        return 1
    finally:
        stop_monitoring.set()
        if container:
            print(f"🛑 Stopping container...")
            container.stop()
            container.remove()
 if __name__ == "__main__":
    exit_code = asyncio.run(main())
    exit(exit_code)
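The endpoint tests above record one dict per request (`success`, plus `latency_ms` when the request completed), and `main()` reduces them to a success rate and a mean latency. A minimal sketch of that reduction as a standalone helper is shown below. The name `summarize_results` and the empty-list guard are assumptions made for illustration; they are not part of the test file.

```python
from typing import Any


def summarize_results(results: list[dict[str, Any]]) -> dict[str, float]:
    """Reduce per-request results to a success rate and mean latency.

    Hypothetical helper: mirrors the arithmetic in main(), but guards
    against an empty result list instead of dividing by zero.
    """
    total = len(results)
    successes = sum(1 for r in results if r.get("success"))
    # Requests that raised carry an "error" key and no "latency_ms".
    latencies = [r["latency_ms"] for r in results if "latency_ms" in r]
    return {
        "total": total,
        "successes": successes,
        "success_rate": (successes / total) * 100 if total else 0.0,
        "avg_latency": sum(latencies) / len(latencies) if latencies else 0.0,
    }


sample = [
    {"success": True, "latency_ms": 100.0},
    {"success": True, "latency_ms": 300.0},
    {"success": False, "error": "timeout"},
]
summary = summarize_results(sample)
print(f"{summary['success_rate']:.1f}% success, {summary['avg_latency']:.0f}ms avg")
```

Keeping the reduction in one function would let the four near-identical `test_*` coroutines share it, and it can be unit-tested without starting a container.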
--- a/deploy/docker/tests/test_7_cleanup.py
+++ b/deploy/docker/tests/test_7_cleanup.py
@@ -0,0 +1,199 @@
 #!/usr/bin/env python3
 """
 Test 7: Cleanup Verification (Janitor)
 - Creates load spike then goes idle
 - Verifies memory returns to near baseline
 - Tests janitor cleanup of idle browsers
 - Monitors memory recovery time
 """
 import asyncio
 import time
 import docker
 import httpx
 from threading import Thread, Event
 # Config
 IMAGE = "crawl4ai-local:latest"
 CONTAINER_NAME = "crawl4ai-test"
 PORT = 11235
 SPIKE_REQUESTS = 20  # Create some browsers
 IDLE_TIME = 90  # Wait 90s for janitor (runs every 60s)
 # Stats
 stats_history = []
 stop_monitoring = Event()
 def monitor_stats(container):
    """Background stats collector."""
    for stat in container.stats(decode=True, stream=True):
        if stop_monitoring.is_set():
            break
        try:
            mem_usage = stat['memory_stats'].get('usage', 0) / (1024 * 1024)
            stats_history.append({'timestamp': time.time(), 'memory_mb': mem_usage})
        except (KeyError, TypeError):  # stats frame without memory data
            pass
        time.sleep(1)  # Sample every 1s for this test
 def start_container(client, image, name, port):
    """Start container."""
    try:
        old = client.containers.get(name)
        print(f"🧹 Stopping existing container...")
        old.stop()
        old.remove()
    except docker.errors.NotFound:
        pass
    print(f"🚀 Starting container...")
    container = client.containers.run(
        image, name=name, ports={f"{port}/tcp": port},
        detach=True, shm_size="1g", mem_limit="4g",
    )
    print(f"⏳ Waiting for health...")
    for _ in range(30):
        time.sleep(1)
        container.reload()
        if container.status == "running":
            try:
                import requests
                if requests.get(f"http://localhost:{port}/health", timeout=2).status_code == 200:
                    print(f"✅ Container healthy!")
                    return container
            except Exception:  # server not accepting connections yet; retry
                pass
    raise TimeoutError("Container failed to start")
 async def main():
    print("="*60)
    print("TEST 7: Cleanup Verification (Janitor)")
    print("="*60)
    client = docker.from_env()
    container = None
    monitor_thread = None
    try:
        container = start_container(client, IMAGE, CONTAINER_NAME, PORT)
        print(f"\n⏳ Waiting for permanent browser init (3s)...")
        await asyncio.sleep(3)
        # Start monitoring
        stop_monitoring.clear()
        stats_history.clear()
        monitor_thread = Thread(target=monitor_stats, args=(container,), daemon=True)
        monitor_thread.start()
        await asyncio.sleep(2)
        baseline_mem = stats_history[-1]['memory_mb'] if stats_history else 0
        print(f"📏 Baseline: {baseline_mem:.1f} MB\n")
        # Create load spike with different configs to populate pool
        print(f"🔥 Creating load spike ({SPIKE_REQUESTS} requests with varied configs)...")
        url = f"http://localhost:{PORT}/crawl"
        viewports = [
            {"width": 1920, "height": 1080},
            {"width": 1024, "height": 768},
            {"width": 375, "height": 667},
        ]
        async with httpx.AsyncClient(timeout=60.0) as http_client:
            tasks = []
            for i in range(SPIKE_REQUESTS):
                vp = viewports[i % len(viewports)]
                payload = {
                    "urls": ["https://httpbin.org/html"],
                    "browser_config": {
                        "type": "BrowserConfig",
                        "params": {
                            "viewport": {"type": "dict", "value": vp},
                            "headless": True,
                            "text_mode": True,
                            "extra_args": [
                                "--no-sandbox", "--disable-dev-shm-usage",
                                "--disable-gpu", "--disable-software-rasterizer",
                                "--disable-web-security", "--allow-insecure-localhost",
                                "--ignore-certificate-errors"
                            ]
                        }
                    },
                    "crawler_config": {}
                }
                tasks.append(http_client.post(url, json=payload))
            results = await asyncio.gather(*tasks, return_exceptions=True)
            successes = sum(1 for r in results if hasattr(r, 'status_code') and r.status_code == 200)
            print(f"  ✓ Spike completed: {successes}/{len(results)} successful")
        # Measure peak
        await asyncio.sleep(2)
        peak_mem = max([s['memory_mb'] for s in stats_history]) if stats_history else baseline_mem
        print(f"  📊 Peak memory: {peak_mem:.1f} MB (+{peak_mem - baseline_mem:.1f} MB)")
        # Now go idle and wait for janitor
        print(f"\n⏸️  Going idle for {IDLE_TIME}s (janitor cleanup)...")
        print(f"  (Janitor runs every 60s, checking for idle browsers)")
        for elapsed in range(0, IDLE_TIME, 10):
            await asyncio.sleep(10)
            current_mem = stats_history[-1]['memory_mb'] if stats_history else 0
            print(f"  [{elapsed+10:3d}s] Memory: {current_mem:.1f} MB")
        # Stop monitoring
        stop_monitoring.set()
        if monitor_thread:
            monitor_thread.join(timeout=2)
        # Analyze memory recovery
        final_mem = stats_history[-1]['memory_mb'] if stats_history else 0
        recovery_mb = peak_mem - final_mem
        recovery_pct = (recovery_mb / (peak_mem - baseline_mem) * 100) if (peak_mem - baseline_mem) > 0 else 0
        print(f"\n{'='*60}")
        print(f"RESULTS:")
        print(f"{'='*60}")
        print(f"  Memory Journey:")
        print(f"    Baseline:  {baseline_mem:.1f} MB")
        print(f"    Peak:      {peak_mem:.1f} MB  (+{peak_mem - baseline_mem:.1f} MB)")
        print(f"    Final:     {final_mem:.1f} MB  (+{final_mem - baseline_mem:.1f} MB)")
        print(f"    Recovered: {recovery_mb:.1f} MB  ({recovery_pct:.1f}%)")
        print(f"{'='*60}")
        # Pass/Fail
        passed = True
        # Should have created some memory pressure
        if peak_mem - baseline_mem < 100:
            print(f"⚠️  WARNING: Peak increase only {peak_mem - baseline_mem:.1f} MB (expected more browsers)")
        # Should recover most memory (within 100MB of baseline)
        if final_mem - baseline_mem > 100:
            print(f"⚠️  WARNING: Memory didn't recover well (still +{final_mem - baseline_mem:.1f} MB above baseline)")
        else:
            print(f"✅ Good memory recovery!")
        # Baseline + 50MB tolerance
        if final_mem - baseline_mem < 50:
            print(f"✅ Excellent cleanup (within 50MB of baseline)")
        print(f"✅ TEST PASSED")
        return 0
    except Exception as e:
        print(f"\n❌ TEST ERROR: {e}")
        import traceback
        traceback.print_exc()
        return 1
    finally:
        stop_monitoring.set()
        if container:
            print(f"🛑 Stopping container...")
            container.stop()
            container.remove()
 if __name__ == "__main__":
    exit_code = asyncio.run(main())
    exit(exit_code)
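Test 7 reports how much of the spike's memory growth was released and warns when the final reading sits more than 100 MB above baseline, but it prints `TEST PASSED` either way. The sketch below isolates that arithmetic; `recovery_percent`, `memory_recovered`, and the `tolerance_mb` parameter are hypothetical names introduced here, not functions from the test file.

```python
def recovery_percent(baseline_mb: float, peak_mb: float, final_mb: float) -> float:
    """Share of the spike's memory growth that was released again, in percent."""
    growth = peak_mb - baseline_mb
    if growth <= 0:
        # No measurable spike, so there is nothing to recover.
        return 0.0
    return (peak_mb - final_mb) / growth * 100


def memory_recovered(baseline_mb: float, final_mb: float,
                     tolerance_mb: float = 100.0) -> bool:
    """True when the final reading is within tolerance_mb of the baseline."""
    return final_mb - baseline_mb <= tolerance_mb


pct = recovery_percent(baseline_mb=200.0, peak_mb=700.0, final_mb=250.0)
ok = memory_recovered(baseline_mb=200.0, final_mb=250.0)
print(f"Recovered {pct:.1f}% of the spike, within tolerance: {ok}")
```

If a hard failure is wanted, the result of `memory_recovered` could decide the script's exit code. The test as written treats poor recovery as a warning only.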
--- a/deploy/docker/tests/test_monitor_demo.py
+++ b/deploy/docker/tests/test_monitor_demo.py
@@ -0,0 +1,57 @@
 #!/usr/bin/env python3
 """Quick test to generate monitor dashboard activity"""
 import httpx
 import asyncio
 async def test_dashboard():
    async with httpx.AsyncClient(timeout=30.0) as client:
        print("📊 Generating dashboard activity...")
        # Test 1: Simple crawl
        print("\n1️⃣ Running simple crawl...")
        r1 = await client.post(
            "http://localhost:11235/crawl",
            json={"urls": ["https://httpbin.org/html"], "crawler_config": {}}
        )
        print(f"   Status: {r1.status_code}")
        # Test 2: Multiple URLs
        print("\n2️⃣ Running multi-URL crawl...")
        r2 = await client.post(
            "http://localhost:11235/crawl",
            json={
                "urls": [
                    "https://httpbin.org/html",
                    "https://httpbin.org/json"
                ],
                "crawler_config": {}
            }
        )
        print(f"   Status: {r2.status_code}")
        # Test 3: Check monitor health
        print("\n3️⃣ Checking monitor health...")
        r3 = await client.get("http://localhost:11235/monitor/health")
        health = r3.json()
        print(f"   Memory: {health['container']['memory_percent']}%")
        print(f"   Browsers: {health['pool']['permanent']['active']}")
        # Test 4: Check requests
        print("\n4️⃣ Checking request log...")
        r4 = await client.get("http://localhost:11235/monitor/requests")
        reqs = r4.json()
        print(f"   Active: {len(reqs['active'])}")
        print(f"   Completed: {len(reqs['completed'])}")
        # Test 5: Check endpoint stats
        print("\n5️⃣ Checking endpoint stats...")
        r5 = await client.get("http://localhost:11235/monitor/endpoints/stats")
        stats = r5.json()
        for endpoint, data in stats.items():
            print(f"   {endpoint}: {data['count']} requests, {data['avg_latency_ms']}ms avg")
        print("\n✅ Dashboard should now show activity!")
        print(f"\n🌐 Open: http://localhost:11235/dashboard")
 if __name__ == "__main__":
    asyncio.run(test_dashboard())
--- a/deploy/docker/utils.py
+++ b/deploy/docker/utils.py
@@ -179,3 +179,28 @@ def verify_email_domain(email: str) -> bool:
        return True if records else False
    except Exception as e:
        return False
 def get_container_memory_percent() -> float:
    """Get actual container memory usage vs limit (cgroup v1/v2 aware)."""
    try:
        # Try cgroup v2 first
        usage_path = Path("/sys/fs/cgroup/memory.current")
        limit_path = Path("/sys/fs/cgroup/memory.max")
        if not usage_path.exists():
            # Fall back to cgroup v1
            usage_path = Path("/sys/fs/cgroup/memory/memory.usage_in_bytes")
            limit_path = Path("/sys/fs/cgroup/memory/memory.limit_in_bytes")
        usage = int(usage_path.read_text())
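        limit_text = limit_path.read_text().strip()
        # Handle unlimited (v2: literal "max", v1: sentinel value > 1e18)
        if limit_text == "max" or int(limit_text) > 1e18:
            import psutil
            limit = psutil.virtual_memory().total
        else:
            limit = int(limit_text)
        return (usage / limit) * 100
    except Exception: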
        # Non-container or unsupported: fallback to host
        import psutil
        return psutil.virtual_memory().percent
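The limit file has two "unlimited" encodings: cgroup v2 writes the literal string `max`, while cgroup v1 reports a very large number (9223372036854771712 on typical 64-bit hosts). Below is a sketch of that parsing step as a pure function, so it can be tested without a container or `psutil`. `parse_cgroup_limit` and `memory_percent` are hypothetical names, and the host total is passed in rather than queried.

```python
def parse_cgroup_limit(raw: str, host_total_bytes: int) -> int:
    """Return the effective memory limit in bytes.

    Falls back to host_total_bytes when the cgroup reports no limit:
    the string "max" on cgroup v2, or a huge sentinel value on v1.
    """
    text = raw.strip()
    if text == "max":
        return host_total_bytes
    limit = int(text)
    if limit > 1e18:
        return host_total_bytes
    return limit


def memory_percent(usage_bytes: int, limit_raw: str, host_total_bytes: int) -> float:
    """Usage as a percentage of the effective limit."""
    limit = parse_cgroup_limit(limit_raw, host_total_bytes)
    return usage_bytes / limit * 100


GIB = 1024 ** 3
host_total = 16 * GIB  # assumed host RAM for the example

limited = memory_percent(1 * GIB, f"{4 * GIB}\n", host_total)
unlimited_v2 = memory_percent(1 * GIB, "max\n", host_total)
unlimited_v1 = memory_percent(1 * GIB, "9223372036854771712\n", host_total)
print(limited, unlimited_v2, unlimited_v1)
```

Separating the parsing from the file reads keeps the cgroup path selection and the `psutil` fallback in one place, with the string handling covered by plain unit tests.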
--- a/docs/examples/docker_client_hooks_example.py
+++ b/docs/examples/docker_client_hooks_example.py
@@ -1,522 +0,0 @@
 #!/usr/bin/env python3
 """
 Comprehensive hooks examples using Docker Client with function objects.
 This approach is recommended because:
 - Write hooks as regular Python functions
 - Full IDE support (autocomplete, type checking)
 - Automatic conversion to API format
 - Reusable and testable code
 - Clean, readable syntax
 """
 import asyncio
 from crawl4ai import Crawl4aiDockerClient
 # API_BASE_URL = "http://localhost:11235"
 API_BASE_URL = "http://localhost:11234"
 # ============================================================================
 # Hook Function Definitions
 # ============================================================================
 # --- All Hooks Demo ---
 async def browser_created_hook(browser, **kwargs):
    """Called after browser is created"""
    print("[HOOK] Browser created and ready")
    return browser
 async def page_context_hook(page, context, **kwargs):
    """Setup page environment"""
    print("[HOOK] Setting up page environment")
    # Set viewport
    await page.set_viewport_size({"width": 1920, "height": 1080})
    # Add cookies
    await context.add_cookies([{
        "name": "test_session",
        "value": "abc123xyz",
        "domain": ".httpbin.org",
        "path": "/"
    }])
    # Block resources
    await context.route("**/*.{png,jpg,jpeg,gif}", lambda route: route.abort())
    await context.route("**/analytics/*", lambda route: route.abort())
    print("[HOOK] Environment configured")
    return page
 async def user_agent_hook(page, context, user_agent, **kwargs):
    """Called when user agent is updated"""
    print(f"[HOOK] User agent: {user_agent[:50]}...")
    return page
 async def before_goto_hook(page, context, url, **kwargs):
    """Called before navigating to URL"""
    print(f"[HOOK] Navigating to: {url}")
    await page.set_extra_http_headers({
        "X-Custom-Header": "crawl4ai-test",
        "Accept-Language": "en-US"
    })
    return page
 async def after_goto_hook(page, context, url, response, **kwargs):
    """Called after page loads"""
    print(f"[HOOK] Page loaded: {url}")
    await page.wait_for_timeout(1000)
    try:
        await page.wait_for_selector("body", timeout=2000)
        print("[HOOK] Body element ready")
    except Exception:
        print("[HOOK] Timeout, continuing")
    return page
 async def execution_started_hook(page, context, **kwargs):
    """Called when custom JS execution starts"""
    print("[HOOK] JS execution started")
    await page.evaluate("console.log('[HOOK] Custom JS');")
    return page
 async def before_retrieve_hook(page, context, **kwargs):
    """Called before retrieving HTML"""
    print("[HOOK] Preparing HTML retrieval")
    # Scroll for lazy content
    await page.evaluate("window.scrollTo(0, document.body.scrollHeight);")
    await page.wait_for_timeout(500)
    await page.evaluate("window.scrollTo(0, 0);")
    print("[HOOK] Scrolling complete")
    return page
 async def before_return_hook(page, context, html, **kwargs):
    """Called before returning HTML"""
    print(f"[HOOK] HTML ready: {len(html)} chars")
    metrics = await page.evaluate('''() => ({
        images: document.images.length,
        links: document.links.length,
        scripts: document.scripts.length
    })''')
    print(f"[HOOK] Metrics - Images: {metrics['images']}, Links: {metrics['links']}")
    return page
 # --- Authentication Hooks ---
 async def auth_context_hook(page, context, **kwargs):
    """Setup authentication context"""
    print("[HOOK] Setting up authentication")
    # Add auth cookies
    await context.add_cookies([{
        "name": "auth_token",
        "value": "fake_jwt_token",
        "domain": ".httpbin.org",
        "path": "/",
        "httpOnly": True
    }])
    # Set localStorage
    await page.evaluate('''
        localStorage.setItem('user_id', '12345');
        localStorage.setItem('auth_time', new Date().toISOString());
    ''')
    print("[HOOK] Auth context ready")
    return page
 async def auth_headers_hook(page, context, url, **kwargs):
    """Add authentication headers"""
    print(f"[HOOK] Adding auth headers for {url}")
    import base64
    credentials = base64.b64encode(b"user:passwd").decode('ascii')
    await page.set_extra_http_headers({
        'Authorization': f'Basic {credentials}',
        'X-API-Key': 'test-key-123'
    })
    return page
 # --- Performance Optimization Hooks ---
 async def performance_hook(page, context, **kwargs):
    """Optimize page for performance"""
    print("[HOOK] Optimizing for performance")
    # Block resource-heavy content
    await context.route("**/*.{png,jpg,jpeg,gif,webp,svg}", lambda r: r.abort())
    await context.route("**/*.{woff,woff2,ttf}", lambda r: r.abort())
    await context.route("**/*.{mp4,webm,ogg}", lambda r: r.abort())
    await context.route("**/googletagmanager.com/*", lambda r: r.abort())
    await context.route("**/google-analytics.com/*", lambda r: r.abort())
    await context.route("**/facebook.com/*", lambda r: r.abort())
    # Disable animations
    await page.add_style_tag(content='''
        *, *::before, *::after {
            animation-duration: 0s !important;
            transition-duration: 0s !important;
        }
    ''')
    print("[HOOK] Optimizations applied")
    return page
 async def cleanup_hook(page, context, **kwargs):
    """Clean page before extraction"""
    print("[HOOK] Cleaning page")
    await page.evaluate('''() => {
        const selectors = [
            '.ad', '.ads', '.advertisement',
            '.popup', '.modal', '.overlay',
            '.cookie-banner', '.newsletter'
        ];
        selectors.forEach(sel => {
            document.querySelectorAll(sel).forEach(el => el.remove());
        });
        document.querySelectorAll('script, style').forEach(el => el.remove());
    }''')
    print("[HOOK] Page cleaned")
    return page
 # --- Content Extraction Hooks ---
 async def wait_dynamic_content_hook(page, context, url, response, **kwargs):
    """Wait for dynamic content to load"""
    print(f"[HOOK] Waiting for dynamic content on {url}")
    await page.wait_for_timeout(2000)
    # Click "Load More" if exists
    try:
        load_more = await page.query_selector('[class*="load-more"], button:has-text("Load More")')
        if load_more:
            await load_more.click()
            await page.wait_for_timeout(1000)
            print("[HOOK] Clicked 'Load More'")
    except Exception:
        pass
    return page
 async def extract_metadata_hook(page, context, **kwargs):
    """Extract page metadata"""
    print("[HOOK] Extracting metadata")
    metadata = await page.evaluate('''() => {
        const getMeta = (name) => {
            const el = document.querySelector(`meta[name="${name}"], meta[property="${name}"]`);
            return el ? el.getAttribute('content') : null;
        };
        return {
            title: document.title,
            description: getMeta('description'),
            author: getMeta('author'),
            keywords: getMeta('keywords'),
        };
    }''')
    print(f"[HOOK] Metadata: {metadata}")
    # Infinite scroll
    for i in range(3):
        await page.evaluate("window.scrollTo(0, document.body.scrollHeight);")
        await page.wait_for_timeout(1000)
        print(f"[HOOK] Scroll {i+1}/3")
    return page
 # --- Multi-URL Hooks ---
 async def url_specific_hook(page, context, url, **kwargs):
    """Apply URL-specific logic"""
    print(f"[HOOK] Processing URL: {url}")
    # URL-specific headers
    if 'html' in url:
        await page.set_extra_http_headers({"X-Type": "HTML"})
    elif 'json' in url:
        await page.set_extra_http_headers({"X-Type": "JSON"})
    return page
 async def track_progress_hook(page, context, url, response, **kwargs):
    """Track crawl progress"""
    status = response.status if response else 'unknown'
    print(f"[HOOK] Loaded {url} - Status: {status}")
    return page
 # ============================================================================
 # Test Functions
 # ============================================================================
 async def test_all_hooks_comprehensive():
    """Test all 8 hook types"""
    print("=" * 70)
    print("Test 1: All Hooks Comprehensive Demo (Docker Client)")
    print("=" * 70)
    async with Crawl4aiDockerClient(base_url=API_BASE_URL, verbose=False) as client:
        print("\nCrawling with all 8 hooks...")
        # Define hooks with function objects
        hooks = {
            "on_browser_created": browser_created_hook,
            "on_page_context_created": page_context_hook,
            "on_user_agent_updated": user_agent_hook,
            "before_goto": before_goto_hook,
            "after_goto": after_goto_hook,
            "on_execution_started": execution_started_hook,
            "before_retrieve_html": before_retrieve_hook,
            "before_return_html": before_return_hook
        }
        result = await client.crawl(
            ["https://httpbin.org/html"],
            hooks=hooks,
            hooks_timeout=30
        )
        print("\n✅ Success!")
        print(f"   URL: {result.url}")
        print(f"   Success: {result.success}")
        print(f"   HTML: {len(result.html)} chars")
 async def test_authentication_workflow():
    """Test authentication with hooks"""
    print("\n" + "=" * 70)
    print("Test 2: Authentication Workflow (Docker Client)")
    print("=" * 70)
    async with Crawl4aiDockerClient(base_url=API_BASE_URL, verbose=False) as client:
        print("\nTesting authentication...")
        hooks = {
            "on_page_context_created": auth_context_hook,
            "before_goto": auth_headers_hook
        }
        result = await client.crawl(
            ["https://httpbin.org/basic-auth/user/passwd"],
            hooks=hooks,
            hooks_timeout=15
        )
        print("\n✅ Authentication completed")
        if result.success:
            if '"authenticated"' in result.html and 'true' in result.html:
                print("   ✅ Basic auth successful!")
            else:
                print("   ⚠️ Auth status unclear")
        else:
            print(f"   ❌ Failed: {result.error_message}")
 async def test_performance_optimization():
    """Test performance optimization"""
    print("\n" + "=" * 70)
    print("Test 3: Performance Optimization (Docker Client)")
    print("=" * 70)
    async with Crawl4aiDockerClient(base_url=API_BASE_URL, verbose=False) as client:
        print("\nTesting performance hooks...")
        hooks = {
            "on_page_context_created": performance_hook,
            "before_retrieve_html": cleanup_hook
        }
        result = await client.crawl(
            ["https://httpbin.org/html"],
            hooks=hooks,
            hooks_timeout=10
        )
        print("\n✅ Optimization completed")
        print(f"   HTML size: {len(result.html):,} chars")
        print("   Resources blocked, ads removed")
 async def test_content_extraction():
    """Test content extraction"""
    print("\n" + "=" * 70)
    print("Test 4: Content Extraction (Docker Client)")
    print("=" * 70)
    async with Crawl4aiDockerClient(base_url=API_BASE_URL, verbose=False) as client:
        print("\nTesting extraction hooks...")
        hooks = {
            "after_goto": wait_dynamic_content_hook,
            "before_retrieve_html": extract_metadata_hook
        }
        result = await client.crawl(
            ["https://www.kidocode.com/"],
            hooks=hooks,
            hooks_timeout=20
        )
        print("\n✅ Extraction completed")
        print(f"   URL: {result.url}")
        print(f"   Success: {result.success}")
        print(f"   Metadata: {result.metadata}")
 async def test_multi_url_crawl():
    """Test hooks with multiple URLs"""
    print("\n" + "=" * 70)
    print("Test 5: Multi-URL Crawl (Docker Client)")
    print("=" * 70)
    async with Crawl4aiDockerClient(base_url=API_BASE_URL, verbose=False) as client:
        print("\nCrawling multiple URLs...")
        hooks = {
            "before_goto": url_specific_hook,
            "after_goto": track_progress_hook
        }
        results = await client.crawl(
            [
                "https://httpbin.org/html",
                "https://httpbin.org/json",
                "https://httpbin.org/xml"
            ],
            hooks=hooks,
            hooks_timeout=15
        )
        print("\n✅ Multi-URL crawl completed")
        print(f"\n   Crawled {len(results)} URLs:")
        for i, result in enumerate(results, 1):
            status = "✅" if result.success else "❌"
            print(f"   {status} {i}. {result.url}")
 async def test_reusable_hook_library():
    """Test using reusable hook library"""
    print("\n" + "=" * 70)
    print("Test 6: Reusable Hook Library (Docker Client)")
    print("=" * 70)
    # Create a library of reusable hooks
    class HookLibrary:
        @staticmethod
        async def block_images(page, context, **kwargs):
            """Block all images"""
            await context.route("**/*.{png,jpg,jpeg,gif}", lambda r: r.abort())
            print("[LIBRARY] Images blocked")
            return page
        @staticmethod
        async def block_analytics(page, context, **kwargs):
            """Block analytics"""
            await context.route("**/analytics/*", lambda r: r.abort())
            await context.route("**/google-analytics.com/*", lambda r: r.abort())
            print("[LIBRARY] Analytics blocked")
            return page
        @staticmethod
        async def scroll_infinite(page, context, **kwargs):
            """Handle infinite scroll"""
            for i in range(5):
                prev = await page.evaluate("document.body.scrollHeight")
                await page.evaluate("window.scrollTo(0, document.body.scrollHeight);")
                await page.wait_for_timeout(1000)
                curr = await page.evaluate("document.body.scrollHeight")
                if curr == prev:
                    break
            print("[LIBRARY] Infinite scroll complete")
            return page
    async with Crawl4aiDockerClient(base_url=API_BASE_URL, verbose=False) as client:
        print("\nUsing hook library...")
        hooks = {
            "on_page_context_created": HookLibrary.block_images,
            "before_retrieve_html": HookLibrary.scroll_infinite
        }
        result = await client.crawl(
            ["https://www.kidocode.com/"],
            hooks=hooks,
            hooks_timeout=20
        )
        print("\n✅ Library hooks completed")
        print(f"   Success: {result.success}")
 # ============================================================================
 # Main
 # ============================================================================
 async def main():
    """Run all Docker client hook examples"""
    print("🔧 Crawl4AI Docker Client - Hooks Examples (Function-Based)")
    print("Using Python function objects with automatic conversion")
    print("=" * 70)
    tests = [
        ("All Hooks Demo", test_all_hooks_comprehensive),
        ("Authentication", test_authentication_workflow),
        ("Performance", test_performance_optimization),
        ("Extraction", test_content_extraction),
        ("Multi-URL", test_multi_url_crawl),
        ("Hook Library", test_reusable_hook_library)
    ]
    for i, (name, test_func) in enumerate(tests, 1):
        try:
            await test_func()
            print(f"\n✅ Test {i}/{len(tests)}: {name} completed\n")
        except Exception as e:
            print(f"\n❌ Test {i}/{len(tests)}: {name} failed: {e}\n")
            import traceback
            traceback.print_exc()
    print("=" * 70)
    print("🎉 All Docker client hook examples completed!")
    print("\n💡 Key Benefits of Function-Based Hooks:")
    print("   • Write as regular Python functions")
    print("   • Full IDE support (autocomplete, types)")
    print("   • Automatic conversion to API format")
    print("   • Reusable across projects")
    print("   • Clean, readable code")
    print("   • Easy to test and debug")
    print("=" * 70)
 if __name__ == "__main__":
    asyncio.run(main())
--- a/docs/md_v2/assets/crawl4ai-skill.zip
+++ b/docs/md_v2/assets/crawl4ai-skill.zip
--- a/docs/md_v2/complete-sdk-reference.md
+++ b/docs/md_v2/complete-sdk-reference.md
--- a/docs/md_v2/core/docker-deployment.md
+++ b/docs/md_v2/core/docker-deployment.md
--- a/docs/md_v2/index.md
+++ b/docs/md_v2/index.md
@@ -59,27 +59,6 @@ Crawl4AI is the #1 trending GitHub repository, actively maintained by a vibrant
 > **Note**: If you're looking for the old documentation, you can access it [here](https://old.docs.crawl4ai.com).
 ## 🆕 AI Assistant Skill Now Available!
 <div style="background: linear-gradient(135deg, #667eea 0%, #764ba2 100%); padding: 20px; border-radius: 10px; margin: 20px 0; box-shadow: 0 4px 6px rgba(0,0,0,0.1);">
  <h3 style="color: white; margin: 0 0 10px 0;">🤖 Crawl4AI Skill for Claude & AI Assistants</h3>
  <p style="color: white; margin: 10px 0;">Supercharge your AI coding assistant with complete Crawl4AI knowledge! Download our comprehensive skill package that includes:</p>
  <ul style="color: white; margin: 10px 0;">
    <li>📚 Complete SDK reference (23K+ words)</li>
    <li>🚀 Ready-to-use extraction scripts</li>
    <li>⚡ Schema generation for efficient scraping</li>
    <li>🔧 Version 0.7.4 compatible</li>
  </ul>
  <div style="text-align: center; margin-top: 15px;">
    <a href="assets/crawl4ai-skill.zip" download style="background: white; color: #667eea; padding: 12px 30px; border-radius: 5px; text-decoration: none; font-weight: bold; display: inline-block; transition: transform 0.2s;">
      📦 Download Skill Package
    </a>
  </div>
  <p style="color: white; margin: 15px 0 0 0; font-size: 0.9em; text-align: center;">
    Works with Claude, Cursor, Windsurf, and other AI coding assistants. Import the .zip file into your AI assistant's skill/knowledge system.
  </p>
 </div>
 ## 🎯 New: Adaptive Web Crawling
 Crawl4AI now features intelligent adaptive crawling that knows when to stop! Using advanced information foraging algorithms, it determines when sufficient information has been gathered to answer your query.
--- a/docs/md_v2/marketplace/admin/admin.js
+++ b/docs/md_v2/marketplace/admin/admin.js
@@ -529,19 +529,8 @@ class AdminDashboard {
                    </label>
                </div>
                <div class="form-group full-width">
-                    <label>Long Description (Markdown - Overview tab)</label>
+                    <label>Integration Guide</label>
-                    <textarea id="form-long-description" rows="10" placeholder="Enter detailed description with markdown formatting...">${app?.long_description || ''}</textarea>
+                    <textarea id="form-integration" rows="10">${app?.integration_guide || ''}</textarea>
                    <small>Markdown support: **bold**, *italic*, [links](url), # headers, code blocks, lists</small>
                </div>
                <div class="form-group full-width">
                    <label>Integration Guide (Markdown - Integration tab)</label>
                    <textarea id="form-integration" rows="20" placeholder="Enter integration guide with installation, examples, and code snippets using markdown...">${app?.integration_guide || ''}</textarea>
                    <small>Single markdown field with installation, examples, and complete guide. Code blocks get auto copy buttons.</small>
                </div>
                <div class="form-group full-width">
                    <label>Documentation (Markdown - Documentation tab)</label>
                    <textarea id="form-documentation" rows="20" placeholder="Enter documentation with API reference, examples, and best practices using markdown...">${app?.documentation || ''}</textarea>
                    <small>Full documentation with API reference, examples, best practices, etc.</small>
                </div>
            </div>
        `;
@@ -723,9 +712,7 @@ class AdminDashboard {
            data.contact_email = document.getElementById('form-email').value;
            data.featured = document.getElementById('form-featured').checked ? 1 : 0;
            data.sponsored = document.getElementById('form-sponsored').checked ? 1 : 0;
            data.long_description = document.getElementById('form-long-description').value;
            data.integration_guide = document.getElementById('form-integration').value;
            data.documentation = document.getElementById('form-documentation').value;
        } else if (type === 'articles') {
            data.title = document.getElementById('form-title').value;
            data.slug = this.generateSlug(data.title);
--- a/docs/md_v2/marketplace/app-detail.css
+++ b/docs/md_v2/marketplace/app-detail.css
@@ -510,31 +510,6 @@
    line-height: 1.5;
 }
 /* Markdown rendered code blocks */
 .integration-content pre,
 .docs-content pre {
    background: var(--bg-dark);
    border: 1px solid var(--border-color);
    margin: 1rem 0;
    padding: 1rem;
    padding-top: 2.5rem; /* Space for copy button */
    overflow-x: auto;
    position: relative;
    max-height: none; /* Remove any height restrictions */
    height: auto; /* Allow content to expand */
 }
 .integration-content pre code,
 .docs-content pre code {
    background: transparent;
    padding: 0;
    color: var(--text-secondary);
    font-size: 0.875rem;
    line-height: 1.5;
    white-space: pre; /* Preserve whitespace and line breaks */
    display: block;
 }
 /* Feature Grid */
 .feature-grid {
    display: grid;
--- a/docs/md_v2/marketplace/app-detail.html
+++ b/docs/md_v2/marketplace/app-detail.html
@@ -80,7 +80,20 @@
                <section id="overview-tab" class="tab-content active">
                    <div class="overview-columns">
                        <div class="overview-main">
                            <h2>Overview</h2>
                            <div id="app-overview">Overview content goes here.</div>
                            <h3>Key Features</h3>
                            <ul id="app-features" class="features-list">
                                <li>Feature 1</li>
                                <li>Feature 2</li>
                                <li>Feature 3</li>
                            </ul>
                            <h3>Use Cases</h3>
                            <div id="app-use-cases" class="use-cases">
                                <p>Describe how this app can help your workflow.</p>
                            </div>
                        </div>
                        <aside class="sidebar">
@@ -129,14 +142,33 @@
                </section>
                <section id="integration-tab" class="tab-content">
-                    <div class="integration-content" id="app-integration">
+                    <div class="integration-content">
-                        <!-- Integration guide markdown content will be rendered here -->
+                        <h2>Integration Guide</h2>
                        <h3>Installation</h3>
                        <div class="code-block">
                            <pre><code id="install-code"># Installation instructions will appear here</code></pre>
                        </div>
                        <h3>Basic Usage</h3>
                        <div class="code-block">
                            <pre><code id="usage-code"># Usage example will appear here</code></pre>
                        </div>
                        <h3>Complete Integration Example</h3>
                        <div class="code-block">
                            <button class="copy-btn" id="copy-integration">Copy</button>
                            <pre><code id="integration-code"># Complete integration guide will appear here</code></pre>
                        </div>
                    </div>
                </section>
                <section id="docs-tab" class="tab-content">
-                    <div class="docs-content" id="app-docs">
+                    <div class="docs-content">
-                        <!-- Documentation markdown content will be rendered here -->
+                        <h2>Documentation</h2>
                        <div id="app-docs" class="doc-sections">
                            <p>Documentation coming soon.</p>
                        </div>
                    </div>
                </section>
--- a/docs/md_v2/marketplace/app-detail.js
+++ b/docs/md_v2/marketplace/app-detail.js
@@ -123,132 +123,144 @@ class AppDetailPage {
        document.getElementById('sidebar-pricing').textContent = this.appData.pricing || 'Free';
        document.getElementById('sidebar-contact').textContent = this.appData.contact_email || 'contact@example.com';
-        // Render tab contents from database fields
+        // Integration guide
-        this.renderTabContents();
+        this.renderIntegrationGuide();
    }
-    renderTabContents() {
+    renderIntegrationGuide() {
-        // Overview tab - use long_description from database
+        // Installation code
-        const overviewDiv = document.getElementById('app-overview');
+        const installCode = document.getElementById('install-code');
-        if (overviewDiv) {
+        if (installCode) {
-            if (this.appData.long_description) {
+            if (this.appData.type === 'Open Source' && this.appData.github_url) {
-                overviewDiv.innerHTML = this.renderMarkdown(this.appData.long_description);
+                installCode.textContent = `# Clone from GitHub
-            } else {
+git clone ${this.appData.github_url}
-                overviewDiv.innerHTML = `<p>${this.appData.description || 'No overview available.'}</p>`;
+
 # Install dependencies
 pip install -r requirements.txt`;
            } else if (this.appData.name.toLowerCase().includes('api')) {
                installCode.textContent = `# Install via pip
 pip install ${this.appData.slug}
 # Or install from source
 pip install git+${this.appData.github_url || 'https://github.com/example/repo'}`;
            }
        }
-        // Integration tab - use integration_guide field from database
+        // Usage code - customize based on category
-        const integrationDiv = document.getElementById('app-integration');
+        const usageCode = document.getElementById('usage-code');
-        if (integrationDiv) {
+        if (usageCode) {
-            if (this.appData.integration_guide) {
+            if (this.appData.category === 'Browser Automation') {
-                integrationDiv.innerHTML = this.renderMarkdown(this.appData.integration_guide);
+                usageCode.textContent = `from crawl4ai import AsyncWebCrawler
-                // Add copy buttons to all code blocks
+from ${this.appData.slug.replace(/-/g, '_')} import ${this.appData.name.replace(/\s+/g, '')}
-                this.addCopyButtonsToCodeBlocks(integrationDiv);
+
-            } else {
+async def main():
-                integrationDiv.innerHTML = '<p>Integration guide not yet available. Please check the official website for details.</p>';
+    # Initialize ${this.appData.name}
    automation = ${this.appData.name.replace(/\s+/g, '')}()
    async with AsyncWebCrawler() as crawler:
        result = await crawler.arun(
            url="https://example.com",
            browser_config=automation.config,
            wait_for="css:body"
        )
        print(result.markdown)`;
        } else if (this.appData.category === 'Proxy Services') {
            usageCode.textContent = `from crawl4ai import AsyncWebCrawler
 import ${this.appData.slug.replace(/-/g, '_')}
 # Configure proxy
 proxy_config = {
    "server": "${this.appData.website_url || 'https://proxy.example.com'}",
    "username": "your_username",
    "password": "your_password"
 }
 async with AsyncWebCrawler(proxy=proxy_config) as crawler:
    result = await crawler.arun(
        url="https://example.com",
        bypass_cache=True
    )
    print(result.status_code)`;
        } else if (this.appData.category === 'LLM Integration') {
            usageCode.textContent = `from crawl4ai import AsyncWebCrawler
 from crawl4ai.extraction_strategy import LLMExtractionStrategy
 # Configure LLM extraction
 strategy = LLMExtractionStrategy(
    provider="${this.appData.name.toLowerCase().includes('gpt') ? 'openai' : 'anthropic'}",
    api_key="your-api-key",
    model="${this.appData.name.toLowerCase().includes('gpt') ? 'gpt-4' : 'claude-3'}",
    instruction="Extract structured data"
 )
 async with AsyncWebCrawler() as crawler:
    result = await crawler.arun(
        url="https://example.com",
        extraction_strategy=strategy
    )
    print(result.extracted_content)`;
            }
        }
-        // Documentation tab - use documentation field from database
+        // Integration example
-        const docsDiv = document.getElementById('app-docs');
+        const integrationCode = document.getElementById('integration-code');
-        if (docsDiv) {
+        if (integrationCode) {
-            if (this.appData.documentation) {
+            integrationCode.textContent = this.appData.integration_guide ||
-                docsDiv.innerHTML = this.renderMarkdown(this.appData.documentation);
+`# Complete ${this.appData.name} Integration Example
-                // Add copy buttons to all code blocks
+
-                this.addCopyButtonsToCodeBlocks(docsDiv);
+from crawl4ai import AsyncWebCrawler
-            } else {
+from crawl4ai.extraction_strategy import JsonCssExtractionStrategy
-                docsDiv.innerHTML = '<p>Documentation coming soon.</p>';
+import json
-            }
+
-        }
+async def crawl_with_${this.appData.slug.replace(/-/g, '_')}():
    """
    Complete example showing how to use ${this.appData.name}
    with Crawl4AI for production web scraping
    """
    # Define extraction schema
    schema = {
        "name": "ProductList",
        "baseSelector": "div.product",
        "fields": [
            {"name": "title", "selector": "h2", "type": "text"},
            {"name": "price", "selector": ".price", "type": "text"},
            {"name": "image", "selector": "img", "type": "attribute", "attribute": "src"},
            {"name": "link", "selector": "a", "type": "attribute", "attribute": "href"}
        ]
    }
-    addCopyButtonsToCodeBlocks(container) {
+    # Initialize crawler with ${this.appData.name}
-        // Find all code blocks and add copy buttons
+    async with AsyncWebCrawler(
-        const codeBlocks = container.querySelectorAll('pre code');
+        browser_type="chromium",
-        codeBlocks.forEach(codeBlock => {
+        headless=True,
-            const pre = codeBlock.parentElement;
+        verbose=True
    ) as crawler:
-            // Skip if already has a copy button
+        # Crawl with extraction
-            if (pre.querySelector('.copy-btn')) return;
+        result = await crawler.arun(
            url="https://example.com/products",
            extraction_strategy=JsonCssExtractionStrategy(schema),
            cache_mode="bypass",
            wait_for="css:.product",
            screenshot=True
        )
-            // Create copy button
+        # Process results
-            const copyBtn = document.createElement('button');
+        if result.success:
-            copyBtn.className = 'copy-btn';
+            products = json.loads(result.extracted_content)
-            copyBtn.textContent = 'Copy';
+            print(f"Found {len(products)} products")
            copyBtn.onclick = () => {
                navigator.clipboard.writeText(codeBlock.textContent).then(() => {
                    copyBtn.textContent = '✓ Copied!';
                    setTimeout(() => {
                        copyBtn.textContent = 'Copy';
                    }, 2000);
                });
            };
-            // Add button to pre element
+            for product in products[:5]:
-            pre.style.position = 'relative';
+                print(f"- {product['title']}: {product['price']}")
-            pre.insertBefore(copyBtn, codeBlock);
+
-        });
+        return products
 # Run the crawler
 if __name__ == "__main__":
    import asyncio
    asyncio.run(crawl_with_${this.appData.slug.replace(/-/g, '_')}())`;
        }
    renderMarkdown(text) {
        if (!text) return '';
        // Store code blocks temporarily to protect them from processing
        const codeBlocks = [];
        let processed = text.replace(/```(\w+)?\n([\s\S]*?)```/g, (match, lang, code) => {
            const placeholder = `___CODE_BLOCK_${codeBlocks.length}___`;
            codeBlocks.push(`<pre><code class="language-${lang || ''}">${this.escapeHtml(code)}</code></pre>`);
            return placeholder;
        });
        // Store inline code temporarily
        const inlineCodes = [];
        processed = processed.replace(/`([^`]+)`/g, (match, code) => {
            const placeholder = `___INLINE_CODE_${inlineCodes.length}___`;
            inlineCodes.push(`<code>${this.escapeHtml(code)}</code>`);
            return placeholder;
        });
        // Now process the rest of the markdown
        processed = processed
            // Headers
            .replace(/^### (.*$)/gim, '<h3>$1</h3>')
            .replace(/^## (.*$)/gim, '<h2>$1</h2>')
            .replace(/^# (.*$)/gim, '<h1>$1</h1>')
            // Bold
            .replace(/\*\*(.*?)\*\*/g, '<strong>$1</strong>')
            // Italic
            .replace(/\*(.*?)\*/g, '<em>$1</em>')
            // Links
            .replace(/\[([^\]]+)\]\(([^)]+)\)/g, '<a href="$2" target="_blank">$1</a>')
            // Line breaks
            .replace(/\n\n/g, '</p><p>')
            .replace(/\n/g, '<br>')
            // Lists
            .replace(/^\* (.*)$/gim, '<li>$1</li>')
            .replace(/^- (.*)$/gim, '<li>$1</li>')
            // Wrap in paragraphs
            .replace(/^(?!<(?:h[1-6]|p|pre|ul|ol|li)\b)/gim, '<p>')
            .replace(/(?<![>])$/gim, '</p>');
        // Restore inline code
        inlineCodes.forEach((code, i) => {
            processed = processed.replace(`___INLINE_CODE_${i}___`, code);
        });
        // Restore code blocks
        codeBlocks.forEach((block, i) => {
            processed = processed.replace(`___CODE_BLOCK_${i}___`, block);
        });
        return processed;
    }
    escapeHtml(text) {
        const div = document.createElement('div');
        div.textContent = text;
        return div.innerHTML;
    }
    formatNumber(num) {
@@ -277,6 +289,33 @@ class AppDetailPage {
                document.getElementById(`${tabName}-tab`).classList.add('active');
            });
        });
        // Copy integration code
        document.getElementById('copy-integration').addEventListener('click', () => {
            const code = document.getElementById('integration-code').textContent;
            navigator.clipboard.writeText(code).then(() => {
                const btn = document.getElementById('copy-integration');
                const originalText = btn.innerHTML;
                btn.innerHTML = '<span>✓</span> Copied!';
                setTimeout(() => {
                    btn.innerHTML = originalText;
                }, 2000);
            });
        });
        // Copy code buttons
        document.querySelectorAll('.copy-btn').forEach(btn => {
            btn.addEventListener('click', (e) => {
                const codeBlock = e.target.closest('.code-block');
                const code = codeBlock.querySelector('code').textContent;
                navigator.clipboard.writeText(code).then(() => {
                    btn.textContent = 'Copied!';
                    setTimeout(() => {
                        btn.textContent = 'Copy';
                    }, 2000);
                });
            });
        });
    }
    async loadRelatedApps() {
--- a/mkdocs.yml
+++ b/mkdocs.yml
@@ -7,7 +7,6 @@ docs_dir: docs/md_v2
 nav:
  - Home: 'index.md'
  - "📚 Complete SDK Reference": "complete-sdk-reference.md"
  - "Ask AI": "core/ask-ai.md"
  - "Quick Start": "core/quickstart.md"
  - "Code Examples": "core/examples.md"
@@ -19,7 +18,7 @@ nav:
    - "Marketplace Admin": "marketplace/admin/index.html"
  - Setup & Installation:
    - "Installation": "core/installation.md"
-    - "Docker Deployment": "core/docker-deployment.md"
+    - "Self-Hosting Guide": "core/self-hosting.md"
  - "Blog & Changelog":
    - "Blog Home": "blog/index.md"
    - "Changelog": "https://github.com/unclecode/crawl4ai/blob/main/CHANGELOG.md"
--- a/tests/docker/test_hooks_utility.py
+++ b/tests/docker/test_hooks_utility.py
@@ -1,193 +0,0 @@
 """
 Test script demonstrating the hooks_to_string utility and Docker client integration.
 """
 import asyncio
 from crawl4ai import Crawl4aiDockerClient, hooks_to_string
 # Define hook functions as regular Python functions
 async def auth_hook(page, context, **kwargs):
    """Add authentication cookies."""
    await context.add_cookies([{
        'name': 'test_cookie',
        'value': 'test_value',
        'domain': '.httpbin.org',
        'path': '/'
    }])
    return page
 async def scroll_hook(page, context, **kwargs):
    """Scroll to load lazy content."""
    await page.evaluate("window.scrollTo(0, document.body.scrollHeight)")
    await page.wait_for_timeout(1000)
    return page
 async def viewport_hook(page, context, **kwargs):
    """Set custom viewport."""
    await page.set_viewport_size({"width": 1920, "height": 1080})
    return page
 async def test_hooks_utility():
    """Test the hooks_to_string utility function."""
    print("=" * 60)
    print("Testing hooks_to_string utility")
    print("=" * 60)
    # Create hooks dictionary with function objects
    hooks_dict = {
        "on_page_context_created": auth_hook,
        "before_retrieve_html": scroll_hook
    }
    # Convert to string format
    hooks_string = hooks_to_string(hooks_dict)
    print("\n✓ Successfully converted function objects to strings")
    print(f"\n✓ Converted {len(hooks_string)} hooks:")
    for hook_name in hooks_string.keys():
        print(f"  - {hook_name}")
    print("\n✓ Preview of converted hook:")
    print("-" * 60)
    print(hooks_string["on_page_context_created"][:200] + "...")
    print("-" * 60)
    return hooks_string
 async def test_docker_client_with_functions():
    """Test Docker client with function objects (automatic conversion)."""
    print("\n" + "=" * 60)
    print("Testing Docker Client with Function Objects")
    print("=" * 60)
    # Note: this requires a running Crawl4AI Docker server reachable at base_url.
    async with Crawl4aiDockerClient(base_url="http://localhost:11234", verbose=True) as client:
        # Pass function objects directly - they'll be converted automatically
        result = await client.crawl(
            ["https://httpbin.org/html"],
            hooks={
                "on_page_context_created": auth_hook,
                "before_retrieve_html": scroll_hook
            },
            hooks_timeout=30
        )
        print(f"\n✓ Crawl successful: {result.success}")
        print(f"✓ URL: {result.url}")
    print("\n✓ Docker client accepts function objects directly")
    print("✓ Automatic conversion happens internally")
    print("✓ No manual string formatting needed!")
 async def test_docker_client_with_strings():
    """Test Docker client with pre-converted strings."""
    print("\n" + "=" * 60)
    print("Testing Docker Client with String Hooks")
    print("=" * 60)
    # Convert hooks to strings first
    hooks_dict = {
        "on_page_context_created": viewport_hook,
        "before_retrieve_html": scroll_hook
    }
    hooks_string = hooks_to_string(hooks_dict)
    # Note: this requires a running Crawl4AI Docker server reachable at base_url.
    async with Crawl4aiDockerClient(base_url="http://localhost:11234", verbose=True) as client:
        # Pass string hooks - they'll be used as-is
        result = await client.crawl(
            ["https://httpbin.org/html"],
            hooks=hooks_string,
            hooks_timeout=30
        )
        print(f"\n✓ Crawl successful: {result.success}")
    print("\n✓ Docker client also accepts pre-converted strings")
    print("✓ Backward compatible with existing code")
 async def show_usage_patterns():
    """Show different usage patterns."""
    print("\n" + "=" * 60)
    print("Usage Patterns")
    print("=" * 60)
    print("\n1. Direct function usage (simplest):")
    print("-" * 60)
    print("""
    async def my_hook(page, context, **kwargs):
        await page.set_viewport_size({"width": 1920, "height": 1080})
        return page
    result = await client.crawl(
        ["https://example.com"],
        hooks={"on_page_context_created": my_hook}
    )
    """)
    print("\n2. Convert then use:")
    print("-" * 60)
    print("""
    hooks_dict = {"on_page_context_created": my_hook}
    hooks_string = hooks_to_string(hooks_dict)
    result = await client.crawl(
        ["https://example.com"],
        hooks=hooks_string
    )
    """)
    print("\n3. Manual string (backward compatible):")
    print("-" * 60)
    print("""
    hooks_string = {
        "on_page_context_created": '''
 async def hook(page, context, **kwargs):
    await page.set_viewport_size({"width": 1920, "height": 1080})
    return page
 '''
    }
    result = await client.crawl(
        ["https://example.com"],
        hooks=hooks_string
    )
    """)
 async def main():
    """Run all tests."""
    print("\n🚀 Crawl4AI Hooks Utility Test Suite\n")
    # Test the utility function
    # await test_hooks_utility()
    # Show usage with Docker client
    # await test_docker_client_with_functions()
    await test_docker_client_with_strings()
    # Show different patterns
    # await show_usage_patterns()
    # print("\n" + "=" * 60)
    # print("✓ All tests completed successfully!")
    # print("=" * 60)
    # print("\nKey Benefits:")
    # print("  • Write hooks as regular Python functions")
    # print("  • IDE support with autocomplete and type checking")
    # print("  • Automatic conversion to API format")
    # print("  • Backward compatible with string hooks")
    # print("  • Same utility used everywhere")
    # print("\n")
 if __name__ == "__main__":
    asyncio.run(main())
Author	SHA1	Message	Date
unclecode	1a22fb4d4f	docs: rename Docker deployment to self-hosting guide with comprehensive monitoring documentation Major documentation restructuring to emphasize self-hosting capabilities and fully document the real-time monitoring system. Changes: - Renamed docker-deployment.md → self-hosting.md to better reflect the value proposition - Updated mkdocs.yml navigation to "Self-Hosting Guide" - Completely rewrote introduction emphasizing self-hosting benefits: * Data privacy and ownership * Cost control and transparency * Performance and security advantages * Full customization capabilities - Expanded "Metrics & Monitoring" → "Real-time Monitoring & Operations" with: * Monitoring Dashboard section documenting the /monitor UI * Complete feature breakdown (system health, requests, browsers, janitor, errors) * Monitor API Endpoints with all REST endpoints and examples * WebSocket Streaming integration guide with Python examples * Control Actions for manual browser management * Production Integration patterns (Prometheus, custom dashboards, alerting) * Key production metrics to track - Enhanced summary section: * What users learned checklist * Why self-hosting matters * Clear next steps * Key resources with monitoring dashboard URL The monitoring dashboard built 2-3 weeks ago is now fully documented and discoverable. Users will understand they have complete operational visibility at http://localhost:11235/monitor with real-time updates, browser pool management, and programmatic control via REST/WebSocket APIs. This positions Crawl4AI as an enterprise-grade self-hosting solution with DevOps-level monitoring capabilities, not just a Docker deployment.	2025-11-09 13:31:52 +08:00
unclecode	81b5312629	Update gitignore	2025-11-09 10:49:42 +08:00
unclecode	73a5a7b0f5	Update gitignore	2025-10-18 12:41:29 +08:00
unclecode	05921811b8	docs: add comprehensive technical architecture documentation Created ARCHITECTURE.md as a complete technical reference for the Crawl4AI Docker server, replacing the stress test pipeline document with production-grade documentation. Contents: - System overview with architecture diagrams - Core components deep-dive (server, API, utils) - Smart browser pool implementation details - Real-time monitoring system architecture - WebSocket implementation and fallback strategy - Memory management and container detection - Production optimizations and code review fixes - Deployment guides (local, Docker, production) - Comprehensive troubleshooting section - Debug tools and performance tuning - Test suite documentation - Architecture decision log (ADRs) Target audience: Developers maintaining or extending the system Goal: Enable rapid onboarding and confident modifications	2025-10-18 12:05:49 +08:00
unclecode	25507adb5b	feat(monitor): implement code review fixes and real-time WebSocket monitoring Backend Improvements (11 fixes applied): Critical Fixes: - Add lock protection for browser pool access in monitor stats - Ensure async track_janitor_event across all call sites - Improve error handling in monitor request tracking (already in place) Important Fixes: - Replace fire-and-forget Redis with background persistence worker - Add time-based expiry for completed requests/errors (5min cleanup) - Implement input validation for monitor route parameters - Add 4s timeout to timeline updater to prevent hangs - Add warning when killing browsers with active requests - Implement monitor cleanup on shutdown with final persistence - Document memory estimates with TODO for actual tracking Frontend Enhancements: WebSocket Real-time Updates: - Add WebSocket endpoint at /monitor/ws for live monitoring - Implement auto-reconnect with exponential backoff (max 5 attempts) - Add graceful fallback to HTTP polling on WebSocket failure - Send comprehensive updates every 2 seconds (health, requests, browsers, timeline, events) UI/UX Improvements: - Add live connection status indicator with pulsing animation - Green "Live" = WebSocket connected - Yellow "Connecting..." = Attempting connection - Blue "Polling" = Fallback to HTTP polling - Red "Disconnected" = Connection failed - Restore original beautiful styling for all sections - Improve request table layout with flex-grow for URL column - Add browser type text labels alongside emojis - Add flex layout to browser section header Testing: - Add test-websocket.py for WebSocket validation - All 7 integration tests passing successfully Summary: 563 additions across 6 files	2025-10-18 11:38:25 +08:00
unclecode	aba4036ab6	Add demo and test scripts for monitor dashboard activity - Introduced a demo script (`demo_monitor_dashboard.py`) to showcase various monitoring features through simulated activity. - Implemented a test script (`test_monitor_demo.py`) to generate dashboard activity and verify monitor health and endpoint statistics. - Added a logo image to the static assets for branding purposes.	2025-10-17 22:43:06 +08:00
unclecode	e2af031b09	feat(monitor): add real-time monitoring dashboard with Redis persistence Complete observability solution for production deployments with terminal-style UI. Backend Implementation: - `monitor.py`: Stats manager tracking requests, browsers, errors, timeline data - `monitor_routes.py`: REST API endpoints for all monitor functionality - GET /monitor/health - System health snapshot - GET /monitor/requests - Active & completed requests - GET /monitor/browsers - Browser pool details - GET /monitor/endpoints/stats - Aggregated endpoint analytics - GET /monitor/timeline - Time-series data (memory, requests, browsers) - GET /monitor/logs/{janitor,errors} - Event logs - POST /monitor/actions/{cleanup,kill_browser,restart_browser} - Control actions - POST /monitor/stats/reset - Reset counters - Redis persistence for endpoint stats (survives restart) - Timeline tracking (5min window, 5s resolution, 60 data points) Frontend Dashboard (`/dashboard`): - System Health Bar: CPU%, Memory%, Network I/O, Uptime - Pool Status: Live counts (permanent/hot/cold browsers + memory) - Live Activity Tabs: - Requests: Active (realtime) + recent completed (last 100) - Browsers: Detailed table with actions (kill/restart) - Janitor: Cleanup event log with timestamps - Errors: Recent errors with stack traces - Endpoint Analytics: Count, avg latency, success%, pool hit% - Resource Timeline: SVG charts (memory/requests/browsers) with terminal aesthetics - Control Actions: Force cleanup, restart permanent, reset stats - Auto-refresh: 5s polling (toggleable) Integration: - Janitor events tracked (close_cold, close_hot, promote) - Crawler pool promotion events logged - Timeline updater background task (5s interval) - Lifespan hooks for monitor initialization UI Design: - Terminal vibe matching Crawl4AI theme - Dark background, cyan/pink accents, monospace font - Neon glow effects on charts - Responsive layout, hover interactions - Cross-navigation: Playground ↔ Monitor Key Features: - Zero-config: Works out of the box with existing Redis - Real-time visibility into pool efficiency - Manual browser management (kill/restart) - Historical data persistence - DevOps-friendly UX Routes: - API: `/monitor/*` (backend endpoints) - UI: `/dashboard` (static HTML)	2025-10-17 21:36:25 +08:00
unclecode	b97eaeea4c	feat(docker): implement smart browser pool with 10x memory efficiency Major refactoring to eliminate memory leaks and enable high-scale crawling: - Smart 3-Tier Browser Pool: - Permanent browser (always-ready default config) - Hot pool (configs used 3+ times, longer TTL) - Cold pool (new/rare configs, short TTL) - Auto-promotion: cold → hot after 3 uses - 100% pool reuse achieved in tests - Container-Aware Memory Detection: - Read cgroup v1/v2 memory limits (not host metrics) - Accurate memory pressure detection in Docker - Memory-based browser creation blocking - Adaptive Janitor: - Dynamic cleanup intervals (10s/30s/60s based on memory) - Tiered TTLs: cold 30-300s, hot 120-600s - Aggressive cleanup at high memory pressure - Unified Pool Usage: - All endpoints now use pool (/html, /screenshot, /pdf, /execute_js, /md, /llm) - Fixed config signature mismatch (permanent browser matches endpoints) - get_default_browser_config() helper for consistency - Configuration: - Reduced idle_ttl: 1800s → 300s (30min → 5min) - Fixed port: 11234 → 11235 (match Gunicorn) Performance Results (from stress tests): - Memory: 10x reduction (500-700MB × N → 270MB permanent) - Latency: 30-50x faster (<100ms pool hits vs 3-5s startup) - Reuse: 100% for default config, 60%+ for variants - Capacity: 100+ concurrent requests (vs ~20 before) - Leak: 0 MB/cycle (stable across tests) Test Infrastructure: - 7-phase sequential test suite (tests/) - Docker stats integration + log analysis - Pool promotion verification - Memory leak detection - Full endpoint coverage Fixes memory issues reported in production deployments.	2025-10-17 20:38:39 +08:00