feat(agent): implement Claude Code SDK agent with chat mode and persistent browser

Implementation: - Singleton browser pattern (BrowserManager) - one instance for entire session - 7 MCP tools for Crawl4AI (quick_crawl, sessions, navigation, extraction, JS execution, screenshots) - Interactive chat mode with streaming I/O using Claude SDK message generator - Rich-based terminal UI with markdown rendering and syntax highlighting - Single-shot and chat modes (--chat flag) - Comprehensive test suite: component tests, tool tests, 9 multi-turn scenarios Architecture: - agent_crawl.py: CLI entry point with SessionStorage (JSONL logging) - browser_manager.py: Singleton pattern for persistent AsyncWebCrawler - c4ai_tools.py: MCP tools using @tool decorator, integrated with BrowserManager - chat_mode.py: Streaming input mode per Claude SDK spec - terminal_ui.py: Rich-based beautiful terminal output - test_scenarios.py: Automated multi-turn conversation tests (simple/medium/complex) - TECH_SPEC.md: Complete AI-to-AI knowledge transfer document Key fixes: - Use result.markdown (not deprecated result.markdown_v2) - Handle both str and MarkdownGenerationResult types - Track current URL per session for extract_data/execute_js/screenshot tools - Manual browser lifecycle (start/close) instead of context managers Tools enabled: - Crawl4AI: quick_crawl, start_session, navigate, extract_data, execute_js, screenshot, close_session - Claude SDK built-in: Read, Write, Edit, Glob, Grep, Bash, NotebookEdit Total: 12 files, 2820 lines
2025-10-17 12:25:45 +08:00
parent 216019f29a
commit 31741e571a
12 changed files with 2820 additions and 0 deletions
--- a/crawl4ai/agent/TECH_SPEC.md
+++ b/crawl4ai/agent/TECH_SPEC.md
@@ -0,0 +1,429 @@
 # Crawl4AI Agent Technical Specification
 *AI-to-AI Knowledge Transfer Document*
 ## Context Documents
 **MUST READ FIRST:**
 1. `/Users/unclecode/devs/crawl4ai/tmp/CRAWL4AI_SDK.md` - Crawl4AI complete API reference
 2. `/Users/unclecode/devs/crawl4ai/tmp/cc_stream.md` - Claude SDK streaming input mode
 3. `/Users/unclecode/devs/crawl4ai/tmp/CC_PYTHON_SDK.md` - Claude Code Python SDK complete reference
 ## Architecture Overview
 **Core Principle:** Singleton browser instance + streaming chat mode + MCP tools
 ```
 ┌─────────────────────────────────────────────────────────────┐
 │                    Agent Entry Point                         │
 │         agent_crawl.py (CLI: --chat | single-shot)          │
 └─────────────────────────────────────────────────────────────┘
                            │
        ┌───────────────────┼───────────────────┐
        │                   │                   │
   [Chat Mode]         [Single-shot]    [Browser Manager]
        │                   │                   │
        ▼                   ▼                   ▼
  ChatMode.run()    CrawlAgent.run()   BrowserManager
  - Streaming        - One prompt          (Singleton)
  - Interactive      - Exit after           │
  - Commands         - Uses same            ▼
        │              browser         AsyncWebCrawler
        │                   │             (persistent)
        └───────────────────┴────────────────┘
                            │
                    ┌───────┴────────┐
                    │                │
              MCP Tools        Claude SDK
            (Crawl4AI)        (Built-in)
                    │                │
        ┌───────────┴────┐    ┌──────┴──────┐
        │                │    │             │
   quick_crawl    session    Read        Edit
   navigate       tools      Write       Glob
   extract_data              Bash        Grep
   execute_js
   screenshot
   close_session
 ```
 ## File Structure
 ```
 crawl4ai/agent/
 ├── __init__.py                 # Module exports
 ├── agent_crawl.py              # Main CLI entry (190 lines)
 │   ├── SessionStorage          # JSONL logging to ~/.crawl4ai/agents/projects/
 │   ├── CrawlAgent             # Single-shot wrapper
 │   └── main()                 # CLI parser (--chat flag)
 │
 ├── browser_manager.py          # Singleton pattern (70 lines)
 │   └── BrowserManager         # Class methods only, no instances
 │       ├── get_browser()      # Returns singleton AsyncWebCrawler
 │       ├── reconfigure_browser()
 │       ├── close_browser()
 │       └── is_browser_active()
 │
 ├── c4ai_tools.py               # 7 MCP tools (310 lines)
 │   ├── @tool decorators       # Claude SDK decorator
 │   ├── CRAWLER_SESSIONS       # Dict[str, AsyncWebCrawler] for named sessions
 │   ├── CRAWLER_SESSION_URLS   # Dict[str, str] track current URL per session
 │   └── CRAWL_TOOLS            # List of tool functions
 │
 ├── c4ai_prompts.py             # System prompt (130 lines)
 │   └── SYSTEM_PROMPT          # Agent behavior definition
 │
 ├── terminal_ui.py              # Rich-based UI (120 lines)
 │   └── TerminalUI             # Console rendering
 │       ├── show_header()
 │       ├── print_markdown()
 │       ├── print_code()
 │       └── with_spinner()
 │
 ├── chat_mode.py                # Streaming chat (160 lines)
 │   └── ChatMode
 │       ├── message_generator() # AsyncGenerator per cc_stream.md
 │       ├── _handle_command()   # /exit /clear /help /browser
 │       └── run()              # Main chat loop
 │
 ├── test_tools.py               # Direct tool tests (130 lines)
 ├── test_chat.py                # Component tests (90 lines)
 └── test_scenarios.py           # Multi-turn scenarios (500 lines)
    ├── SIMPLE_SCENARIOS
    ├── MEDIUM_SCENARIOS
    ├── COMPLEX_SCENARIOS
    └── ScenarioRunner
 ```
 ## Critical Implementation Details
 ### 1. Browser Singleton Pattern
 **Key:** ONE browser instance for ENTIRE agent session
 ```python
 # browser_manager.py
 class BrowserManager:
    _crawler: Optional[AsyncWebCrawler] = None  # Singleton
    _config: Optional[BrowserConfig] = None
    @classmethod
    async def get_browser(cls, config=None) -> AsyncWebCrawler:
        if cls._crawler is None:
            cls._crawler = AsyncWebCrawler(config or BrowserConfig())
            await cls._crawler.start()  # Manual lifecycle
        return cls._crawler
 ```
 **Behavior:**
 - First call: creates browser with `config` (or default)
 - Subsequent calls: returns same instance, **ignores config param**
 - To change config: `reconfigure_browser(new_config)` (closes old, creates new)
 - Tools use: `crawler = await BrowserManager.get_browser()`
 - No `async with` context manager - manual `start()` / `close()`
 ### 2. Tool Architecture
 **Two types of browser usage:**
 **A) Quick operations** (quick_crawl):
 ```python
@tool("quick_crawl", ...)
 async def quick_crawl(args):
    crawler = await BrowserManager.get_browser()  # Singleton
    result = await crawler.arun(url=args["url"], config=run_config)
    # No close - browser stays alive
 ```
 **B) Named sessions** (start_session, navigate, extract_data, etc.):
 ```python
 CRAWLER_SESSIONS: Dict[str, AsyncWebCrawler] = {}  # Named refs
 CRAWLER_SESSION_URLS: Dict[str, str] = {}  # Track current URL
@tool("start_session", ...)
 async def start_session(args):
    crawler = await BrowserManager.get_browser()
    CRAWLER_SESSIONS[args["session_id"]] = crawler  # Store ref
@tool("navigate", ...)
 async def navigate(args):
    crawler = CRAWLER_SESSIONS[args["session_id"]]
    result = await crawler.arun(url=args["url"], ...)
    CRAWLER_SESSION_URLS[args["session_id"]] = result.url  # Track URL
@tool("extract_data", ...)
 async def extract_data(args):
    crawler = CRAWLER_SESSIONS[args["session_id"]]
    current_url = CRAWLER_SESSION_URLS[args["session_id"]]  # Must have URL
    result = await crawler.arun(url=current_url, ...)  # Re-crawl current page
@tool("close_session", ...)
 async def close_session(args):
    CRAWLER_SESSIONS.pop(args["session_id"])  # Remove ref
    CRAWLER_SESSION_URLS.pop(args["session_id"], None)
    # Browser stays alive (singleton)
 ```
 **Important:** Named sessions are just **references** to singleton browser. Multiple sessions = same browser instance.
 ### 3. Markdown Handling (CRITICAL BUG FIX)
 **OLD (WRONG):**
 ```python
 result.markdown_v2.raw_markdown  # DEPRECATED
 ```
 **NEW (CORRECT):**
 ```python
 # result.markdown can be:
 # - str (simple mode)
 # - MarkdownGenerationResult object (with filters)
 if isinstance(result.markdown, str):
    markdown_content = result.markdown
 elif hasattr(result.markdown, 'raw_markdown'):
    markdown_content = result.markdown.raw_markdown
 ```
 Reference: `CRAWL4AI_SDK.md` line 614 - `markdown_v2` deprecated, use `markdown`
 ### 4. Chat Mode Streaming Input
 **Per cc_stream.md:** Use message generator pattern
 ```python
 # chat_mode.py
 async def message_generator(self) -> AsyncGenerator[Dict[str, Any], None]:
    while not self._exit_requested:
        user_input = await asyncio.to_thread(self.ui.get_user_input)
        if user_input.startswith('/'):
            await self._handle_command(user_input)
            continue
        # Yield in streaming input format
        yield {
            "type": "user",
            "message": {
                "role": "user",
                "content": user_input
            }
        }
 async def run(self):
    async with ClaudeSDKClient(options=self.options) as client:
        await client.query(self.message_generator())  # Pass generator
        async for message in client.receive_messages():
            # Process streaming responses
 ```
 **Key:** Generator keeps yielding user inputs, SDK streams responses back.
 ### 5. Claude SDK Integration
 **Setup:**
 ```python
 from claude_agent_sdk import tool, create_sdk_mcp_server, ClaudeSDKClient, ClaudeAgentOptions
 # 1. Define tools with @tool decorator
@tool("quick_crawl", "description", {"url": str, "output_format": str})
 async def quick_crawl(args: Dict[str, Any]) -> Dict[str, Any]:
    return {"content": [{"type": "text", "text": json.dumps(result)}]}
 # 2. Create MCP server
 crawler_server = create_sdk_mcp_server(
    name="crawl4ai",
    version="1.0.0",
    tools=[quick_crawl, start_session, ...]  # List of @tool functions
 )
 # 3. Configure options
 options = ClaudeAgentOptions(
    mcp_servers={"crawler": crawler_server},
    allowed_tools=[
        "mcp__crawler__quick_crawl",  # Format: mcp__{server}__{tool}
        "mcp__crawler__start_session",
        # Built-in tools:
        "Read", "Write", "Edit", "Glob", "Grep", "Bash", "NotebookEdit"
    ],
    system_prompt=SYSTEM_PROMPT,
    permission_mode="acceptEdits"
 )
 # 4. Use client
 async with ClaudeSDKClient(options=options) as client:
    await client.query(prompt_or_generator)
    async for message in client.receive_messages():
        # Process AssistantMessage, ResultMessage, etc.
 ```
 **Tool response format:**
 ```python
 return {
    "content": [{
        "type": "text",
        "text": json.dumps({"success": True, "data": "..."})
    }]
 }
 ```
 ## Operating Modes
 ### Single-Shot Mode
 ```bash
 python -m crawl4ai.agent.agent_crawl "Crawl example.com"
 ```
 - One prompt → execute → exit
 - Uses singleton browser
 - No cleanup of browser (process exit handles it)
 ### Chat Mode
 ```bash
 python -m crawl4ai.agent.agent_crawl --chat
 ```
 - Interactive loop with streaming I/O
 - Commands: `/exit` `/clear` `/help` `/browser`
 - Browser persists across all turns
 - Cleanup on exit: `BrowserManager.close_browser()`
 ## Testing Architecture
 **3 test levels:**
 1. **Component tests** (`test_chat.py`): Non-interactive, tests individual classes
 2. **Tool tests** (`test_tools.py`): Direct AsyncWebCrawler calls, validates Crawl4AI integration
 3. **Scenario tests** (`test_scenarios.py`): Automated multi-turn conversations
   - Injects messages programmatically
   - Validates tool calls, keywords, files created
   - Categories: SIMPLE (2), MEDIUM (3), COMPLEX (4)
 ## Dependencies
 ```python
 # External
 from crawl4ai import AsyncWebCrawler, BrowserConfig, CrawlerRunConfig, CacheMode
 from crawl4ai.extraction_strategy import LLMExtractionStrategy
 from claude_agent_sdk import (
    tool, create_sdk_mcp_server, ClaudeSDKClient, ClaudeAgentOptions,
    AssistantMessage, TextBlock, ResultMessage, ToolUseBlock
 )
 from rich.console import Console  # Already installed
 from rich.markdown import Markdown
 from rich.syntax import Syntax
 # Stdlib
 import asyncio, json, uuid, argparse
 from pathlib import Path
 from typing import Optional, Dict, Any, AsyncGenerator
 ```
 ## Common Pitfalls
 1. **DON'T** use `async with AsyncWebCrawler()` - breaks singleton pattern
 2. **DON'T** use `result.markdown_v2` - deprecated field
 3. **DON'T** call `crawler.arun()` without URL in session tools - needs current_url
 4. **DON'T** close browser in tools - managed by BrowserManager
 5. **DON'T** use `break` in message iteration - causes asyncio issues
 6. **DO** track session URLs in `CRAWLER_SESSION_URLS` for session tools
 7. **DO** handle both `str` and `MarkdownGenerationResult` for `result.markdown`
 8. **DO** use manual lifecycle `await crawler.start()` / `await crawler.close()`
 ## Session Storage
 **Location:** `~/.crawl4ai/agents/projects/{sanitized_cwd}/{uuid}.jsonl`
 **Format:** JSONL with events:
 ```json
 {"timestamp": "...", "event": "session_start", "data": {...}}
 {"timestamp": "...", "event": "user_message", "data": {"text": "..."}}
 {"timestamp": "...", "event": "assistant_message", "data": {"turn": 1, "text": "..."}}
 {"timestamp": "...", "event": "session_end", "data": {"duration_ms": 1000, ...}}
 ```
 ## CLI Options
 ```
 --chat                  Interactive chat mode
 --model MODEL          Claude model override
 --permission-mode MODE  acceptEdits|bypassPermissions|default|plan
 --add-dir DIR [DIR...] Additional accessible directories
 --system-prompt TEXT   Custom system prompt
 --session-id UUID      Resume/specify session
 --debug                Full tracebacks
 ```
 ## Performance Characteristics
 - **Browser startup:** ~2-4s (once per session)
 - **Quick crawl:** ~1-2s (reuses browser)
 - **Session operations:** ~1-2s (same browser)
 - **Chat latency:** Real-time streaming, no buffering
 - **Memory:** One browser instance regardless of operations
 ## Extension Points
 1. **New tools:** Add `@tool` function → add to `CRAWL_TOOLS` → add to `allowed_tools`
 2. **New commands:** Add handler in `ChatMode._handle_command()`
 3. **Custom UI:** Replace `TerminalUI` with different renderer
 4. **Persistent sessions:** Serialize browser cookies/state to disk in `BrowserManager`
 5. **Multi-browser:** Modify `BrowserManager` to support multiple configs (not recommended)
 ## Next Steps: Testing & Evaluation Pipeline
 ### Phase 1: Automated Testing (CURRENT)
 **Objective:** Verify codebase correctness, not agent quality
 **Test Execution:**
 ```bash
 # 1. Component tests (fast, non-interactive)
 python crawl4ai/agent/test_chat.py
 # Expected: All components instantiate correctly
 # 2. Tool integration tests (medium, requires browser)
 python crawl4ai/agent/test_tools.py
 # Expected: All 7 tools work with Crawl4AI
 # 3. Multi-turn scenario tests (slow, comprehensive)
 python crawl4ai/agent/test_scenarios.py
 # Expected: 9 scenarios pass (2 simple, 3 medium, 4 complex)
 # Output: test_agent_output/test_results.json
 ```
 **Success Criteria:**
 - All component tests pass
 - All tool tests pass
 - ≥80% scenario tests pass (7/9)
 - No crashes, exceptions, or hangs
 - Browser cleanup verified
 **Automated Pipeline:**
 ```bash
 # Run all tests in sequence, exit on first failure
 cd /Users/unclecode/devs/crawl4ai
 python crawl4ai/agent/test_chat.py && \
 python crawl4ai/agent/test_tools.py && \
 python crawl4ai/agent/test_scenarios.py
 echo "Exit code: $?"  # 0 = all passed
 ```
 ### Phase 2: Evaluation (NEXT)
 **Objective:** Measure agent performance quality
 **Metrics to define:**
 - Task completion rate
 - Tool selection accuracy
 - Context retention across turns
 - Planning effectiveness
 - Error recovery capability
 **Eval framework needed:**
 - Expand scenario tests with quality scoring
 - Add ground truth comparisons
 - Measure token efficiency
 - Track reasoning quality
 **Not in scope yet** - wait for Phase 1 completion
 ---
 **Last Updated:** 2025-01-17
 **Version:** 1.0.0
 **Status:** Testing Phase - Ready for automated test runs
--- a/crawl4ai/agent/init.py
+++ b/crawl4ai/agent/init.py
@@ -0,0 +1,13 @@
 # __init__.py
 """Crawl4AI Agent - Browser automation agent powered by Claude Code SDK."""
 from .c4ai_tools import CRAWL_TOOLS
 from .c4ai_prompts import SYSTEM_PROMPT
 from .agent_crawl import CrawlAgent, SessionStorage
 __all__ = [
    "CRAWL_TOOLS",
    "SYSTEM_PROMPT",
    "CrawlAgent",
    "SessionStorage",
 ]
--- a/crawl4ai/agent/agent-cc-sdk.md
+++ b/crawl4ai/agent/agent-cc-sdk.md
@@ -0,0 +1,593 @@
 ```python
 # c4ai_tools.py
 """Crawl4AI tools for Claude Code SDK agent."""
 import json
 import asyncio
 from typing import Any, Dict
 from crawl4ai import AsyncWebCrawler, BrowserConfig, CrawlerRunConfig, CacheMode
 from crawl4ai.extraction_strategy import LLMExtractionStrategy
 from claude_agent_sdk import tool
 # Global session storage
 CRAWLER_SESSIONS: Dict[str, AsyncWebCrawler] = {}
@tool("quick_crawl", "One-shot crawl for simple extraction. Returns markdown, HTML, or structured data.", {
    "url": str,
    "output_format": str,  # "markdown" | "html" | "structured" | "screenshot"
    "extraction_schema": str,  # Optional: JSON schema for structured extraction
    "js_code": str,  # Optional: JavaScript to execute before extraction
    "wait_for": str,  # Optional: CSS selector to wait for
 })
 async def quick_crawl(args: Dict[str, Any]) -> Dict[str, Any]:
    """Fast single-page crawl without session management."""
    crawler_config = BrowserConfig(headless=True, verbose=False)
    run_config = CrawlerRunConfig(
        cache_mode=CacheMode.BYPASS,
        js_code=args.get("js_code"),
        wait_for=args.get("wait_for"),
    )
    # Add extraction strategy if structured data requested
    if args.get("extraction_schema"):
        run_config.extraction_strategy = LLMExtractionStrategy(
            provider="openai/gpt-4o-mini",
            schema=json.loads(args["extraction_schema"]),
            instruction="Extract data according to the provided schema."
        )
    async with AsyncWebCrawler(config=crawler_config) as crawler:
        result = await crawler.arun(url=args["url"], config=run_config)
        if not result.success:
            return {
                "content": [{
                    "type": "text",
                    "text": json.dumps({"error": result.error_message, "success": False})
                }]
            }
        output_map = {
            "markdown": result.markdown_v2.raw_markdown if result.markdown_v2 else "",
            "html": result.html,
            "structured": result.extracted_content,
            "screenshot": result.screenshot,
        }
        response = {
            "success": True,
            "url": result.url,
            "data": output_map.get(args["output_format"], result.markdown_v2.raw_markdown)
        }
        return {"content": [{"type": "text", "text": json.dumps(response, indent=2)}]}
@tool("start_session", "Start a persistent browser session for multi-step crawling and automation.", {
    "session_id": str,
    "headless": bool,  # Default True
 })
 async def start_session(args: Dict[str, Any]) -> Dict[str, Any]:
    """Initialize a persistent crawler session."""
    session_id = args["session_id"]
    if session_id in CRAWLER_SESSIONS:
        return {"content": [{"type": "text", "text": json.dumps({
            "error": f"Session {session_id} already exists",
            "success": False
        })}]}
    crawler_config = BrowserConfig(
        headless=args.get("headless", True),
        verbose=False
    )
    crawler = AsyncWebCrawler(config=crawler_config)
    await crawler.__aenter__()
    CRAWLER_SESSIONS[session_id] = crawler
    return {"content": [{"type": "text", "text": json.dumps({
        "success": True,
        "session_id": session_id,
        "message": f"Browser session {session_id} started"
    })}]}
@tool("navigate", "Navigate to a URL in an active session.", {
    "session_id": str,
    "url": str,
    "wait_for": str,  # Optional: CSS selector to wait for
    "js_code": str,  # Optional: JavaScript to execute after load
 })
 async def navigate(args: Dict[str, Any]) -> Dict[str, Any]:
    """Navigate to URL in session."""
    session_id = args["session_id"]
    if session_id not in CRAWLER_SESSIONS:
        return {"content": [{"type": "text", "text": json.dumps({
            "error": f"Session {session_id} not found",
            "success": False
        })}]}
    crawler = CRAWLER_SESSIONS[session_id]
    run_config = CrawlerRunConfig(
        cache_mode=CacheMode.BYPASS,
        wait_for=args.get("wait_for"),
        js_code=args.get("js_code"),
    )
    result = await crawler.arun(url=args["url"], config=run_config)
    return {"content": [{"type": "text", "text": json.dumps({
        "success": result.success,
        "url": result.url,
        "message": f"Navigated to {args['url']}"
    })}]}
@tool("extract_data", "Extract data from current page in session using schema or return markdown.", {
    "session_id": str,
    "output_format": str,  # "markdown" | "structured"
    "extraction_schema": str,  # Required for structured, JSON schema
    "wait_for": str,  # Optional: Wait for element before extraction
    "js_code": str,  # Optional: Execute JS before extraction
 })
 async def extract_data(args: Dict[str, Any]) -> Dict[str, Any]:
    """Extract data from current page."""
    session_id = args["session_id"]
    if session_id not in CRAWLER_SESSIONS:
        return {"content": [{"type": "text", "text": json.dumps({
            "error": f"Session {session_id} not found",
            "success": False
        })}]}
    crawler = CRAWLER_SESSIONS[session_id]
    run_config = CrawlerRunConfig(
        cache_mode=CacheMode.BYPASS,
        wait_for=args.get("wait_for"),
        js_code=args.get("js_code"),
    )
    if args["output_format"] == "structured" and args.get("extraction_schema"):
        run_config.extraction_strategy = LLMExtractionStrategy(
            provider="openai/gpt-4o-mini",
            schema=json.loads(args["extraction_schema"]),
            instruction="Extract data according to schema."
        )
    result = await crawler.arun(config=run_config)
    if not result.success:
        return {"content": [{"type": "text", "text": json.dumps({
            "error": result.error_message,
            "success": False
        })}]}
    data = (result.extracted_content if args["output_format"] == "structured" 
            else result.markdown_v2.raw_markdown if result.markdown_v2 else "")
    return {"content": [{"type": "text", "text": json.dumps({
        "success": True,
        "data": data
    }, indent=2)}]}
@tool("execute_js", "Execute JavaScript in the current page context.", {
    "session_id": str,
    "js_code": str,
    "wait_for": str,  # Optional: Wait for element after execution
 })
 async def execute_js(args: Dict[str, Any]) -> Dict[str, Any]:
    """Execute JavaScript in session."""
    session_id = args["session_id"]
    if session_id not in CRAWLER_SESSIONS:
        return {"content": [{"type": "text", "text": json.dumps({
            "error": f"Session {session_id} not found",
            "success": False
        })}]}
    crawler = CRAWLER_SESSIONS[session_id]
    run_config = CrawlerRunConfig(
        cache_mode=CacheMode.BYPASS,
        js_code=args["js_code"],
        wait_for=args.get("wait_for"),
    )
    result = await crawler.arun(config=run_config)
    return {"content": [{"type": "text", "text": json.dumps({
        "success": result.success,
        "message": "JavaScript executed"
    })}]}
@tool("screenshot", "Take a screenshot of the current page.", {
    "session_id": str,
 })
 async def screenshot(args: Dict[str, Any]) -> Dict[str, Any]:
    """Capture screenshot."""
    session_id = args["session_id"]
    if session_id not in CRAWLER_SESSIONS:
        return {"content": [{"type": "text", "text": json.dumps({
            "error": f"Session {session_id} not found",
            "success": False
        })}]}
    crawler = CRAWLER_SESSIONS[session_id]
    result = await crawler.arun(config=CrawlerRunConfig(cache_mode=CacheMode.BYPASS))
    return {"content": [{"type": "text", "text": json.dumps({
        "success": True,
        "screenshot": result.screenshot if result.success else None
    })}]}
@tool("close_session", "Close and cleanup a browser session.", {
    "session_id": str,
 })
 async def close_session(args: Dict[str, Any]) -> Dict[str, Any]:
    """Close crawler session."""
    session_id = args["session_id"]
    if session_id not in CRAWLER_SESSIONS:
        return {"content": [{"type": "text", "text": json.dumps({
            "error": f"Session {session_id} not found",
            "success": False
        })}]}
    crawler = CRAWLER_SESSIONS.pop(session_id)
    await crawler.__aexit__(None, None, None)
    return {"content": [{"type": "text", "text": json.dumps({
        "success": True,
        "message": f"Session {session_id} closed"
    })}]}
 # Export all tools
 CRAWL_TOOLS = [
    quick_crawl,
    start_session,
    navigate,
    extract_data,
    execute_js,
    screenshot,
    close_session,
 ]
 ```
 ```python
 # c4ai_prompts.py
 """System prompts for Crawl4AI agent."""
 SYSTEM_PROMPT = """You are an expert web crawling and browser automation agent powered by Crawl4AI.
 # Core Capabilities
 You can perform sophisticated multi-step web scraping and automation tasks through two modes:
 ## Quick Mode (simple tasks)
 - Use `quick_crawl` for single-page data extraction
 - Best for: simple scrapes, getting page content, one-time extractions
 ## Session Mode (complex tasks)
 - Use `start_session` to create persistent browser sessions
 - Navigate, interact, extract data across multiple pages
 - Essential for: workflows requiring JS execution, pagination, filtering, multi-step automation
 # Tool Usage Patterns
 ## Simple Extraction
 1. Use `quick_crawl` with appropriate output_format
 2. Provide extraction_schema for structured data
 ## Multi-Step Workflow
 1. `start_session` - Create browser session with unique ID
 2. `navigate` - Go to target URL
 3. `execute_js` - Interact with page (click buttons, scroll, fill forms)
 4. `extract_data` - Get data using schema or markdown
 5. Repeat steps 2-4 as needed
 6. `close_session` - Clean up when done
 # Critical Instructions
 1. **Iteration & Validation**: When tasks require filtering or conditional logic:
   - Extract data first, analyze results
   - Filter/validate in your reasoning
   - Make subsequent tool calls based on validation
   - Continue until task criteria are met
 2. **Structured Extraction**: Always use JSON schemas for structured data:
   ```json
   {
     "type": "object",
     "properties": {
       "field_name": {"type": "string"},
       "price": {"type": "number"}
     }
   }
   ```
 3. **Session Management**:
   - Generate unique session IDs (e.g., "product_scrape_001")
   - Always close sessions when done
   - Use sessions for tasks requiring multiple page visits
 4. **JavaScript Execution**:
   - Use for: clicking buttons, scrolling, waiting for dynamic content
   - Example: `js_code: "document.querySelector('.load-more').click()"`
   - Combine with `wait_for` to ensure content loads
 5. **Error Handling**:
   - Check `success` field in all responses
   - Retry with different strategies if extraction fails
   - Report specific errors to user
 6. **Data Persistence**:
   - Save results using `Write` tool to JSON files
   - Use descriptive filenames with timestamps
   - Structure data clearly for user consumption
 # Example Workflows
 ## Workflow 1: Filter & Crawl
 Task: "Find products >$10, crawl each, extract details"
 1. `quick_crawl` product listing page with schema for [name, price, url]
 2. Analyze results, filter price > 10 in reasoning
 3. `start_session` for detailed crawling
 4. For each filtered product:
   - `navigate` to product URL
   - `extract_data` with detail schema
 5. Aggregate results
 6. `close_session`
 7. `Write` results to JSON
 ## Workflow 2: Paginated Scraping
 Task: "Scrape all items across multiple pages"
 1. `start_session`
 2. `navigate` to page 1
 3. `extract_data` items from current page
 4. Check for "next" button
 5. `execute_js` to click next
 6. Repeat 3-5 until no more pages
 7. `close_session`
 8. Save aggregated data
 ## Workflow 3: Dynamic Content
 Task: "Scrape reviews after clicking 'Load More'"
 1. `start_session`
 2. `navigate` to product page
 3. `execute_js` to click load more button
 4. `wait_for` reviews container
 5. `extract_data` all reviews
 6. `close_session`
 # Quality Guidelines
 - **Be thorough**: Don't stop until task requirements are fully met
 - **Validate data**: Check extracted data matches expected format
 - **Handle edge cases**: Empty results, pagination limits, rate limiting
 - **Clear reporting**: Summarize what was found, any issues encountered
 - **Efficient**: Use quick_crawl when possible, sessions only when needed
 # Output Format
 When saving data, use clean JSON structure:
 ```json
 {
  "metadata": {
    "scraped_at": "ISO timestamp",
    "source_url": "...",
    "total_items": 0
  },
  "data": [...]
 }
 ```
 Always provide a final summary of:
 - Items found/processed
 - Time taken
 - Files created
 - Any warnings/errors
 Remember: You have unlimited turns to complete the task. Take your time, validate each step, and ensure quality results."""
 ```
 ```python
 # agent_crawl.py
 """Crawl4AI Agent CLI - Browser automation agent powered by Claude Code SDK."""
 import asyncio
 import sys
 import json
 import uuid
 from pathlib import Path
 from datetime import datetime
 from typing import Optional
 import argparse
 from claude_agent_sdk import ClaudeSDKClient, ClaudeAgentOptions, create_sdk_mcp_server
 from claude_agent_sdk import AssistantMessage, TextBlock, ResultMessage
 from c4ai_tools import CRAWL_TOOLS
 from c4ai_prompts import SYSTEM_PROMPT
 class SessionStorage:
    """Manage session storage in ~/.crawl4ai/agents/projects/"""
    def __init__(self, cwd: Optional[str] = None):
        self.cwd = Path(cwd) if cwd else Path.cwd()
        self.base_dir = Path.home() / ".crawl4ai" / "agents" / "projects"
        self.project_dir = self.base_dir / self._sanitize_path(str(self.cwd.resolve()))
        self.project_dir.mkdir(parents=True, exist_ok=True)
        self.session_id = str(uuid.uuid4())
        self.log_file = self.project_dir / f"{self.session_id}.jsonl"
    @staticmethod
    def _sanitize_path(path: str) -> str:
        """Convert /Users/unclecode/devs/test to -Users-unclecode-devs-test"""
        return path.replace("/", "-").replace("\\", "-")
    def log(self, event_type: str, data: dict):
        """Append event to JSONL log."""
        entry = {
            "timestamp": datetime.utcnow().isoformat(),
            "event": event_type,
            "session_id": self.session_id,
            "data": data
        }
        with open(self.log_file, "a") as f:
            f.write(json.dumps(entry) + "\n")
    def get_session_path(self) -> str:
        """Return path to current session log."""
        return str(self.log_file)
 class CrawlAgent:
    """Crawl4AI agent wrapper."""
    def __init__(self, args: argparse.Namespace):
        self.args = args
        self.storage = SessionStorage(args.add_dir[0] if args.add_dir else None)
        self.client: Optional[ClaudeSDKClient] = None
        # Create MCP server with crawl tools
        self.crawler_server = create_sdk_mcp_server(
            name="crawl4ai",
            version="1.0.0",
            tools=CRAWL_TOOLS
        )
        # Build options
        self.options = ClaudeAgentOptions(
            mcp_servers={"crawler": self.crawler_server},
            allowed_tools=[
                "mcp__crawler__quick_crawl",
                "mcp__crawler__start_session",
                "mcp__crawler__navigate",
                "mcp__crawler__extract_data",
                "mcp__crawler__execute_js",
                "mcp__crawler__screenshot",
                "mcp__crawler__close_session",
                "Write", "Read", "Bash"
            ],
            system_prompt=SYSTEM_PROMPT if not args.system_prompt else args.system_prompt,
            permission_mode=args.permission_mode or "acceptEdits",
            cwd=args.add_dir[0] if args.add_dir else str(Path.cwd()),
            model=args.model,
            session_id=args.session_id or self.storage.session_id,
        )
    async def run(self, prompt: str):
        """Execute crawl task."""
        self.storage.log("session_start", {
            "prompt": prompt,
            "cwd": self.options.cwd,
            "model": self.options.model
        })
        print(f"\n🕷️  Crawl4AI Agent")
        print(f"📁 Session: {self.storage.session_id}")
        print(f"💾 Log: {self.storage.get_session_path()}")
        print(f"🎯 Task: {prompt}\n")
        async with ClaudeSDKClient(options=self.options) as client:
            self.client = client
            await client.query(prompt)
            turn = 0
            async for message in client.receive_messages():
                turn += 1
                if isinstance(message, AssistantMessage):
                    for block in message.content:
                        if isinstance(block, TextBlock):
                            print(f"\n💭 [{turn}] {block.text}")
                            self.storage.log("assistant_message", {"turn": turn, "text": block.text})
                elif isinstance(message, ResultMessage):
                    print(f"\n✅ Completed in {message.duration_ms/1000:.2f}s")
                    print(f"💰 Cost: ${message.total_cost_usd:.4f}" if message.total_cost_usd else "")
                    print(f"🔄 Turns: {message.num_turns}")
                    self.storage.log("session_end", {
                        "duration_ms": message.duration_ms,
                        "cost_usd": message.total_cost_usd,
                        "turns": message.num_turns,
                        "success": not message.is_error
                    })
                    break
        print(f"\n📊 Session log: {self.storage.get_session_path()}\n")
 def main():
    parser = argparse.ArgumentParser(
        description="Crawl4AI Agent - Browser automation powered by Claude Code SDK",
        formatter_class=argparse.RawDescriptionHelpFormatter
    )
    parser.add_argument("prompt", nargs="?", help="Your crawling task prompt")
    parser.add_argument("--system-prompt", help="Custom system prompt")
    parser.add_argument("--permission-mode", choices=["acceptEdits", "bypassPermissions", "default", "plan"],
                       help="Permission mode for tool execution")
    parser.add_argument("--model", help="Model to use (e.g., 'sonnet', 'opus')")
    parser.add_argument("--add-dir", nargs="+", help="Additional directories for file access")
    parser.add_argument("--session-id", help="Use specific session ID (UUID)")
    parser.add_argument("-v", "--version", action="version", version="Crawl4AI Agent 1.0.0")
    parser.add_argument("--debug", action="store_true", help="Enable debug mode")
    args = parser.parse_args()
    if not args.prompt:
        parser.print_help()
        print("\nExample usage:")
        print('  crawl-agent "Scrape all products from example.com with price > $10"')
        print('  crawl-agent --add-dir ~/projects "Find all Python files and analyze imports"')
        sys.exit(1)
    try:
        agent = CrawlAgent(args)
        asyncio.run(agent.run(args.prompt))
    except KeyboardInterrupt:
        print("\n\n⚠️  Interrupted by user")
        sys.exit(0)
    except Exception as e:
        print(f"\n❌ Error: {e}")
        if args.debug:
            raise
        sys.exit(1)
 if __name__ == "__main__":
    main()
 ```
 **Usage:**
 ```bash
 # Simple scrape
 python agent_crawl.py "Get all product names from example.com"
 # Complex filtering
 python agent_crawl.py "Find products >$10 from shop.com, crawl each, extract id/name/price"
 # Multi-step automation
 python agent_crawl.py "Go to amazon.com, search 'laptop', filter 4+ stars, scrape top 10"
 # With options
 python agent_crawl.py --add-dir ~/projects --model sonnet "Scrape competitor prices"
 ```
 **Session logs stored at:**
 `~/.crawl4ai/agents/projects/-Users-unclecode-devs-test/{uuid}.jsonl`
--- a/crawl4ai/agent/agent_crawl.py
+++ b/crawl4ai/agent/agent_crawl.py
@@ -0,0 +1,202 @@
 # agent_crawl.py
 """Crawl4AI Agent CLI - Browser automation agent powered by Claude Code SDK."""
 import asyncio
 import sys
 import json
 import uuid
 from pathlib import Path
 from datetime import datetime
 from typing import Optional
 import argparse
 from claude_agent_sdk import ClaudeSDKClient, ClaudeAgentOptions, create_sdk_mcp_server
 from claude_agent_sdk import AssistantMessage, TextBlock, ResultMessage
 from .c4ai_tools import CRAWL_TOOLS
 from .c4ai_prompts import SYSTEM_PROMPT
 from .terminal_ui import TerminalUI
 from .chat_mode import ChatMode
 class SessionStorage:
    """Manage session storage in ~/.crawl4ai/agents/projects/"""
    def __init__(self, cwd: Optional[str] = None):
        self.cwd = Path(cwd) if cwd else Path.cwd()
        self.base_dir = Path.home() / ".crawl4ai" / "agents" / "projects"
        self.project_dir = self.base_dir / self._sanitize_path(str(self.cwd.resolve()))
        self.project_dir.mkdir(parents=True, exist_ok=True)
        self.session_id = str(uuid.uuid4())
        self.log_file = self.project_dir / f"{self.session_id}.jsonl"
    @staticmethod
    def _sanitize_path(path: str) -> str:
        """Convert /Users/unclecode/devs/test to -Users-unclecode-devs-test"""
        return path.replace("/", "-").replace("\\", "-")
    def log(self, event_type: str, data: dict):
        """Append event to JSONL log."""
        entry = {
            "timestamp": datetime.utcnow().isoformat(),
            "event": event_type,
            "session_id": self.session_id,
            "data": data
        }
        with open(self.log_file, "a") as f:
            f.write(json.dumps(entry) + "\n")
    def get_session_path(self) -> str:
        """Return path to current session log."""
        return str(self.log_file)
 class CrawlAgent:
    """Crawl4AI agent wrapper."""
    def __init__(self, args: argparse.Namespace):
        self.args = args
        self.storage = SessionStorage(args.add_dir[0] if args.add_dir else None)
        self.client: Optional[ClaudeSDKClient] = None
        # Create MCP server with crawl tools
        self.crawler_server = create_sdk_mcp_server(
            name="crawl4ai",
            version="1.0.0",
            tools=CRAWL_TOOLS
        )
        # Build options
        self.options = ClaudeAgentOptions(
            mcp_servers={"crawler": self.crawler_server},
            allowed_tools=[
                # Crawl4AI tools
                "mcp__crawler__quick_crawl",
                "mcp__crawler__start_session",
                "mcp__crawler__navigate",
                "mcp__crawler__extract_data",
                "mcp__crawler__execute_js",
                "mcp__crawler__screenshot",
                "mcp__crawler__close_session",
                # Claude Code SDK built-in tools
                "Read",
                "Write",
                "Edit",
                "Glob",
                "Grep",
                "Bash",
                "NotebookEdit"
            ],
            system_prompt=SYSTEM_PROMPT if not args.system_prompt else args.system_prompt,
            permission_mode=args.permission_mode or "acceptEdits",
            cwd=args.add_dir[0] if args.add_dir else str(Path.cwd()),
            model=args.model,
        )
    async def run(self, prompt: str):
        """Execute crawl task."""
        self.storage.log("session_start", {
            "prompt": prompt,
            "cwd": self.options.cwd,
            "model": self.options.model
        })
        print(f"\n🕷️  Crawl4AI Agent")
        print(f"📁 Session: {self.storage.session_id}")
        print(f"💾 Log: {self.storage.get_session_path()}")
        print(f"🎯 Task: {prompt}\n")
        async with ClaudeSDKClient(options=self.options) as client:
            self.client = client
            await client.query(prompt)
            turn = 0
            async for message in client.receive_messages():
                turn += 1
                if isinstance(message, AssistantMessage):
                    for block in message.content:
                        if isinstance(block, TextBlock):
                            print(f"\n💭 [{turn}] {block.text}")
                            self.storage.log("assistant_message", {"turn": turn, "text": block.text})
                elif isinstance(message, ResultMessage):
                    print(f"\n✅ Completed in {message.duration_ms/1000:.2f}s")
                    print(f"💰 Cost: ${message.total_cost_usd:.4f}" if message.total_cost_usd else "")
                    print(f"🔄 Turns: {message.num_turns}")
                    self.storage.log("session_end", {
                        "duration_ms": message.duration_ms,
                        "cost_usd": message.total_cost_usd,
                        "turns": message.num_turns,
                        "success": not message.is_error
                    })
                    break
        print(f"\n📊 Session log: {self.storage.get_session_path()}\n")
 def main():
    parser = argparse.ArgumentParser(
        description="Crawl4AI Agent - Browser automation powered by Claude Code SDK",
        formatter_class=argparse.RawDescriptionHelpFormatter
    )
    parser.add_argument("prompt", nargs="?", help="Your crawling task prompt (not used in --chat mode)")
    parser.add_argument("--chat", action="store_true", help="Start interactive chat mode")
    parser.add_argument("--system-prompt", help="Custom system prompt")
    parser.add_argument("--permission-mode", choices=["acceptEdits", "bypassPermissions", "default", "plan"],
                       help="Permission mode for tool execution")
    parser.add_argument("--model", help="Model to use (e.g., 'sonnet', 'opus')")
    parser.add_argument("--add-dir", nargs="+", help="Additional directories for file access")
    parser.add_argument("--session-id", help="Use specific session ID (UUID)")
    parser.add_argument("-v", "--version", action="version", version="Crawl4AI Agent 1.0.0")
    parser.add_argument("--debug", action="store_true", help="Enable debug mode")
    args = parser.parse_args()
    # Chat mode - interactive
    if args.chat:
        try:
            agent = CrawlAgent(args)
            ui = TerminalUI()
            chat = ChatMode(agent.options, ui, agent.storage)
            asyncio.run(chat.run())
        except KeyboardInterrupt:
            print("\n\n⚠️  Chat interrupted by user")
            sys.exit(0)
        except Exception as e:
            print(f"\n❌ Error: {e}")
            if args.debug:
                raise
            sys.exit(1)
        return
    # Single-shot mode - requires prompt
    if not args.prompt:
        parser.print_help()
        print("\nExample usage:")
        print('  # Single-shot mode:')
        print('  crawl-agent "Scrape all products from example.com with price > $10"')
        print('  crawl-agent --add-dir ~/projects "Find all Python files and analyze imports"')
        print()
        print('  # Interactive chat mode:')
        print('  crawl-agent --chat')
        sys.exit(1)
    try:
        agent = CrawlAgent(args)
        asyncio.run(agent.run(args.prompt))
    except KeyboardInterrupt:
        print("\n\n⚠️  Interrupted by user")
        sys.exit(0)
    except Exception as e:
        print(f"\n❌ Error: {e}")
        if args.debug:
            raise
        sys.exit(1)
 if __name__ == "__main__":
    main()
--- a/crawl4ai/agent/browser_manager.py
+++ b/crawl4ai/agent/browser_manager.py
@@ -0,0 +1,73 @@
 """Browser session management with singleton pattern for persistent browser instances."""
 from typing import Optional
 from crawl4ai import AsyncWebCrawler, BrowserConfig
 class BrowserManager:
    """Singleton browser manager for persistent browser sessions across agent operations."""
    _instance: Optional['BrowserManager'] = None
    _crawler: Optional[AsyncWebCrawler] = None
    _config: Optional[BrowserConfig] = None
    def __new__(cls):
        if cls._instance is None:
            cls._instance = super().__new__(cls)
        return cls._instance
    @classmethod
    async def get_browser(cls, config: Optional[BrowserConfig] = None) -> AsyncWebCrawler:
        """
        Get or create the singleton browser instance.
        Args:
            config: Optional browser configuration. Only used if no browser exists yet.
                   To change config, use reconfigure_browser() instead.
        Returns:
            AsyncWebCrawler instance
        """
        # Create new browser if needed
        if cls._crawler is None:
            # Create default config if none provided
            if config is None:
                config = BrowserConfig(headless=True, verbose=False)
            cls._crawler = AsyncWebCrawler(config=config)
            await cls._crawler.start()
            cls._config = config
        return cls._crawler
    @classmethod
    async def reconfigure_browser(cls, new_config: BrowserConfig) -> AsyncWebCrawler:
        """
        Close current browser and create a new one with different configuration.
        Args:
            new_config: New browser configuration
        Returns:
            New AsyncWebCrawler instance
        """
        await cls.close_browser()
        return await cls.get_browser(new_config)
    @classmethod
    async def close_browser(cls):
        """Close the current browser instance and cleanup."""
        if cls._crawler is not None:
            await cls._crawler.close()
            cls._crawler = None
            cls._config = None
    @classmethod
    def is_browser_active(cls) -> bool:
        """Check if browser is currently active."""
        return cls._crawler is not None
    @classmethod
    def get_current_config(cls) -> Optional[BrowserConfig]:
        """Get the current browser configuration."""
        return cls._config
--- a/crawl4ai/agent/c4ai_prompts.py
+++ b/crawl4ai/agent/c4ai_prompts.py
@@ -0,0 +1,137 @@
 # c4ai_prompts.py
 """System prompts for Crawl4AI agent."""
 SYSTEM_PROMPT = """You are an expert web crawling and browser automation agent powered by Crawl4AI.
 # Core Capabilities
 You can perform sophisticated multi-step web scraping and automation tasks through two modes:
 ## Quick Mode (simple tasks)
 - Use `quick_crawl` for single-page data extraction
 - Best for: simple scrapes, getting page content, one-time extractions
 ## Session Mode (complex tasks)
 - Use `start_session` to create persistent browser sessions
 - Navigate, interact, extract data across multiple pages
 - Essential for: workflows requiring JS execution, pagination, filtering, multi-step automation
 # Tool Usage Patterns
 ## Simple Extraction
 1. Use `quick_crawl` with appropriate output_format
 2. Provide extraction_schema for structured data
 ## Multi-Step Workflow
 1. `start_session` - Create browser session with unique ID
 2. `navigate` - Go to target URL
 3. `execute_js` - Interact with page (click buttons, scroll, fill forms)
 4. `extract_data` - Get data using schema or markdown
 5. Repeat steps 2-4 as needed
 6. `close_session` - Clean up when done
 # Critical Instructions
 1. **Iteration & Validation**: When tasks require filtering or conditional logic:
   - Extract data first, analyze results
   - Filter/validate in your reasoning
   - Make subsequent tool calls based on validation
   - Continue until task criteria are met
 2. **Structured Extraction**: Always use JSON schemas for structured data:
   ```json
   {
     "type": "object",
     "properties": {
       "field_name": {"type": "string"},
       "price": {"type": "number"}
     }
   }
   ```
 3. **Session Management**:
   - Generate unique session IDs (e.g., "product_scrape_001")
   - Always close sessions when done
   - Use sessions for tasks requiring multiple page visits
 4. **JavaScript Execution**:
   - Use for: clicking buttons, scrolling, waiting for dynamic content
   - Example: `js_code: "document.querySelector('.load-more').click()"`
   - Combine with `wait_for` to ensure content loads
 5. **Error Handling**:
   - Check `success` field in all responses
   - Retry with different strategies if extraction fails
   - Report specific errors to user
 6. **Data Persistence**:
   - Save results using `Write` tool to JSON files
   - Use descriptive filenames with timestamps
   - Structure data clearly for user consumption
 # Example Workflows
 ## Workflow 1: Filter & Crawl
 Task: "Find products >$10, crawl each, extract details"
 1. `quick_crawl` product listing page with schema for [name, price, url]
 2. Analyze results, filter price > 10 in reasoning
 3. `start_session` for detailed crawling
 4. For each filtered product:
   - `navigate` to product URL
   - `extract_data` with detail schema
 5. Aggregate results
 6. `close_session`
 7. `Write` results to JSON
 ## Workflow 2: Paginated Scraping
 Task: "Scrape all items across multiple pages"
 1. `start_session`
 2. `navigate` to page 1
 3. `extract_data` items from current page
 4. Check for "next" button
 5. `execute_js` to click next
 6. Repeat 3-5 until no more pages
 7. `close_session`
 8. Save aggregated data
 ## Workflow 3: Dynamic Content
 Task: "Scrape reviews after clicking 'Load More'"
 1. `start_session`
 2. `navigate` to product page
 3. `execute_js` to click load more button
 4. `wait_for` reviews container
 5. `extract_data` all reviews
 6. `close_session`
 # Quality Guidelines
 - **Be thorough**: Don't stop until task requirements are fully met
 - **Validate data**: Check extracted data matches expected format
 - **Handle edge cases**: Empty results, pagination limits, rate limiting
 - **Clear reporting**: Summarize what was found, any issues encountered
 - **Efficient**: Use quick_crawl when possible, sessions only when needed
 # Output Format
 When saving data, use clean JSON structure:
 ```json
 {
  "metadata": {
    "scraped_at": "ISO timestamp",
    "source_url": "...",
    "total_items": 0
  },
  "data": [...]
 }
 ```
 Always provide a final summary of:
 - Items found/processed
 - Time taken
 - Files created
 - Any warnings/errors
 Remember: You have unlimited turns to complete the task. Take your time, validate each step, and ensure quality results."""
--- a/crawl4ai/agent/c4ai_tools.py
+++ b/crawl4ai/agent/c4ai_tools.py
@@ -0,0 +1,314 @@
 # c4ai_tools.py
 """Crawl4AI tools for Claude Code SDK agent."""
 import json
 import asyncio
 from typing import Any, Dict
 from crawl4ai import AsyncWebCrawler, BrowserConfig, CrawlerRunConfig, CacheMode
 from crawl4ai.extraction_strategy import LLMExtractionStrategy
 from claude_agent_sdk import tool
 from .browser_manager import BrowserManager
 # Global session storage (for named sessions only)
 CRAWLER_SESSIONS: Dict[str, AsyncWebCrawler] = {}
 CRAWLER_SESSION_URLS: Dict[str, str] = {}  # Track current URL per session
@tool("quick_crawl", "One-shot crawl for simple extraction. Returns markdown, HTML, or structured data.", {
    "url": str,
    "output_format": str,  # "markdown" | "html" | "structured" | "screenshot"
    "extraction_schema": str,  # Optional: JSON schema for structured extraction
    "js_code": str,  # Optional: JavaScript to execute before extraction
    "wait_for": str,  # Optional: CSS selector to wait for
 })
 async def quick_crawl(args: Dict[str, Any]) -> Dict[str, Any]:
    """Fast single-page crawl using persistent browser."""
    # Use singleton browser manager
    crawler_config = BrowserConfig(headless=True, verbose=False)
    crawler = await BrowserManager.get_browser(crawler_config)
    run_config = CrawlerRunConfig(
        cache_mode=CacheMode.BYPASS,
        js_code=args.get("js_code"),
        wait_for=args.get("wait_for"),
    )
    # Add extraction strategy if structured data requested
    if args.get("extraction_schema"):
        run_config.extraction_strategy = LLMExtractionStrategy(
            provider="openai/gpt-4o-mini",
            schema=json.loads(args["extraction_schema"]),
            instruction="Extract data according to the provided schema."
        )
    result = await crawler.arun(url=args["url"], config=run_config)
    if not result.success:
        return {
            "content": [{
                "type": "text",
                "text": json.dumps({"error": result.error_message, "success": False})
            }]
        }
    # Handle markdown - can be string or MarkdownGenerationResult object
    markdown_content = ""
    if isinstance(result.markdown, str):
        markdown_content = result.markdown
    elif hasattr(result.markdown, 'raw_markdown'):
        markdown_content = result.markdown.raw_markdown
    output_map = {
        "markdown": markdown_content,
        "html": result.html,
        "structured": result.extracted_content,
        "screenshot": result.screenshot,
    }
    response = {
        "success": True,
        "url": result.url,
        "data": output_map.get(args["output_format"], markdown_content)
    }
    return {"content": [{"type": "text", "text": json.dumps(response, indent=2)}]}
@tool("start_session", "Start a named browser session for multi-step crawling and automation.", {
    "session_id": str,
    "headless": bool,  # Default True
 })
 async def start_session(args: Dict[str, Any]) -> Dict[str, Any]:
    """Initialize a named crawler session using the singleton browser."""
    session_id = args["session_id"]
    if session_id in CRAWLER_SESSIONS:
        return {"content": [{"type": "text", "text": json.dumps({
            "error": f"Session {session_id} already exists",
            "success": False
        })}]}
    # Use the singleton browser
    crawler_config = BrowserConfig(
        headless=args.get("headless", True),
        verbose=False
    )
    crawler = await BrowserManager.get_browser(crawler_config)
    # Store reference for named session
    CRAWLER_SESSIONS[session_id] = crawler
    return {"content": [{"type": "text", "text": json.dumps({
        "success": True,
        "session_id": session_id,
        "message": f"Browser session {session_id} started"
    })}]}
@tool("navigate", "Navigate to a URL in an active session.", {
    "session_id": str,
    "url": str,
    "wait_for": str,  # Optional: CSS selector to wait for
    "js_code": str,  # Optional: JavaScript to execute after load
 })
 async def navigate(args: Dict[str, Any]) -> Dict[str, Any]:
    """Navigate to URL in session."""
    session_id = args["session_id"]
    if session_id not in CRAWLER_SESSIONS:
        return {"content": [{"type": "text", "text": json.dumps({
            "error": f"Session {session_id} not found",
            "success": False
        })}]}
    crawler = CRAWLER_SESSIONS[session_id]
    run_config = CrawlerRunConfig(
        cache_mode=CacheMode.BYPASS,
        wait_for=args.get("wait_for"),
        js_code=args.get("js_code"),
    )
    result = await crawler.arun(url=args["url"], config=run_config)
    # Store current URL for this session
    if result.success:
        CRAWLER_SESSION_URLS[session_id] = result.url
    return {"content": [{"type": "text", "text": json.dumps({
        "success": result.success,
        "url": result.url,
        "message": f"Navigated to {args['url']}"
    })}]}
@tool("extract_data", "Extract data from current page in session using schema or return markdown.", {
    "session_id": str,
    "output_format": str,  # "markdown" | "structured"
    "extraction_schema": str,  # Required for structured, JSON schema
    "wait_for": str,  # Optional: Wait for element before extraction
    "js_code": str,  # Optional: Execute JS before extraction
 })
 async def extract_data(args: Dict[str, Any]) -> Dict[str, Any]:
    """Extract data from current page."""
    session_id = args["session_id"]
    if session_id not in CRAWLER_SESSIONS:
        return {"content": [{"type": "text", "text": json.dumps({
            "error": f"Session {session_id} not found",
            "success": False
        })}]}
    # Check if we have a current URL for this session
    if session_id not in CRAWLER_SESSION_URLS:
        return {"content": [{"type": "text", "text": json.dumps({
            "error": "No page loaded in session. Use 'navigate' first.",
            "success": False
        })}]}
    crawler = CRAWLER_SESSIONS[session_id]
    current_url = CRAWLER_SESSION_URLS[session_id]
    run_config = CrawlerRunConfig(
        cache_mode=CacheMode.BYPASS,
        wait_for=args.get("wait_for"),
        js_code=args.get("js_code"),
    )
    if args["output_format"] == "structured" and args.get("extraction_schema"):
        run_config.extraction_strategy = LLMExtractionStrategy(
            provider="openai/gpt-4o-mini",
            schema=json.loads(args["extraction_schema"]),
            instruction="Extract data according to schema."
        )
    result = await crawler.arun(url=current_url, config=run_config)
    if not result.success:
        return {"content": [{"type": "text", "text": json.dumps({
            "error": result.error_message,
            "success": False
        })}]}
    # Handle markdown - can be string or MarkdownGenerationResult object
    markdown_content = ""
    if isinstance(result.markdown, str):
        markdown_content = result.markdown
    elif hasattr(result.markdown, 'raw_markdown'):
        markdown_content = result.markdown.raw_markdown
    data = (result.extracted_content if args["output_format"] == "structured"
            else markdown_content)
    return {"content": [{"type": "text", "text": json.dumps({
        "success": True,
        "data": data
    }, indent=2)}]}
@tool("execute_js", "Execute JavaScript in the current page context.", {
    "session_id": str,
    "js_code": str,
    "wait_for": str,  # Optional: Wait for element after execution
 })
 async def execute_js(args: Dict[str, Any]) -> Dict[str, Any]:
    """Execute JavaScript in session."""
    session_id = args["session_id"]
    if session_id not in CRAWLER_SESSIONS:
        return {"content": [{"type": "text", "text": json.dumps({
            "error": f"Session {session_id} not found",
            "success": False
        })}]}
    # Check if we have a current URL for this session
    if session_id not in CRAWLER_SESSION_URLS:
        return {"content": [{"type": "text", "text": json.dumps({
            "error": "No page loaded in session. Use 'navigate' first.",
            "success": False
        })}]}
    crawler = CRAWLER_SESSIONS[session_id]
    current_url = CRAWLER_SESSION_URLS[session_id]
    run_config = CrawlerRunConfig(
        cache_mode=CacheMode.BYPASS,
        js_code=args["js_code"],
        wait_for=args.get("wait_for"),
    )
    result = await crawler.arun(url=current_url, config=run_config)
    return {"content": [{"type": "text", "text": json.dumps({
        "success": result.success,
        "message": "JavaScript executed"
    })}]}
@tool("screenshot", "Take a screenshot of the current page.", {
    "session_id": str,
 })
 async def screenshot(args: Dict[str, Any]) -> Dict[str, Any]:
    """Capture screenshot."""
    session_id = args["session_id"]
    if session_id not in CRAWLER_SESSIONS:
        return {"content": [{"type": "text", "text": json.dumps({
            "error": f"Session {session_id} not found",
            "success": False
        })}]}
    # Check if we have a current URL for this session
    if session_id not in CRAWLER_SESSION_URLS:
        return {"content": [{"type": "text", "text": json.dumps({
            "error": "No page loaded in session. Use 'navigate' first.",
            "success": False
        })}]}
    crawler = CRAWLER_SESSIONS[session_id]
    current_url = CRAWLER_SESSION_URLS[session_id]
    result = await crawler.arun(
        url=current_url,
        config=CrawlerRunConfig(cache_mode=CacheMode.BYPASS, screenshot=True)
    )
    return {"content": [{"type": "text", "text": json.dumps({
        "success": True,
        "screenshot": result.screenshot if result.success else None
    })}]}
@tool("close_session", "Close and cleanup a named browser session.", {
    "session_id": str,
 })
 async def close_session(args: Dict[str, Any]) -> Dict[str, Any]:
    """Close named crawler session (browser stays alive for other operations)."""
    session_id = args["session_id"]
    if session_id not in CRAWLER_SESSIONS:
        return {"content": [{"type": "text", "text": json.dumps({
            "error": f"Session {session_id} not found",
            "success": False
        })}]}
    # Remove from named sessions, but don't close the singleton browser
    CRAWLER_SESSIONS.pop(session_id)
    CRAWLER_SESSION_URLS.pop(session_id, None)  # Remove URL tracking
    return {"content": [{"type": "text", "text": json.dumps({
        "success": True,
        "message": f"Session {session_id} closed"
    })}]}
 # Export all tools
 CRAWL_TOOLS = [
    quick_crawl,
    start_session,
    navigate,
    extract_data,
    execute_js,
    screenshot,
    close_session,
 ]
--- a/crawl4ai/agent/chat_mode.py
+++ b/crawl4ai/agent/chat_mode.py
@@ -0,0 +1,166 @@
 """Chat mode implementation with streaming message generator for Claude SDK."""
 import asyncio
 from typing import AsyncGenerator, Dict, Any, Optional
 from claude_agent_sdk import ClaudeSDKClient, ClaudeAgentOptions, AssistantMessage, TextBlock, ResultMessage, ToolUseBlock
 from .terminal_ui import TerminalUI
 from .browser_manager import BrowserManager
 class ChatMode:
    """Interactive chat mode with streaming input/output."""
    def __init__(self, options: ClaudeAgentOptions, ui: TerminalUI, storage):
        self.options = options
        self.ui = ui
        self.storage = storage
        self._exit_requested = False
        self._current_streaming_text = ""
    async def message_generator(self) -> AsyncGenerator[Dict[str, Any], None]:
        """
        Generate user messages as async generator (streaming input mode per cc_stream.md).
        Yields messages in the format:
        {
            "type": "user",
            "message": {
                "role": "user",
                "content": "user input text"
            }
        }
        """
        while not self._exit_requested:
            try:
                # Get user input
                user_input = await asyncio.to_thread(self.ui.get_user_input)
                # Handle commands
                if user_input.startswith('/'):
                    await self._handle_command(user_input)
                    if self._exit_requested:
                        break
                    continue
                # Skip empty input
                if not user_input.strip():
                    continue
                # Log user message
                self.storage.log("user_message", {"text": user_input})
                # Yield user message for agent
                yield {
                    "type": "user",
                    "message": {
                        "role": "user",
                        "content": user_input
                    }
                }
            except KeyboardInterrupt:
                self._exit_requested = True
                break
            except Exception as e:
                self.ui.print_error(f"Input error: {e}")
    async def _handle_command(self, command: str):
        """Handle special chat commands."""
        cmd = command.lower().strip()
        if cmd == '/exit' or cmd == '/quit':
            self._exit_requested = True
            self.ui.print_info("Exiting chat mode...")
        elif cmd == '/clear':
            self.ui.clear_screen()
        elif cmd == '/help':
            self.ui.show_commands()
        elif cmd == '/browser':
            # Show browser status
            if BrowserManager.is_browser_active():
                config = BrowserManager.get_current_config()
                self.ui.print_info(f"Browser active: {config}")
            else:
                self.ui.print_info("No browser instance active")
        else:
            self.ui.print_error(f"Unknown command: {command}")
    async def run(self):
        """Run the interactive chat loop with streaming responses."""
        # Show header
        self.ui.show_header(
            session_id=str(self.options.session_id or "chat"),
            log_path=self.storage.get_session_path() if hasattr(self.storage, 'get_session_path') else "N/A"
        )
        self.ui.show_commands()
        try:
            async with ClaudeSDKClient(options=self.options) as client:
                # Start streaming input mode
                await client.query(self.message_generator())
                # Process streaming responses
                turn = 0
                async for message in client.receive_messages():
                    turn += 1
                    if isinstance(message, AssistantMessage):
                        # Clear "thinking" line if we printed it
                        if self._current_streaming_text:
                            self.ui.console.print()  # New line after streaming
                        self._current_streaming_text = ""
                        # Process message content blocks
                        for block in message.content:
                            if isinstance(block, TextBlock):
                                # Stream text as it arrives
                                self.ui.print_agent_text(block.text)
                                self._current_streaming_text += block.text
                                # Log assistant message
                                self.storage.log("assistant_message", {
                                    "turn": turn,
                                    "text": block.text
                                })
                            elif isinstance(block, ToolUseBlock):
                                # Show tool usage
                                self.ui.print_tool_use(block.name)
                    elif isinstance(message, ResultMessage):
                        # Session completed (user exited or error)
                        if message.is_error:
                            self.ui.print_error(f"Agent error: {message.result}")
                        else:
                            self.ui.print_session_summary(
                                duration_s=message.duration_ms / 1000 if message.duration_ms else 0,
                                turns=message.num_turns,
                                cost_usd=message.total_cost_usd
                            )
                        # Log session end
                        self.storage.log("session_end", {
                            "duration_ms": message.duration_ms,
                            "cost_usd": message.total_cost_usd,
                            "turns": message.num_turns,
                            "success": not message.is_error
                        })
                        break
        except KeyboardInterrupt:
            self.ui.print_info("\nChat interrupted by user")
        except Exception as e:
            self.ui.print_error(f"Chat error: {e}")
            raise
        finally:
            # Cleanup browser on exit
            await BrowserManager.close_browser()
            self.ui.print_info("Browser closed")
--- a/crawl4ai/agent/terminal_ui.py
+++ b/crawl4ai/agent/terminal_ui.py
@@ -0,0 +1,115 @@
 """Terminal UI components using Rich for beautiful agent output."""
 from rich.console import Console
 from rich.markdown import Markdown
 from rich.syntax import Syntax
 from rich.panel import Panel
 from rich.live import Live
 from rich.spinner import Spinner
 from rich.text import Text
 from rich.prompt import Prompt
 from rich.rule import Rule
 class TerminalUI:
    """Rich-based terminal interface for the Crawl4AI agent."""
    def __init__(self):
        self.console = Console()
        self._current_text = ""
    def show_header(self, session_id: str, log_path: str):
        """Display agent session header."""
        self.console.print()
        self.console.print(Panel.fit(
            "[bold cyan]🕷️  Crawl4AI Agent - Chat Mode[/bold cyan]",
            border_style="cyan"
        ))
        self.console.print(f"[dim]📁 Session: {session_id}[/dim]")
        self.console.print(f"[dim]💾 Log: {log_path}[/dim]")
        self.console.print()
    def show_commands(self):
        """Display available commands."""
        self.console.print("\n[dim]Commands:[/dim]")
        self.console.print("  [cyan]/exit[/cyan] - Exit chat")
        self.console.print("  [cyan]/clear[/cyan] - Clear screen")
        self.console.print("  [cyan]/help[/cyan] - Show this help\n")
    def get_user_input(self) -> str:
        """Get user input with styled prompt."""
        return Prompt.ask("\n[bold green]You[/bold green]")
    def print_separator(self):
        """Print a visual separator."""
        self.console.print(Rule(style="dim"))
    def print_thinking(self):
        """Show thinking indicator."""
        self.console.print("\n[cyan]Agent:[/cyan] [dim]thinking...[/dim]", end="")
    def print_agent_text(self, text: str, stream: bool = False):
        """
        Print agent response text.
        Args:
            text: Text to print
            stream: If True, append to current streaming output
        """
        if stream:
            # For streaming, just print without newline
            self.console.print(f"\r[cyan]Agent:[/cyan] {text}", end="")
        else:
            # For complete messages
            self.console.print(f"\n[cyan]Agent:[/cyan] {text}")
    def print_markdown(self, markdown_text: str):
        """Render markdown content."""
        self.console.print()
        self.console.print(Markdown(markdown_text))
    def print_code(self, code: str, language: str = "python"):
        """Render code with syntax highlighting."""
        self.console.print()
        self.console.print(Syntax(code, language, theme="monokai", line_numbers=True))
    def print_error(self, error_msg: str):
        """Display error message."""
        self.console.print(f"\n[bold red]Error:[/bold red] {error_msg}")
    def print_success(self, msg: str):
        """Display success message."""
        self.console.print(f"\n[bold green]✓[/bold green] {msg}")
    def print_info(self, msg: str):
        """Display info message."""
        self.console.print(f"\n[bold blue]ℹ[/bold blue] {msg}")
    def clear_screen(self):
        """Clear the terminal screen."""
        self.console.clear()
    def print_session_summary(self, duration_s: float, turns: int, cost_usd: float = None):
        """Display session completion summary."""
        self.console.print()
        self.console.print(Panel(
            f"[green]✅ Completed[/green]\n"
            f"⏱ Duration: {duration_s:.2f}s\n"
            f"🔄 Turns: {turns}\n"
            + (f"💰 Cost: ${cost_usd:.4f}" if cost_usd else ""),
            border_style="green"
        ))
    def print_tool_use(self, tool_name: str):
        """Indicate tool usage."""
        self.console.print(f"\n[dim]🔧 Using tool: {tool_name}[/dim]")
    def with_spinner(self, text: str = "Processing..."):
        """
        Context manager for showing a spinner.
        Usage:
            with ui.with_spinner("Crawling page..."):
                # do work
        """
        return self.console.status(f"[cyan]{text}[/cyan]", spinner="dots")
--- a/crawl4ai/agent/test_chat.py
+++ b/crawl4ai/agent/test_chat.py
@@ -0,0 +1,114 @@
 #!/usr/bin/env python
 """Test script to verify chat mode setup (non-interactive)."""
 import sys
 import asyncio
 from pathlib import Path
 # Add parent to path for imports
 sys.path.insert(0, str(Path(__file__).parent.parent.parent))
 from crawl4ai.agent.browser_manager import BrowserManager
 from crawl4ai.agent.terminal_ui import TerminalUI
 from crawl4ai.agent.chat_mode import ChatMode
 from crawl4ai.agent.c4ai_tools import CRAWL_TOOLS
 from crawl4ai.agent.c4ai_prompts import SYSTEM_PROMPT
 from claude_agent_sdk import ClaudeAgentOptions, create_sdk_mcp_server
 class MockStorage:
    """Mock storage for testing."""
    def log(self, event_type: str, data: dict):
        print(f"[LOG] {event_type}: {data}")
    def get_session_path(self):
        return "/tmp/test_session.jsonl"
 async def test_components():
    """Test individual components."""
    print("="*60)
    print("CHAT MODE COMPONENT TESTS")
    print("="*60)
    # Test 1: BrowserManager
    print("\n[TEST 1] BrowserManager singleton")
    try:
        browser1 = await BrowserManager.get_browser()
        browser2 = await BrowserManager.get_browser()
        assert browser1 is browser2, "Browser instances should be same (singleton)"
        print("✓ BrowserManager singleton works")
        await BrowserManager.close_browser()
    except Exception as e:
        print(f"✗ BrowserManager failed: {e}")
        return False
    # Test 2: TerminalUI
    print("\n[TEST 2] TerminalUI rendering")
    try:
        ui = TerminalUI()
        ui.show_header("test-123", "/tmp/test.log")
        ui.print_agent_text("Hello from agent")
        ui.print_markdown("# Test\nThis is **bold**")
        ui.print_success("Test success message")
        print("✓ TerminalUI renders correctly")
    except Exception as e:
        print(f"✗ TerminalUI failed: {e}")
        return False
    # Test 3: MCP Server Setup
    print("\n[TEST 3] MCP Server with tools")
    try:
        crawler_server = create_sdk_mcp_server(
            name="crawl4ai",
            version="1.0.0",
            tools=CRAWL_TOOLS
        )
        print(f"✓ MCP server created with {len(CRAWL_TOOLS)} tools")
    except Exception as e:
        print(f"✗ MCP Server failed: {e}")
        return False
    # Test 4: ChatMode instantiation
    print("\n[TEST 4] ChatMode instantiation")
    try:
        options = ClaudeAgentOptions(
            mcp_servers={"crawler": crawler_server},
            allowed_tools=[
                "mcp__crawler__quick_crawl",
                "mcp__crawler__start_session",
                "mcp__crawler__navigate",
                "mcp__crawler__extract_data",
                "mcp__crawler__execute_js",
                "mcp__crawler__screenshot",
                "mcp__crawler__close_session",
            ],
            system_prompt=SYSTEM_PROMPT,
            permission_mode="acceptEdits"
        )
        ui = TerminalUI()
        storage = MockStorage()
        chat = ChatMode(options, ui, storage)
        print("✓ ChatMode instance created successfully")
    except Exception as e:
        print(f"✗ ChatMode failed: {e}")
        import traceback
        traceback.print_exc()
        return False
    print("\n" + "="*60)
    print("ALL COMPONENT TESTS PASSED ✓")
    print("="*60)
    print("\nTo test interactive chat mode, run:")
    print("  python -m crawl4ai.agent.agent_crawl --chat")
    return True
 if __name__ == "__main__":
    success = asyncio.run(test_components())
    sys.exit(0 if success else 1)
--- a/crawl4ai/agent/test_scenarios.py
+++ b/crawl4ai/agent/test_scenarios.py
@@ -0,0 +1,524 @@
 #!/usr/bin/env python
 """
 Automated multi-turn chat scenario tests for Crawl4AI Agent.
 Tests agent's ability to handle complex conversations, maintain state,
 plan and execute tasks without human interaction.
 """
 import asyncio
 import json
 import time
 from pathlib import Path
 from typing import List, Dict, Any, Optional
 from dataclasses import dataclass
 from enum import Enum
 from claude_agent_sdk import ClaudeSDKClient, ClaudeAgentOptions, create_sdk_mcp_server
 from claude_agent_sdk import AssistantMessage, TextBlock, ResultMessage, ToolUseBlock
 from .c4ai_tools import CRAWL_TOOLS
 from .c4ai_prompts import SYSTEM_PROMPT
 from .browser_manager import BrowserManager
 class TurnResult(Enum):
    """Result of a single conversation turn."""
    PASS = "PASS"
    FAIL = "FAIL"
    TIMEOUT = "TIMEOUT"
    ERROR = "ERROR"
@dataclass
 class TurnExpectation:
    """Expectations for a single conversation turn."""
    user_message: str
    expect_tools: Optional[List[str]] = None  # Tools that should be called
    expect_keywords: Optional[List[str]] = None  # Keywords in response
    expect_files_created: Optional[List[str]] = None  # File patterns created
    expect_success: bool = True  # Should complete without error
    expect_min_turns: int = 1  # Minimum agent turns to complete
    timeout_seconds: int = 60
@dataclass
 class Scenario:
    """A complete multi-turn conversation scenario."""
    name: str
    category: str  # "simple", "medium", "complex"
    description: str
    turns: List[TurnExpectation]
    cleanup_files: Optional[List[str]] = None  # Files to cleanup after test
 # =============================================================================
 # TEST SCENARIOS - Categorized from Simple to Complex
 # =============================================================================
 SIMPLE_SCENARIOS = [
    Scenario(
        name="Single quick crawl",
        category="simple",
        description="Basic one-shot crawl with markdown extraction",
        turns=[
            TurnExpectation(
                user_message="Use quick_crawl to get the title from example.com",
                expect_tools=["mcp__crawler__quick_crawl"],
                expect_keywords=["Example Domain", "title"],
                timeout_seconds=30
            )
        ]
    ),
    Scenario(
        name="Session lifecycle",
        category="simple",
        description="Start session, navigate, close - basic session management",
        turns=[
            TurnExpectation(
                user_message="Start a session named 'simple_test'",
                expect_tools=["mcp__crawler__start_session"],
                expect_keywords=["session", "started"],
                timeout_seconds=20
            ),
            TurnExpectation(
                user_message="Navigate to example.com",
                expect_tools=["mcp__crawler__navigate"],
                expect_keywords=["navigated", "example.com"],
                timeout_seconds=25
            ),
            TurnExpectation(
                user_message="Close the session",
                expect_tools=["mcp__crawler__close_session"],
                expect_keywords=["closed"],
                timeout_seconds=15
            )
        ]
    ),
 ]
 MEDIUM_SCENARIOS = [
    Scenario(
        name="Multi-page crawl with file output",
        category="medium",
        description="Crawl multiple pages and save results to file",
        turns=[
            TurnExpectation(
                user_message="Crawl example.com and example.org, extract titles from both",
                expect_tools=["mcp__crawler__quick_crawl"],
                expect_min_turns=2,  # Should make 2 separate crawls
                timeout_seconds=45
            ),
            TurnExpectation(
                user_message="Save the results to a JSON file called crawl_results.json",
                expect_tools=["Write"],
                expect_files_created=["crawl_results.json"],
                timeout_seconds=20
            )
        ],
        cleanup_files=["crawl_results.json"]
    ),
    Scenario(
        name="Session-based data extraction",
        category="medium",
        description="Use session to navigate and extract data in steps",
        turns=[
            TurnExpectation(
                user_message="Start session 'extract_test', navigate to example.com, and extract the markdown",
                expect_tools=["mcp__crawler__start_session", "mcp__crawler__navigate", "mcp__crawler__extract_data"],
                expect_keywords=["Example Domain"],
                timeout_seconds=50
            ),
            TurnExpectation(
                user_message="Now save that markdown to example_content.md",
                expect_tools=["Write"],
                expect_files_created=["example_content.md"],
                timeout_seconds=20
            ),
            TurnExpectation(
                user_message="Close the session",
                expect_tools=["mcp__crawler__close_session"],
                timeout_seconds=15
            )
        ],
        cleanup_files=["example_content.md"]
    ),
    Scenario(
        name="Context retention across turns",
        category="medium",
        description="Agent should remember previous context",
        turns=[
            TurnExpectation(
                user_message="Crawl example.com and tell me the title",
                expect_tools=["mcp__crawler__quick_crawl"],
                expect_keywords=["Example Domain"],
                timeout_seconds=30
            ),
            TurnExpectation(
                user_message="What was the URL I just asked you to crawl?",
                expect_keywords=["example.com"],
                expect_tools=[],  # Should answer from memory, no tools needed
                timeout_seconds=15
            )
        ]
    ),
 ]
 COMPLEX_SCENARIOS = [
    Scenario(
        name="Multi-step task with planning",
        category="complex",
        description="Complex task requiring agent to plan, execute, and verify",
        turns=[
            TurnExpectation(
                user_message="Crawl example.com and example.org, compare their content, and create a markdown report with: 1) titles of both, 2) word count comparison, 3) save to comparison_report.md",
                expect_tools=["mcp__crawler__quick_crawl", "Write"],
                expect_files_created=["comparison_report.md"],
                expect_min_turns=3,  # Plan, crawl both, write report
                timeout_seconds=90
            ),
            TurnExpectation(
                user_message="Read back the report you just created",
                expect_tools=["Read"],
                expect_keywords=["Example Domain"],
                timeout_seconds=20
            )
        ],
        cleanup_files=["comparison_report.md"]
    ),
    Scenario(
        name="Session with state manipulation",
        category="complex",
        description="Complex session workflow with multiple operations",
        turns=[
            TurnExpectation(
                user_message="Start session 'complex_session' and navigate to example.com",
                expect_tools=["mcp__crawler__start_session", "mcp__crawler__navigate"],
                timeout_seconds=30
            ),
            TurnExpectation(
                user_message="Extract the page content and count how many times the word 'example' appears (case insensitive)",
                expect_tools=["mcp__crawler__extract_data"],
                expect_keywords=["example"],
                timeout_seconds=30
            ),
            TurnExpectation(
                user_message="Take a screenshot of the current page",
                expect_tools=["mcp__crawler__screenshot"],
                expect_keywords=["screenshot"],
                timeout_seconds=25
            ),
            TurnExpectation(
                user_message="Close the session",
                expect_tools=["mcp__crawler__close_session"],
                timeout_seconds=15
            )
        ]
    ),
    Scenario(
        name="Error recovery and continuation",
        category="complex",
        description="Agent should handle errors gracefully and continue",
        turns=[
            TurnExpectation(
                user_message="Crawl https://this-site-definitely-does-not-exist-12345.com",
                expect_success=False,  # Should fail gracefully
                expect_keywords=["error", "fail"],
                timeout_seconds=30
            ),
            TurnExpectation(
                user_message="That's okay, crawl example.com instead",
                expect_tools=["mcp__crawler__quick_crawl"],
                expect_keywords=["Example Domain"],
                timeout_seconds=30
            )
        ]
    ),
 ]
 # Combine all scenarios
 ALL_SCENARIOS = SIMPLE_SCENARIOS + MEDIUM_SCENARIOS + COMPLEX_SCENARIOS
 # =============================================================================
 # TEST RUNNER
 # =============================================================================
 class ScenarioRunner:
    """Runs automated chat scenarios without human interaction."""
    def __init__(self, working_dir: Path):
        self.working_dir = working_dir
        self.results = []
    async def run_scenario(self, scenario: Scenario) -> Dict[str, Any]:
        """Run a single scenario and return results."""
        print(f"\n{'='*70}")
        print(f"[{scenario.category.upper()}] {scenario.name}")
        print(f"{'='*70}")
        print(f"Description: {scenario.description}\n")
        start_time = time.time()
        turn_results = []
        try:
            # Setup agent options
            crawler_server = create_sdk_mcp_server(
                name="crawl4ai",
                version="1.0.0",
                tools=CRAWL_TOOLS
            )
            options = ClaudeAgentOptions(
                mcp_servers={"crawler": crawler_server},
                allowed_tools=[
                    "mcp__crawler__quick_crawl",
                    "mcp__crawler__start_session",
                    "mcp__crawler__navigate",
                    "mcp__crawler__extract_data",
                    "mcp__crawler__execute_js",
                    "mcp__crawler__screenshot",
                    "mcp__crawler__close_session",
                    "Read", "Write", "Edit", "Glob", "Grep", "Bash"
                ],
                system_prompt=SYSTEM_PROMPT,
                permission_mode="acceptEdits",
                cwd=str(self.working_dir)
            )
            # Run conversation
            async with ClaudeSDKClient(options=options) as client:
                for turn_idx, expectation in enumerate(scenario.turns, 1):
                    print(f"\nTurn {turn_idx}: {expectation.user_message}")
                    turn_result = await self._run_turn(
                        client, expectation, turn_idx
                    )
                    turn_results.append(turn_result)
                    if turn_result["status"] != TurnResult.PASS:
                        print(f"  ✗ FAILED: {turn_result['reason']}")
                        break
                    else:
                        print(f"  ✓ PASSED")
            # Cleanup
            if scenario.cleanup_files:
                self._cleanup_files(scenario.cleanup_files)
            # Overall result
            all_passed = all(r["status"] == TurnResult.PASS for r in turn_results)
            duration = time.time() - start_time
            result = {
                "scenario": scenario.name,
                "category": scenario.category,
                "status": "PASS" if all_passed else "FAIL",
                "duration_seconds": duration,
                "turns": turn_results
            }
            return result
        except Exception as e:
            print(f"\n✗ SCENARIO ERROR: {e}")
            return {
                "scenario": scenario.name,
                "category": scenario.category,
                "status": "ERROR",
                "error": str(e),
                "duration_seconds": time.time() - start_time,
                "turns": turn_results
            }
        finally:
            # Ensure browser cleanup
            await BrowserManager.close_browser()
    async def _run_turn(
        self,
        client: ClaudeSDKClient,
        expectation: TurnExpectation,
        turn_number: int
    ) -> Dict[str, Any]:
        """Execute a single conversation turn and validate."""
        tools_used = []
        response_text = ""
        agent_turns = 0
        try:
            # Send user message
            await client.query(expectation.user_message)
            # Collect response
            start_time = time.time()
            async for message in client.receive_messages():
                if time.time() - start_time > expectation.timeout_seconds:
                    return {
                        "turn": turn_number,
                        "status": TurnResult.TIMEOUT,
                        "reason": f"Exceeded {expectation.timeout_seconds}s timeout"
                    }
                if isinstance(message, AssistantMessage):
                    agent_turns += 1
                    for block in message.content:
                        if isinstance(block, TextBlock):
                            response_text += block.text + " "
                        elif isinstance(block, ToolUseBlock):
                            tools_used.append(block.name)
                elif isinstance(message, ResultMessage):
                    # Check if error when expecting success
                    if expectation.expect_success and message.is_error:
                        return {
                            "turn": turn_number,
                            "status": TurnResult.FAIL,
                            "reason": f"Agent returned error: {message.result}"
                        }
                    break
            # Validate expectations
            validation = self._validate_turn(
                expectation, tools_used, response_text, agent_turns
            )
            return {
                "turn": turn_number,
                "status": validation["status"],
                "reason": validation.get("reason", "All checks passed"),
                "tools_used": tools_used,
                "agent_turns": agent_turns
            }
        except Exception as e:
            return {
                "turn": turn_number,
                "status": TurnResult.ERROR,
                "reason": f"Exception: {str(e)}"
            }
    def _validate_turn(
        self,
        expectation: TurnExpectation,
        tools_used: List[str],
        response_text: str,
        agent_turns: int
    ) -> Dict[str, Any]:
        """Validate turn results against expectations."""
        # Check expected tools
        if expectation.expect_tools:
            for tool in expectation.expect_tools:
                if tool not in tools_used:
                    return {
                        "status": TurnResult.FAIL,
                        "reason": f"Expected tool '{tool}' was not used"
                    }
        # Check keywords
        if expectation.expect_keywords:
            response_lower = response_text.lower()
            for keyword in expectation.expect_keywords:
                if keyword.lower() not in response_lower:
                    return {
                        "status": TurnResult.FAIL,
                        "reason": f"Expected keyword '{keyword}' not found in response"
                    }
        # Check files created
        if expectation.expect_files_created:
            for pattern in expectation.expect_files_created:
                matches = list(self.working_dir.glob(pattern))
                if not matches:
                    return {
                        "status": TurnResult.FAIL,
                        "reason": f"Expected file matching '{pattern}' was not created"
                    }
        # Check minimum turns
        if agent_turns < expectation.expect_min_turns:
            return {
                "status": TurnResult.FAIL,
                "reason": f"Expected at least {expectation.expect_min_turns} agent turns, got {agent_turns}"
            }
        return {"status": TurnResult.PASS}
    def _cleanup_files(self, patterns: List[str]):
        """Remove files created during test."""
        for pattern in patterns:
            for file_path in self.working_dir.glob(pattern):
                try:
                    file_path.unlink()
                except Exception as e:
                    print(f"  Warning: Could not delete {file_path}: {e}")
 async def run_all_scenarios(working_dir: Optional[Path] = None):
    """Run all test scenarios and report results."""
    if working_dir is None:
        working_dir = Path.cwd() / "test_agent_output"
        working_dir.mkdir(exist_ok=True)
    runner = ScenarioRunner(working_dir)
    print("\n" + "="*70)
    print("CRAWL4AI AGENT SCENARIO TESTS")
    print("="*70)
    print(f"Working directory: {working_dir}")
    print(f"Total scenarios: {len(ALL_SCENARIOS)}")
    print(f"  Simple: {len(SIMPLE_SCENARIOS)}")
    print(f"  Medium: {len(MEDIUM_SCENARIOS)}")
    print(f"  Complex: {len(COMPLEX_SCENARIOS)}")
    results = []
    for scenario in ALL_SCENARIOS:
        result = await runner.run_scenario(scenario)
        results.append(result)
    # Summary
    print("\n" + "="*70)
    print("TEST SUMMARY")
    print("="*70)
    by_category = {"simple": [], "medium": [], "complex": []}
    for result in results:
        by_category[result["category"]].append(result)
    for category in ["simple", "medium", "complex"]:
        cat_results = by_category[category]
        passed = sum(1 for r in cat_results if r["status"] == "PASS")
        total = len(cat_results)
        print(f"\n{category.upper()}: {passed}/{total} passed")
        for r in cat_results:
            status_icon = "✓" if r["status"] == "PASS" else "✗"
            print(f"  {status_icon} {r['scenario']} ({r['duration_seconds']:.1f}s)")
    total_passed = sum(1 for r in results if r["status"] == "PASS")
    total = len(results)
    print(f"\nOVERALL: {total_passed}/{total} scenarios passed ({total_passed/total*100:.1f}%)")
    # Save detailed results
    results_file = working_dir / "test_results.json"
    with open(results_file, "w") as f:
        json.dump(results, f, indent=2)
    print(f"\nDetailed results saved to: {results_file}")
    return total_passed == total
 if __name__ == "__main__":
    import sys
    success = asyncio.run(run_all_scenarios())
    sys.exit(0 if success else 1)
--- a/crawl4ai/agent/test_tools.py
+++ b/crawl4ai/agent/test_tools.py
@@ -0,0 +1,140 @@
 #!/usr/bin/env python
 """Test script for Crawl4AI tools - tests tools directly without the agent."""
 import asyncio
 import json
 from crawl4ai import AsyncWebCrawler, BrowserConfig, CrawlerRunConfig, CacheMode
 async def test_quick_crawl():
    """Test quick_crawl tool logic directly."""
    print("\n" + "="*60)
    print("TEST 1: Quick Crawl - Markdown Format")
    print("="*60)
    crawler_config = BrowserConfig(headless=True, verbose=False)
    run_config = CrawlerRunConfig(cache_mode=CacheMode.BYPASS)
    async with AsyncWebCrawler(config=crawler_config) as crawler:
        result = await crawler.arun(url="https://example.com", config=run_config)
        print(f"Success: {result.success}")
        print(f"URL: {result.url}")
        # Handle markdown - can be string or MarkdownGenerationResult object
        if isinstance(result.markdown, str):
            markdown_content = result.markdown
        elif hasattr(result.markdown, 'raw_markdown'):
            markdown_content = result.markdown.raw_markdown
        else:
            markdown_content = str(result.markdown)
        print(f"Markdown type: {type(result.markdown)}")
        print(f"Markdown length: {len(markdown_content)}")
        print(f"Markdown preview:\n{markdown_content[:300]}")
        return result.success
 async def test_session_workflow():
    """Test session-based workflow."""
    print("\n" + "="*60)
    print("TEST 2: Session-Based Workflow")
    print("="*60)
    crawler_config = BrowserConfig(headless=True, verbose=False)
    # Start session
    crawler = AsyncWebCrawler(config=crawler_config)
    await crawler.__aenter__()
    print("✓ Session started")
    try:
        # Navigate to URL
        run_config = CrawlerRunConfig(cache_mode=CacheMode.BYPASS)
        result = await crawler.arun(url="https://example.com", config=run_config)
        print(f"✓ Navigated to {result.url}, success: {result.success}")
        # Extract data
        if isinstance(result.markdown, str):
            markdown_content = result.markdown
        elif hasattr(result.markdown, 'raw_markdown'):
            markdown_content = result.markdown.raw_markdown
        else:
            markdown_content = str(result.markdown)
        print(f"✓ Extracted {len(markdown_content)} chars of markdown")
        print(f"  Preview: {markdown_content[:200]}")
        # Screenshot test - need to re-fetch with screenshot enabled
        screenshot_config = CrawlerRunConfig(cache_mode=CacheMode.BYPASS, screenshot=True)
        result2 = await crawler.arun(url=result.url, config=screenshot_config)
        print(f"✓ Screenshot captured: {result2.screenshot is not None}")
        return True
    finally:
        # Close session
        await crawler.__aexit__(None, None, None)
        print("✓ Session closed")
 async def test_html_format():
    """Test HTML output format."""
    print("\n" + "="*60)
    print("TEST 3: Quick Crawl - HTML Format")
    print("="*60)
    crawler_config = BrowserConfig(headless=True, verbose=False)
    run_config = CrawlerRunConfig(cache_mode=CacheMode.BYPASS)
    async with AsyncWebCrawler(config=crawler_config) as crawler:
        result = await crawler.arun(url="https://example.com", config=run_config)
        print(f"Success: {result.success}")
        print(f"HTML length: {len(result.html)}")
        print(f"HTML preview:\n{result.html[:300]}")
        return result.success
 async def main():
    """Run all tests."""
    print("\n" + "="*70)
    print(" CRAWL4AI TOOLS TEST SUITE")
    print("="*70)
    tests = [
        ("Quick Crawl (Markdown)", test_quick_crawl),
        ("Session Workflow", test_session_workflow),
        ("Quick Crawl (HTML)", test_html_format),
    ]
    results = []
    for name, test_func in tests:
        try:
            result = await test_func()
            results.append((name, result, None))
        except Exception as e:
            results.append((name, False, str(e)))
    # Summary
    print("\n" + "="*70)
    print(" TEST SUMMARY")
    print("="*70)
    for name, success, error in results:
        status = "✓ PASS" if success else "✗ FAIL"
        print(f"{status} - {name}")
        if error:
            print(f"     Error: {error}")
    total = len(results)
    passed = sum(1 for _, success, _ in results if success)
    print(f"\nTotal: {total} | Passed: {passed} | Failed: {total - passed}")
    return all(success for _, success, _ in results)
 if __name__ == "__main__":
    success = asyncio.run(main())
    exit(0 if success else 1)