This commit introduces significant enhancements to the Crawl4AI ecosystem: Chrome Extension - Script Builder (Alpha): - Add recording functionality to capture user interactions (clicks, typing, scrolling) - Implement smart event grouping for cleaner script generation - Support export to both JavaScript and C4A script formats - Add timeline view for visualizing and editing recorded actions - Include wait commands (time-based and element-based) - Add saved flows functionality for reusing automation scripts - Update UI with consistent dark terminal theme (Dank Mono font, green/pink accents) - Release new extension versions: v1.1.0, v1.2.0, v1.2.1 LLM Context Builder Improvements: - Reorganize context files from llmtxt/ to llm.txt/ with better structure - Separate diagram templates from text content (diagrams/ and txt/ subdirectories) - Add comprehensive context files for all major Crawl4AI components - Improve file naming convention for better discoverability Documentation Updates: - Update apps index page to match main documentation theme - Standardize color scheme: "Available" tags use primary color (#50ffff) - Change "Coming Soon" tags to dark gray for better visual hierarchy - Add interactive two-column layout for extension landing page - Include code examples for both Schema Builder and Script Builder features Technical Improvements: - Enhance event capture mechanism with better element selection - Add support for contenteditable elements and complex form interactions - Implement proper scroll event handling for both window and element scrolling - Add meta key support for keyboard shortcuts - Improve selector generation for more reliable element targeting The Script Builder is released as Alpha, acknowledging potential bugs while providing early access to this powerful automation recording feature.
Crawl4AI Chrome Extension
Visual schema and script builder for Crawl4AI - Build extraction schemas by clicking on webpage elements!
🚀 Features
- Visual Schema Builder: Click on elements to build extraction schemas
- Smart Element Selection: Container and field selection with visual feedback
- Code Generation: Generates complete Python code with LLM integration
- Beautiful Dark UI: Consistent with Crawl4AI's design language
- One-Click Download: Get your generated code instantly
📦 Installation
Method 1: Load Unpacked Extension (Recommended for Development)
- Open Chrome and navigate to
chrome://extensions/ - Enable "Developer mode" in the top right corner
- Click "Load unpacked"
- Select the
crawl4ai-assistantfolder - The extension icon (🚀🤖) will appear in your toolbar
Method 2: Generate Icons First
If you want proper icons:
- Open
icons/generate_icons.htmlin your browser - Right-click each canvas and save as:
icon-16.pngicon-48.pngicon-128.png
- Then follow Method 1 above
🎯 How to Use
Building a Schema
- Navigate to any website you want to extract data from
- Click the Crawl4AI extension icon in your toolbar
- Click "Schema Builder" to start the capture mode
- Select a container element:
- Hover over elements (they'll highlight in blue)
- Click on a repeating container (e.g., product card, article block)
- Select fields within the container:
- Elements will now highlight in green
- Click on each piece of data you want to extract
- Name each field (e.g., "title", "price", "description")
- Generate the code:
- Click "Generate Code" in the extension popup
- A Python file will automatically download
Running the Generated Code
The downloaded Python file contains:
# 1. The HTML snippet of your selected container
HTML_SNIPPET = """..."""
# 2. The extraction query based on your selections
EXTRACTION_QUERY = """..."""
# 3. Functions to generate and test the schema
async def generate_schema():
# Generates the extraction schema using LLM
async def test_extraction():
# Tests the schema on the actual website
To use it:
- Install Crawl4AI:
pip install crawl4ai - Run the script:
python crawl4ai_schema_*.py - The script will generate a
generated_schema.jsonfile - Use this schema in your Crawl4AI projects!
🎨 Visual Feedback
- Blue dashed outline: Container selection mode
- Green dashed outline: Field selection mode
- Solid blue outline: Selected container
- Solid green outline: Selected fields
- Floating toolbar: Shows current mode and selection status
⌨️ Keyboard Shortcuts
- ESC: Cancel current capture session
🔧 Technical Details
- Built with Manifest V3 for security and performance
- Pure client-side - no data sent to external servers
- Generates code that uses Crawl4AI's LLM integration
- Smart selector generation prioritizes stable attributes
🐛 Troubleshooting
Extension doesn't load
- Make sure you're in Developer Mode
- Check the console for any errors
- Ensure all files are in the correct directories
Can't select elements
- Some websites may block extensions
- Try refreshing the page
- Make sure you clicked "Schema Builder" first
Generated code doesn't work
- Ensure you have Crawl4AI installed
- Check that you have an LLM API key configured
- Make sure the website structure hasn't changed
🤝 Contributing
This extension is part of the Crawl4AI project. Contributions are welcome!
- Report issues: GitHub Issues
- Join discussion: Discord
📄 License
Same as Crawl4AI - see main project for details.