From 8dea3f470f5a496a30dada1eab1c3b23ee3560ca Mon Sep 17 00:00:00 2001 From: UncleCode Date: Fri, 22 Nov 2024 18:50:12 +0800 Subject: [PATCH] chore: update README to include new features and improvements for version 0.3.74 --- README.md | 25 +++++++++++++------------ 1 file changed, 13 insertions(+), 12 deletions(-) diff --git a/README.md b/README.md index b0f9fff9..fa88a507 100644 --- a/README.md +++ b/README.md @@ -13,18 +13,19 @@ Crawl4AI simplifies asynchronous web crawling and data extraction, making it acc ## New in 0.3.74 ✨ -🚀 **Blazing Fast Scraping**: Significantly improved scraping speed. -📥 **Download Manager**: Integrated file crawling, downloading, and tracking within `CrawlResult`. -📝 **Markdown Strategy**: Flexible system for custom markdown generation and formats. -🔗 **LLM-Friendly Citations**: Auto-converts links to numbered citations with reference lists. -🔎 **Markdown Filter**: BM25-based content extraction for cleaner, relevant markdown. -🖼️ **Image Extraction**: Supports `srcset`, `picture`, and responsive image formats. -🗂️ **Local/Raw HTML**: Crawl `file://` paths and raw HTML (`raw:`) directly. -🤖 **Browser Control**: Custom browser setups with stealth integration to bypass bots. -☁️ **API & Cache Boost**: CORS, static serving, and enhanced filesystem-based caching. -🐳 **API Gateway**: Run as an API service with secure token authentication. -🛠️ **Database Upgrades**: Optimized for larger content sets with faster caching. -🐛 **Bug Fixes**: Resolved browser context issues, memory leaks, and improved error handling. +- 🚀 **Blazing Fast Scraping**: Significantly improved scraping speed. +- 📥 **Download Manager**: Integrated file crawling, downloading, and tracking within `CrawlResult`. +- 📝 **Markdown Strategy**: Flexible system for custom markdown generation and formats. +- 🔗 **LLM-Friendly Citations**: Auto-converts links to numbered citations with reference lists. +- 🔎 **Markdown Filter**: BM25-based content extraction for cleaner, relevant markdown. +- 🖼️ **Image Extraction**: Supports `srcset`, `picture`, and responsive image formats. +- 🗂️ **Local/Raw HTML**: Crawl `file://` paths and raw HTML (`raw:`) directly. +- 🤖 **Browser Control**: Custom browser setups with stealth integration to bypass bots. +- ☁️ **API & Cache Boost**: CORS, static serving, and enhanced filesystem-based caching. +- 🐳 **API Gateway**: Run as an API service with secure token authentication. +- 🛠️ **Database Upgrades**: Optimized for larger content sets with faster caching. +- 🐛 **Bug Fixes**: Resolved browser context issues, memory leaks, and improved error handling. + ## Try it Now!