From a56dd07559aa7a342e7bdaabfb30300167a4a61f Mon Sep 17 00:00:00 2001
From: unclecode
Date: Sun, 1 Feb 2026 06:53:53 +0000
Subject: [PATCH] =?UTF-8?q?Merge=20PRs=20#1667,=20#1296,=20#1364=20?=
 =?UTF-8?q?=E2=80=94=20CLI=20deep-crawl,=20env=20var,=20script=20tags?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

- PR #1667: Fix deep-crawl CLI outputting only the first page
- PR #1296: Fix VersionManager ignoring CRAWL4_AI_BASE_DIRECTORY
- PR #1364: Fix script tag removal losing adjacent text
- Fix: restore .crawl4ai subfolder in VersionManager path
- Close #1150 (already fixed on develop)
- Update CONTRIBUTORS.md and PR-TODOLIST.md
---
 .context/PR-TODOLIST.md | 14 +++++++++-----
 CONTRIBUTORS.md         |  3 +++
 2 files changed, 12 insertions(+), 5 deletions(-)

diff --git a/.context/PR-TODOLIST.md b/.context/PR-TODOLIST.md
index 690f062e..b308f243 100644
--- a/.context/PR-TODOLIST.md
+++ b/.context/PR-TODOLIST.md
@@ -1,6 +1,6 @@
 # PR Review Todolist
 
-> Last updated: 2026-02-01 | Total open PRs: 85
+> Last updated: 2026-02-01 | Total open PRs: 81
 
 ---
 
@@ -14,22 +14,22 @@
 | ~~#1719~~ | ~~YuriNachos~~ | ~~Fix GoogleSearchCrawler `script.js` missing from package distribution. (#1711)~~ | **merged** |
 | ~~#1717~~ | ~~YuriNachos~~ | ~~Fix local sentence-transformers embeddings blocked by OpenAI fallback. (#1658)~~ | **merged** |
 | ~~#1714~~ | ~~YuriNachos~~ | ~~Fix: Replace `tf-playwright-stealth` with `playwright-stealth` dependency. (#1553)~~ | **merged** |
-| #1667 | christian-oudard | Fix `crwl --deep-crawl` only outputting first page. Real CLI bug with tests. | pending |
+| ~~#1667~~ | ~~christian-oudard~~ | ~~Fix `crwl --deep-crawl` only outputting first page. Real CLI bug with tests.~~ | **merged** |
 | #1640 | Martichou | Fix memory leak — unused browser contexts never cleaned up under continuous load. (#943) | pending |
 | #1622 | zhaoyun006 | Fix redirect target verification in AsyncUrlSeeder and enhance tests. | pending |
 | #1592 | jzmiller1 | Fix CDP page leaks and race conditions in concurrent crawling. (#1563) | pending |
 | #1572 | yuexuan-chen | Fix CDP setting with managed browser. | pending |
 | #1450 | prlz77 | Fix LLM extraction fails when content is in alternative response fields. | pending |
-| #1364 | nnxiong | Fix `