- Issue Resolved: Every <pre> tag's HTML content is replaced with its inner text to address situations like syntax highlighters, where each character might be in a <span>. This avoids issues where the minimum word threshold might ignore them.
This commit is contained in:
3
.gitignore
vendored
3
.gitignore
vendored
@@ -164,4 +164,5 @@ cython_debug/
|
||||
Crawl4AI.egg-info/
|
||||
Crawl4AI.egg-info/*
|
||||
crawler_data.db
|
||||
.vscode/
|
||||
.vscode/
|
||||
test_pad.py
|
||||
Reference in New Issue
Block a user