Files
crawl4ai/crawl4ai
unclecode 3d78001c30 Add smart TTL cache for sitemap URL seeder
- Add cache_ttl_hours and validate_sitemap_lastmod params to SeedingConfig
- New JSON cache format with metadata (version, created_at, lastmod, url_count)
- Cache validation by TTL expiry and sitemap lastmod comparison
- Auto-migration from old .jsonl to new .json format
- Fixes bug where incomplete cache was used indefinitely
2025-12-30 01:59:09 +00:00
..
2025-02-19 14:13:17 +08:00
2025-08-04 19:02:01 +08:00
2025-12-11 11:04:52 +01:00
2025-12-11 11:04:52 +01:00
2025-12-11 11:04:52 +01:00
2025-12-11 11:04:52 +01:00
2025-12-21 04:48:03 +00:00
2025-10-22 20:41:06 +08:00
2025-10-22 20:41:06 +08:00
2025-01-13 19:19:58 +08:00
2025-12-21 04:48:03 +00:00
2025-07-08 11:46:13 +02:00
2025-12-11 11:04:52 +01:00
2025-12-11 11:04:52 +01:00
2025-01-13 19:19:58 +08:00
2025-01-13 19:19:58 +08:00
2025-12-21 04:48:03 +00:00
2025-12-11 11:04:52 +01:00
2025-08-04 19:02:01 +08:00