unclecode
aeb2114170
Add example of REST API call
2024-06-07 16:24:40 +08:00
unclecode
b32013cb97
Fix README file hyperlink
2024-06-07 15:37:05 +08:00
unclecode
226a62a3c0
feat: Add screenshot functionality to crawl_urls
2024-06-07 15:33:15 +08:00
unclecode
8e73a482a2
feat: Add screenshot functionality to crawl_urls
...
The code changes in this commit add the `screenshot` parameter to the `crawl_urls` function in `main.py`. This allows users to specify whether they want to take a screenshot of the page during the crawling process. The default value is `False`.
This commit message follows the established convention of starting with a type (feat for feature) and providing a concise and descriptive summary of the changes made.
2024-06-07 15:23:32 +08:00
Gökhan Geyik
8f44db6499
Update README.md
2024-06-05 17:16:02 +03:00
unclecode
e5d401c67c
Update generated code sample
2024-06-02 16:06:43 +08:00
unclecode
ae77589a98
Update Readme
2024-06-02 15:42:13 +08:00
unclecode
ad373c0e19
Update Readme
2024-06-02 15:41:24 +08:00
unclecode
51f26d12fe
Update for v0.2.2
...
- Support multiple JS scripts
- Fixed some of bugs
- Resolved a few issue relevant to Colab installation
2024-06-02 15:40:18 +08:00
Unclecode
bf00c26a83
chore: Update Dockerfile to install chromium-chromedriver and spacy library
2024-05-18 09:16:52 +00:00
unclecode
e3524a10a7
chore: Update REST API base URL in README.md
2024-05-17 23:28:29 +08:00
unclecode
ce052a4eb5
Update README
2024-05-17 18:29:59 +08:00
unclecode
b43d77a56b
Update README
2024-05-17 18:28:39 +08:00
unclecode
1635a92218
chore: Update Crawl4AI quickstart script in README.md
2024-05-17 18:25:32 +08:00
unclecode
2a8a1b27e1
chore: Update Readme
2024-05-17 18:24:47 +08:00
unclecode
f5f3cce2c8
Merge new-release-0.0.2-no-spacy into main for v0.2.0 release
2024-05-17 18:23:27 +08:00
unclecode
6f96dcd649
chore: Update README
2024-05-17 18:12:50 +08:00
unclecode
957a2458b1
chore: Update web crawler URLs to use NBC News business section
2024-05-17 18:11:13 +08:00
unclecode
32c87f0388
chore: Update NlpSentenceChunking constructor parameters to None
...
The NlpSentenceChunking constructor parameters have been updated to None in order to simplify the usage of the class. This change removes the need for specifying the SpaCy model for sentence detection, making the code more concise and easier to understand.
2024-05-17 17:00:43 +08:00
unclecode
647cfda225
chore: Update Crawl4AI quickstart script in README.md
...
This commit updates the Crawl4AI quickstart script in the README.md file. The script is now properly formatted and aligned, making it easier to read and understand. The unnecessary indentation has been removed, and the script is now more concise and efficient.
2024-05-17 16:55:34 +08:00
unclecode
1cc67df301
chore: Update pip installation command and requirements, add new dependencies
2024-05-17 16:53:03 +08:00
unclecode
f85df91ca6
chore: Update README.md with Colab badge
2024-05-17 00:21:16 +08:00
unclecode
ea16dec587
Improve library loading
2024-05-16 21:19:02 +08:00
unclecode
45569d058d
chore: Update pip installation command and requirements for Crawl4AI
2024-05-16 20:42:53 +08:00
unclecode
5bb0b0b378
chore: Update pip installation command and requirements for Crawl4AI
2024-05-16 20:36:29 +08:00
unclecode
c8589f8da3
Update:
...
- Fix Spacy model issue
- Update Readme and requirements.txt
2024-05-16 19:50:20 +08:00
unclecode
6a6365ae0a
Refactor code to exclude the extraction of semantical blocks of text from the HTML
2024-05-16 18:10:55 +08:00
unclecode
5b80be956d
Update:
...
- Debug
- Refactor code for new version
2024-05-16 17:31:44 +08:00
UncleCode
4a2e17447b
Update README.md
2024-05-16 08:57:58 +08:00
unclecode
f6e59157bf
- Test all methods
...
- Update index.hml
- Update Readme
- Resolve some bugs
2024-05-14 21:27:41 +08:00
unclecode
8e536b9717
chore: Refactor README.md and project structure
2024-05-12 12:41:42 +08:00
unclecode
aac4e07389
chore: Update README.md and project structure
2024-05-12 12:39:31 +08:00
UncleCode
e3960ace68
Update README.md
...
Explain more about `extract_blocks_flag`
2024-05-11 22:11:16 +08:00
UncleCode
b0f97ab2b3
Update README.md
...
Public server is available now
2024-05-11 08:56:19 +08:00
unclecode
20ef255c7f
Update README
2024-05-09 23:28:47 +08:00
unclecode
da7748a780
Update README file
2024-05-09 22:51:10 +08:00
unclecode
f74f4e88c0
Update README file
2024-05-09 22:48:42 +08:00
unclecode
a8e7218769
chore: Update README.md and project structure
2024-05-09 22:40:08 +08:00
unclecode
84f093593a
Update README
2024-05-09 22:37:45 +08:00
unclecode
88643612e8
chore: Update environment variable usage in config files
2024-05-09 22:37:01 +08:00
unclecode
6f99bad6f0
Update web application URL in README.md
2024-05-09 22:28:37 +08:00
unclecode
50d7a7e45d
chore: Update forced flag for single page fetch to use default value
2024-05-09 22:21:12 +08:00
unclecode
c71dd9189b
chore: Update import statements to use crawl4ai package
2024-05-09 22:17:15 +08:00
UncleCode
7ee8001b7d
Update README.md
...
Add configuration section
2024-05-09 21:49:04 +08:00
unclecode
c71adb29ce
chore: Update .gitignore and README.md
2024-05-09 19:25:25 +08:00
unclecode
898ec30a18
chore: Update license information in README.md
...
`chore: Update social media links in index.html`
2024-05-09 19:14:48 +08:00
unclecode
343c4477f8
Update Crawl4AI web application URL in README.md
2024-05-09 19:13:20 +08:00
unclecode
99e0dd1ccd
chore: Update README.md with installation instructions for Crawl4AI library and local server
2024-05-09 19:12:39 +08:00
unclecode
b8e743cd8d
Initial Commit
2024-05-09 19:10:25 +08:00