format bash scripts properly so copy & paste may work without issues
This commit is contained in:
@@ -34,15 +34,15 @@ crwl profiles
|
|||||||
## 2 Discovery – scrape companies & people
|
## 2 Discovery – scrape companies & people
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
python c4ai_discover.py full \
|
python c4ai_discover.py full \
|
||||||
--query "health insurance management" \
|
--query "health insurance management" \
|
||||||
--geo 102713980 \ # Malaysia geoUrn
|
--geo 102713980 \ # Malaysia geoUrn
|
||||||
--title-filters "" \ # or "Product,Engineering"
|
--title-filters "" \ # or "Product,Engineering"
|
||||||
--max-companies 10 \ # default set small for workshops
|
--max-companies 10 \ # default set small for workshops
|
||||||
--max-people 20 \ # \^ same
|
--max-people 20 \ # \^ same
|
||||||
--profile-name profile_linkedin_uc \
|
--profile-name profile_linkedin_uc \
|
||||||
--outdir ./data \
|
--outdir ./data \
|
||||||
--concurrency 2 \
|
--concurrency 2 \
|
||||||
--log-level debug
|
--log-level debug
|
||||||
```
|
```
|
||||||
**Outputs** in `./data/`:
|
**Outputs** in `./data/`:
|
||||||
@@ -66,15 +66,15 @@ _See more: <https://www.linkedin.com/search/results/companies/?geoUrn=XXX> – t
|
|||||||
## 3 Insights – embeddings, org‑charts, decision makers
|
## 3 Insights – embeddings, org‑charts, decision makers
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
python c4ai_insights.py \
|
python c4ai_insights.py \
|
||||||
--in ./data \
|
--in ./data \
|
||||||
--out ./data \
|
--out ./data \
|
||||||
--embed_model all-MiniLM-L6-v2 \
|
--embed-model all-MiniLM-L6-v2 \
|
||||||
--llm-provider gemini/gemini-2.0-flash \
|
--llm-provider gemini/gemini-2.0-flash \
|
||||||
--llm-api-key "" \
|
--llm-api-key "" \
|
||||||
--top-k 10 \
|
--top-k 10 \
|
||||||
--max-llm-tokens 8024 \
|
--max-llm-tokens 8024 \
|
||||||
--llm-temperature 1.0 \
|
--llm-temperature 1.0 \
|
||||||
--workers 4
|
--workers 4
|
||||||
```
|
```
|
||||||
Emits next to the Stage‑1 files:
|
Emits next to the Stage‑1 files:
|
||||||
|
|||||||
Reference in New Issue
Block a user