chore: release v4.0.0 - sync 550+ skills and restructure docs

2026-01-28 17:15:26 +01:00
parent 8c49211c70
commit 0ffee44828
683 changed files with 170528 additions and 588 deletions
--- a/skills/vector-index-tuning/SKILL.md
+++ b/skills/vector-index-tuning/SKILL.md
@@ -0,0 +1,42 @@
+---
+name: vector-index-tuning
+description: Optimize vector index performance for latency, recall, and memory. Use when tuning HNSW parameters, selecting quantization strategies, or scaling vector search infrastructure.
+---
+
+# Vector Index Tuning
+
+Guide to optimizing vector indexes for production performance.
+
+## Use this skill when
+
+- Tuning HNSW parameters
+- Implementing quantization
+- Optimizing memory usage
+- Reducing search latency
+- Balancing recall vs speed
+- Scaling to billions of vectors
+
+## Do not use this skill when
+
+- You only need exact search on small datasets (use a flat index)
+- You lack workload metrics or ground truth to validate recall
+- You need end-to-end retrieval system design beyond index tuning
+
+## Instructions
+
+1. Gather workload targets (latency, recall, QPS), data size, and memory budget.
+2. Choose an index type and establish a baseline with default parameters.
+3. Benchmark parameter sweeps using real queries and track recall, latency, and memory.
+4. Validate changes on a staging dataset before rolling out to production.
+
+Refer to `resources/implementation-playbook.md` for detailed patterns, checklists, and templates.
+
+## Safety
+
+- Avoid reindexing in production without a rollback plan.
+- Validate changes under realistic load before applying globally.
+- Track recall regressions and revert if quality drops.
+
+## Resources
+
+- `resources/implementation-playbook.md` for detailed patterns, checklists, and templates.