tavily-extractlisted
Install: claude install-skill Neetx/ai-research-radar
# tavily extract
Extract clean markdown/text content from one or more URLs. In this repo,
"opened" means extracted with this skill (or fetched with a built-in web tool) —
only opened URLs are citable as evidence.
## Before running any command
Check `tvly --status`. If `tvly` is not on PATH:
```bash
pip install -q tavily-cli || uv tool install tavily-cli
```
Auth comes from the `TAVILY_API_KEY` environment variable (already set in this
environment). Never print it. If it is unset, report that and fall back to the
built-in web tools.
## Quick start
```bash
# Single URL
tvly extract "https://example.com/article" --json
# Batch (max 20 per call)
tvly extract "https://arxiv.org/abs/XXXX.XXXXX" "https://github.com/org/repo" --json
# Long pages: only the relevant chunks
tvly extract "https://example.com/docs" --query "release date changelog" --chunks-per-source 3 --json
# JS-heavy pages
tvly extract "https://app.example.com" --extract-depth advanced --json
```
## Key options
| Option | Description |
|--------|-------------|
| `--query` + `--chunks-per-source` | Return only chunks relevant to a query (1–5 per URL) |
| `--extract-depth` | `basic` (default, try first) or `advanced` (JS-rendered pages) |
| `--format` | `markdown` (default) or `text` |
| `--timeout` | 1–60 seconds for slow pages |
| `--json` / `-o FILE` | Structured output / save to file |
## Tips
- Always quote URLs (shell interprets `?` and `&`).
- Try `basic` first; escalate to `advanced` only if content