Web Research

GHOST can search the web and fetch page content for research tasks.

GHOST supports two search backends:

  • SearXNG (default) — self-hosted meta-search engine. Aggregates results from multiple engines and returns rich metadata (source engines, relevance scores, ranking positions).
  • Brave Search — API-based fallback. Requires a subscription key.

Point GHOST at your SearXNG instance via config or environment variable:

~/.config/ghost/config.toml
[web.search]
provider = "searxng"
url = "http://localhost:8888"

Or set the SEARXNG_URL environment variable — GHOST will pick it up automatically without any TOML config.
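
As a minimal sketch of the environment-variable route, the snippet below exports SEARXNG_URL and then runs a search. The address is the one from the config example and is only an assumption about where your instance listens. The `command -v` guard is there so the snippet exits cleanly on machines where ghost is not installed.

```shell
# Assumed local SearXNG address, copied from the config example; adjust to your instance.
export SEARXNG_URL="http://localhost:8888"

# Run a search only if the ghost binary is on PATH.
if command -v ghost >/dev/null 2>&1; then
  ghost web search "rust error handling best practices"
else
  echo "ghost not found on PATH" >&2
fi
```

To make the setting persistent, the export line can go in your shell profile.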

If you don’t run a SearXNG instance, switch the provider to Brave Search, which requires a subscription API key:

~/.config/ghost/config.toml
[web.search]
provider = "brave"

Run a search from the terminal:
ghost web search "rust error handling best practices"
ghost web search "latest AI news" -n 10

The fetch command extracts readable content from web pages. It supports two modes:

  • Readability — article extraction (Mozilla Readability algorithm)
  • Raw — full HTML output

Fetch a page from the terminal, optionally selecting a mode:
ghost web fetch https://example.com
ghost web fetch https://example.com --readability
ghost web fetch https://example.com --raw

Set the URL of a Crawl4AI instance under [web]:

~/.config/ghost/config.toml
[web]
crawl4ai_url = "http://localhost:11235"

All web fetches are automatically cached to $WORKSPACE/.web-cache/. The reflection agent triages those artifacts when it runs, creating sources linked to the fetched data.
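
To see what has been cached, you can list the directory directly. This is a sketch under two assumptions: that WORKSPACE is set in your shell to your GHOST workspace, and that falling back to the current directory is acceptable when it is not. The names and formats of the cached files are not documented here, so the snippet only lists them and reports their size.

```shell
# WORKSPACE is assumed to point at your GHOST workspace;
# the fallback to the current directory is an illustrative choice.
cache_dir="${WORKSPACE:-$PWD}/.web-cache"

if [ -d "$cache_dir" ]; then
  # Show cached artifacts, newest first.
  ls -lt "$cache_dir"
  # Report the total disk usage of the cache.
  du -sh "$cache_dir"
else
  echo "no cache directory at $cache_dir yet"
fi
```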

During conversation, your GHOST proactively:

  1. Searches the web when current information is needed
  2. Fetches full pages to read beyond search snippets
  3. Caches results for later reference
  4. Cites sources in responses

The deep-research agent takes this further — it performs iterative multi-source research, fetching multiple full pages before drawing conclusions.