Getting started

  1. Create an account.
  2. Build a spider in the wizard, or start from a recipe (OpenAPI specs, safety data sheets, product pricing) and tweak it.
  3. Set the seeds, crawl scope, fetching tier, and the records you want to extract.
  4. Choose an output (store, webhook, object storage) and a schedule, then run it.
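
As a rough illustration of what steps 3 and 4 collect, the settings can be pictured as a single configuration object. The sketch below is not the product's API: the class name, field names, example values, and the validate helper are all hypothetical, chosen only to mirror the options listed above.

```python
from dataclasses import dataclass
from urllib.parse import urlparse

# Hypothetical names throughout; the real wizard fields may differ.
OUTPUTS = {"store", "webhook", "object_storage"}


@dataclass
class SpiderConfig:
    seeds: list[str]      # starting URLs for the crawl
    scope: str            # crawl scope label, e.g. "same-domain" (assumed)
    fetch_tier: str       # fetching tier label, e.g. "basic" (assumed)
    records: list[str]    # record types to extract
    output: str           # one of OUTPUTS
    schedule: str         # e.g. a cron-style expression (assumed format)


def validate(cfg: SpiderConfig) -> list[str]:
    """Return a list of human-readable problems; empty means the config looks usable."""
    problems = []
    if not cfg.seeds:
        problems.append("at least one seed URL is required")
    for url in cfg.seeds:
        parts = urlparse(url)
        if parts.scheme not in ("http", "https") or not parts.netloc:
            problems.append(f"invalid seed URL: {url}")
    if not cfg.records:
        problems.append("at least one record type is required")
    if cfg.output not in OUTPUTS:
        problems.append(f"unknown output: {cfg.output}")
    return problems


if __name__ == "__main__":
    cfg = SpiderConfig(
        seeds=["https://example.com/docs"],
        scope="same-domain",
        fetch_tier="basic",
        records=["product_price"],
        output="webhook",
        schedule="0 6 * * *",
    )
    print(validate(cfg))
```

Checking the settings before the first run, as validate does here, is one way to catch a missing seed or an unsupported output early rather than after a scheduled crawl has failed.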

Want to discover new sources rather than crawl known ones? Switch the spider to discovery mode to find new domains across the web and build a catalog.