PlatformWeb scraping infrastructure
Send a URL.Get structured data back.
ScrapeNest runs the browsers, rotates the egress, retries the failures and hands you a clean artifact bundle over one API. You ship features instead of nursing headless Chrome at 3 a.m.
3 engines/one API/hosted in France
$ curl -sX POST https://api.scrapenest.com/v1/jobs \
-H "Authorization: Bearer $KEY" \
-d '{ "url": "…/pricing", "engine": "standard",
"extract": { "price": ".price::text" } }'from a curl-fast HTTP path to hardened anti-bot evasion
job-acceptance target on Pro and above
data residency — built and hosted in France
logged, traceable and reproducible
Three moving parts, none of them yours
You describe the work. We own the ugly middle. The result lands somewhere you can reach it.
Submit
POST a job with a target URL, the engine you want, and optional extraction rules. One key, one endpoint, scoped to your organization.
We execute
Temporal orchestrates the run. We pick a clean egress IP, render JavaScript when the page needs it, back off and retry on transient failures, and quarantine the ones that don't recover.
You collect
Pull the artifact bundle — rendered HTML, parsed JSON, screenshots, HAR, run metadata — or get a signed webhook the moment it's ready. Artifacts live in object storage on your retention policy.
Pick the engine that fits the target
Same API, three execution profiles. Start cheap and fast; reach for the heavy machinery only when a site forces you to.
Light
HTTP (TLS impersonation)
As fast as curl, dressed like a browser
Pure HTTP that mimics the TLS fingerprints and headers of real browsers. No browser, no render cost — the cheapest, fastest way to pull JSON APIs and server-rendered HTML at volume.
APIs · static HTML · high volume
1 credit / job
Standard
Playwright + Chromium
Render the page, drive it like a user
Headless Chromium that runs the page's JavaScript, then clicks, scrolls, paginates, fills forms and captures screenshots or HAR. The workhorse for SPAs and multi-step flows behind a login.
SPAs · interactions · logged-in flows
5 credits / job
Stealth
Camoufox
For the targets that fight back
Camoufox-based fingerprint hardening for sites with serious bot defenses (Cloudflare, Akamai, DataDome). Reach for it when Standard gets blocked — not before, because it's the priciest path.
Hostile targets · heavy anti-bot
30 credits / job
What you don't have to build
The parts of a scraping platform that look trivial in a demo and eat a quarter in production.
Retries that mean something
Idempotent dispatch, exponential backoff, a dead-letter queue for the genuinely broken. A flaky target doesn't become a 2 a.m. page.
Real browser automation
Click, scroll, wait for selectors, fill forms. Camoufox-based stealth for the targets that fight back, plain Chromium when they don't.
Extraction at the edge
Ship CSS/XPath/JSONPath rules with the job and get structured JSON back, or take the readability-cleaned article. No second pipeline to maintain.
Webhooks you can trust
Signed payloads, replay protection, automatic retries and delivery tracking. Stop polling; let completed jobs come to you.
Observability included
Per-job metrics, structured logs and traces. Every run carries a request ID you can follow from API call to stored artifact.
An API that documents itself
OpenAPI spec, typed SDKs, and a console playground for the debug loop. Predictable errors, no surprises in the contract.
