Start crawl job
Authorization
bearerAuth API key authentication. Include your API key in the Authorization header as Bearer rf_your_key.
In: header
Query Parameters
Block until job completes and return results directly. Max wait time is 2 minutes. Returns 202 if timeout exceeded.
falseMaximum seconds to wait when wait=true (default 120s, max 120s/2min). For longer jobs, use async mode.
120int6410 <= value <= 120Request Body
application/json
Content cleaner chain (default: [markdown])
Optional LLM configuration override (BYOK)
Crawl configuration options
Extraction instructions - either a structured schema (YAML/JSON with 'name' and 'fields') or freeform natural language prompt. The API auto-detects the format.
1 <= lengthSeed URL to start crawling from
1 <= lengthInline ephemeral webhook configuration
ID of a saved webhook to call on job events
Simple webhook URL (backward compatible)
uriResponse Body
application/json
application/problem+json
curl -X POST "https://api.refyne.uk/api/v1/crawl" \ -H "Content-Type: application/json" \ -d '{ "schema": "string", "url": "https://example.com/products" }'{
"cost_usd": 0.15,
"data": {
"property1": null,
"property2": null
},
"duration_ms": 12500,
"error_message": "string",
"job_id": "01HXYZ123ABC456DEF789",
"page_count": 5,
"status": "completed",
"status_url": "https://api.refyne.uk/api/v1/jobs/01HXYZ123ABC456DEF789",
"token_usage": {
"input": 8500,
"output": 2100
}
}{
"detail": "Property foo is required but is missing.",
"errors": [
{
"location": "string",
"message": "string",
"value": null
}
],
"instance": "https://example.com/error-log/abc123",
"status": 400,
"title": "Bad Request",
"type": "https://example.com/errors/example"
}