JSON mode - LLM Extract
Extract structured data from pages via LLMs
Scrape and extract structured data with Firecrawl
Firecrawl uses AI to get structured data from web pages in 3 steps:
- Set the Schema: Tell us what data you want by defining a JSON schema (using OpenAI’s format) along with the webpage URL.
- Make the Request: Send your URL and schema to our scrape endpoint. See the Scrape Endpoint Documentation for details.
- Get Your Data: Get back clean, structured data matching your schema that you can use right away.
This makes getting web data in the format you need quick and easy.
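As a rough illustration of the first and last steps, the sketch below defines a JSON schema as a plain Python dictionary and checks a returned object against its required fields. The field names (`company_mission`, `supports_sso`, `is_open_source`), the sample result, and the `missing_required` helper are all hypothetical and not part of Firecrawl.

```python
# Step 1: describe the fields you want as a JSON Schema object.
# These field names are invented for illustration only.
schema = {
    "type": "object",
    "properties": {
        "company_mission": {"type": "string"},
        "supports_sso": {"type": "boolean"},
        "is_open_source": {"type": "boolean"},
    },
    "required": ["company_mission", "supports_sso", "is_open_source"],
}


def missing_required(data: dict, schema: dict) -> list[str]:
    """Return the required schema fields that are absent from `data`."""
    return [key for key in schema.get("required", []) if key not in data]


# Step 3: a hypothetical extraction result, checked before it is used.
example_result = {
    "company_mission": "placeholder mission text",
    "supports_sso": True,
    "is_open_source": False,
}

if missing_required(example_result, schema):
    raise ValueError("extraction result is missing required fields")
```

A check like this is optional, but it catches incomplete results early instead of failing later in the code that consumes them.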
Extract structured data
/scrape (with json) endpoint
Used to extract structured data from scraped pages.
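A request to this endpoint might be assembled as in the sketch below, which builds the HTTP request without sending it. The endpoint URL, the `formats` value, the `extract` key, and the bearer-token header are assumptions inferred from this page; confirm them against the Scrape Endpoint Documentation before relying on them.

```python
import json
import urllib.request

# Assumed endpoint URL; confirm it in the Scrape Endpoint Documentation.
SCRAPE_URL = "https://api.firecrawl.dev/v1/scrape"


def build_scrape_request(api_key: str, page_url: str, schema: dict) -> urllib.request.Request:
    """Build (but do not send) a POST request asking for schema-based extraction."""
    body = {
        "url": page_url,
        # Assumed from the "/scrape (with json)" heading on this page.
        "formats": ["json"],
        # Assumed from the "Extract object" section; the live API may
        # use a different key for these options.
        "extract": {"schema": schema},
    }
    return urllib.request.Request(
        SCRAPE_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Hypothetical schema with a single required field.
title_schema = {
    "type": "object",
    "properties": {"title": {"type": "string"}},
    "required": ["title"],
}

request = build_scrape_request("YOUR_API_KEY", "https://example.com", title_schema)
```

To send the request, pass it to `urllib.request.urlopen` and decode the response body with `json.load`.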
Extracting without schema (New)
You can now extract without a schema by passing only a prompt to the endpoint. The LLM chooses the structure of the data.
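A prompt-only request body could look like the following sketch. The `build_prompt_only_body` helper is hypothetical, and the `formats` and `extract` keys are assumptions based on this page rather than a confirmed API shape.

```python
import json


def build_prompt_only_body(page_url: str, prompt: str) -> dict:
    """Build a request body that carries a prompt and no schema."""
    if not prompt.strip():
        # Illustrative check: with no schema, the prompt is the only instruction.
        raise ValueError("prompt must not be empty when no schema is given")
    return {
        "url": page_url,
        "formats": ["json"],            # assumed value
        "extract": {"prompt": prompt},  # assumed key; no "schema" entry
    }


body = build_prompt_only_body(
    "https://example.com",
    "Extract the company mission from the page.",
)
print(json.dumps(body, indent=2))
```

Because the LLM chooses the structure, the keys of the returned data can differ between pages, so downstream code should not assume a fixed shape.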
Extract object
The extract object accepts the following parameters:
- schema: The schema to use for the extraction.
- systemPrompt: The system prompt to use for the extraction.
- prompt: The prompt to use for the extraction without a schema.
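One way to model these three parameters in client code is a small dataclass that serializes only the fields that were set. The `ExtractOptions` class and its `to_payload` method are hypothetical helpers, and the rule that a schema or a prompt must be present is an assumption made for this sketch.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ExtractOptions:
    """Hypothetical client-side container for the extract object."""

    schema: Optional[dict] = None
    system_prompt: Optional[str] = None
    prompt: Optional[str] = None

    def to_payload(self) -> dict:
        """Return a dict using the parameter names listed above, omitting unset fields."""
        if self.schema is None and self.prompt is None:
            # Assumed rule: there must be something to guide the extraction.
            raise ValueError("provide a schema, a prompt, or both")
        fields = {
            "schema": self.schema,
            "systemPrompt": self.system_prompt,
            "prompt": self.prompt,
        }
        return {key: value for key, value in fields.items() if value is not None}


options = ExtractOptions(
    schema={"type": "object", "properties": {"title": {"type": "string"}}},
    system_prompt="You are a careful data extractor.",
)
payload = options.to_payload()
```

Dropping unset fields keeps the request body minimal and avoids sending explicit nulls, which an API may or may not accept.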