交互 | Firecrawl

抓取页面以获取干净的数据，然后调用 /interact 在该页面中开始执行 actions——点击按钮、填写表单、提取动态内容，或进一步深入导航。只需描述你想做什么；如果需要完全控制，也可以编写代码。

AI prompts

描述你希望在页面中执行的操作

代码执行

通过代码安全地与 playwright、agent-browser 交互

Live view

通过可嵌入的流实时观看或与浏览器交互

工作原理

使用 POST /v2/scrape 抓取一个 URL。响应会在 data.metadata.scrapeId 中返回 scrapeId。如果你想持久保存浏览器状态，请在此请求中传入 profile。
调用 POST /v2/scrape/{scrapeId}/interact，并传入 prompt 或 Playwright code 进行交互。此处不要传入 profile；交互会话会继承抓取任务中的 profile。
完成后，使用 DELETE /v2/scrape/{scrapeId}/interact 停止该会话。对于可写的 profile，会话停止时会保存更改。

快速开始

抓取页面、与其交互，然后停止会话：

from firecrawl import Firecrawl

app = Firecrawl(api_key="fc-YOUR-API-KEY")

# 1. 抓取 Amazon 首页
result = app.scrape("https://www.amazon.com", formats=["markdown"])
scrape_id = result.metadata.scrape_id

# 2. 交互 — 搜索商品并获取价格
app.interact(scrape_id, prompt="Search for iPhone 16 Pro Max")
response = app.interact(scrape_id, prompt="Click on the first result and tell me the price")
print(response.output)

# 3. 停止会话
app.stop_interaction(scrape_id)

Response

{
  "success": true,
  "liveViewUrl": "https://liveview.firecrawl.dev/...",
  "interactiveLiveViewUrl": "https://liveview.firecrawl.dev/...",
  "output": "The iPhone 16 Pro Max (256GB) is priced at $1,199.00.",
  "exitCode": 0,
  "killed": false
}

通过 prompt 交互

这是与页面交互的最简单方式。用自然语言描述你的需求，它会自动点击、输入、滚动并提取数据。

response = app.interact(scrape_id, prompt="What are the customer reviews saying about battery life?")
print(response.output)

响应中包含一个 output 字段，其中包含代理的答案：

Response

{
  "success": true,
  "liveViewUrl": "https://liveview.firecrawl.dev/...",
  "interactiveLiveViewUrl": "https://liveview.firecrawl.dev/...",
  "output": "Customers are generally positive about battery life. Most reviewers report 8-10 hours of use on a single charge. A few noted it drains faster with heavy multitasking.",
  "stdout": "...",
  "result": "...",
  "stderr": "",
  "exitCode": 0,
  "killed": false
}

保持 prompt 简短且聚焦

当每个 prompt 都是单一且明确的任务时，效果最好。不要一次性要求代理完成复杂的多步骤工作流，而应将其拆分为单独的交互调用。每次调用都会复用同一个浏览器会话，因此状态会在调用之间延续。

运行代码

若要实现完全控制，你可以直接在浏览器沙箱中执行代码。page 变量 (一个 Playwright Page 对象) 可在 Node.js 和 Python 中使用。Bash 模式已预装 agent-browser。你还可以在当前会话中截取屏幕截图——在 Node.js 中使用 (await page.screenshot()).toString("base64")，在 Python 中使用 await page.screenshot(path="/tmp/screenshot.png")，或在 Bash 中使用 agent-browser screenshot。

Node.js (Playwright)

默认语言。可直接编写 Playwright 代码——page 已连接到浏览器。

response = app.interact(scrape_id, code="""
// 点击按钮并等待页面导航
await page.click('#next-page');
await page.waitForLoadState('networkidle');

// 从新页面提取内容
const title = await page.title();
const content = await page.$eval('.article-body', el => el.textContent);
JSON.stringify({ title, content });
""")
print(response.result)

Python

将 language 设置为 "python"，以使用 Playwright 的 Python API。

response = app.interact(
    scrape_id,
    code="""
import json

await page.click('#load-more')
await page.wait_for_load_state('networkidle')

items = await page.query_selector_all('.item')
data = []
for item in items:
    text = await item.text_content()
    data.append(text.strip())

print(json.dumps(data))
""",
    language="python",
)
print(response.stdout)

Bash (agent-browser)

agent-browser 是一个预装在沙箱中的 CLI 工具，提供 60 多个命令。它会提供带有元素引用 (@e1、@e2 等) 的辅助功能树，非常适合由 LLM 驱动的自动化。

# 拍摄快照以查看交互元素
snapshot = app.interact(
    scrape_id,
    code="agent-browser snapshot -i",
    language="bash",
)
print(snapshot.stdout)
# 输出：
# [document]
#   @e1 [input type="text"] "Search..."
#   @e2 [button] "Search"
#   @e3 [link] "About"

# 使用 @refs 与元素交互
app.interact(
    scrape_id,
    code='agent-browser fill @e1 "firecrawl" && agent-browser click @e2',
    language="bash",
)

常见的 agent-browser 命令：

命令	描述
`snapshot`	带元素引用的完整辅助功能树
`snapshot -i`	仅显示可交互元素
`click @e1`	通过引用点击元素
`fill @e1 "text"`	清空字段并输入文本
`type @e1 "text"`	不清空直接输入
`press Enter`	按下键盘按键
`scroll down 500`	向下滚动 500 像素
`get text @e1`	获取文本内容
`get url`	获取当前 URL
`wait @e1`	等待元素出现
`wait --load networkidle`	等待网络空闲
`find text "X" click`	按文本查找元素并点击
`screenshot`	对当前页面进行截图
`eval "js code"`	在页面中运行 JavaScript

实时视图

每个交互响应都会返回一个 liveViewUrl，你可以将其嵌入页面中，以实时查看浏览器画面。适用于调试、演示或构建基于浏览器的 UI。

Response

{
  "success": true,
  "liveViewUrl": "https://liveview.firecrawl.dev/...",
  "interactiveLiveViewUrl": "https://liveview.firecrawl.dev/...",
  "stdout": "",
  "result": "...",
  "exitCode": 0
}

<iframe src="LIVE_VIEW_URL" width="100%" height="600" />

交互式实时视图

响应还包含一个 interactiveLiveViewUrl。与仅可查看的标准实时视图不同，交互式实时视图允许用户通过嵌入式流直接点击、输入，并与浏览器会话交互。这对于构建面向用户的浏览器 UI 很有帮助——例如登录流程，或需要终端用户控制浏览器的引导式工作流。

<iframe src="INTERACTIVE_LIVE_VIEW_URL" width="100%" height="600" />

会话生命周期

创建

首次调用 POST /v2/scrape/{scrapeId}/interact 会延续抓取会话并启动交互。

复用

对同一个 scrapeId 的后续 interact 调用会复用现有会话。浏览器会保持打开状态，并在调用之间保留其状态，因此你可以将多个交互串联起来：

# 第一次调用——点击一个标签页
app.interact(scrape_id, code="await page.click('#tab-2')")

# 第二次调用——该标签页仍保持选中状态，提取其内容
result = app.interact(scrape_id, code="await page.$eval('#tab-2-content', el => el.textContent)")
print(result.result)

清理

完成后请显式停止会话：

app.stop_interaction(scrape_id)

会话也会根据 TTL (默认值：10 分钟) 或无活动 timeout (默认值：5 分钟) 自动过期。

请务必在使用完毕后停止会话，以避免不必要的计费。额度按秒折算。

使用 Scrape + 交互的持久化配置文件

默认情况下，每个 scrape + 交互会话都会从全新的浏览器状态开始。使用 profile，你可以在多次抓取之间保存并复用浏览器状态 (cookies、localStorage、会话) 。这对于保持登录状态和保留偏好设置非常有用。在初始 POST /v2/scrape 请求中传入 profile 对象。不要在 POST /v2/scrape/{scrapeId}/interact 中传入 profile；交互会话会复用抓取任务的浏览器会话和 profile 设置。使用 DELETE /v2/scrape/{scrapeId}/interact 停止交互会话，以便保存对可写配置文件所做的更改。

cURL

curl -X POST "https://api.firecrawl.dev/v2/scrape" \
  -H "Authorization: Bearer fc-YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://example.com",
    "formats": ["markdown"],
    "profile": {
      "name": "my-profile",
      "saveChanges": true
    }
  }'

curl -X POST "https://api.firecrawl.dev/v2/scrape/SCRAPE_ID/interact" \
  -H "Authorization: Bearer fc-YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "Click the login button"
  }'

curl -X DELETE "https://api.firecrawl.dev/v2/scrape/SCRAPE_ID/interact" \
  -H "Authorization: Bearer fc-YOUR_API_KEY"

配置文件的生命周期如下：

使用 profile.name 和 saveChanges: true 创建抓取。
针对返回的 scrapeId 运行 prompt 或代码交互。
停止会话以保存 cookies、localStorage 和其他浏览器状态。
稍后使用相同的 profile.name 启动新的抓取。当你只想读取现有状态而不将更改写回时，使用 saveChanges: false。

from firecrawl import Firecrawl

app = Firecrawl(api_key="fc-YOUR-API-KEY")

# Session 1: Scrape with a profile, log in, then stop (state is saved)
result = app.scrape(
    "https://app.example.com/login",
    formats=["markdown"],
    profile={"name": "my-app", "save_changes": True},
)
scrape_id = result.metadata.scrape_id

app.interact(scrape_id, prompt="Fill in user@example.com and password, then click Login")
app.stop_interaction(scrape_id)

# 会话 2：使用相同配置文件以只读模式进行抓取——已处于登录状态
result = app.scrape(
    "https://app.example.com/dashboard",
    formats=["markdown"],
    profile={"name": "my-app", "save_changes": False},
)
scrape_id = result.metadata.scrape_id

response = app.interact(scrape_id, prompt="Extract the dashboard data")
print(response.output)
app.stop_interaction(scrape_id)

参数	默认值	描述
`name`	—	持久化配置文件的名称。名称相同的抓取会共享浏览器状态。
`saveChanges`	`true`	当为 `true` 时，交互会话停止后会将浏览器状态保存回该配置文件。设为 `false` 可在不写入的情况下加载现有数据——适用于你需要多个并发只读会话的场景。

同一时间只能有一个会话保存到某个配置文件。如果另一个会话已在保存，你将收到 409 错误。你仍然可以使用 saveChanges: false 打开同一个配置文件，或稍后重试。

浏览器状态会在交互会话停止时保存。完成后请务必停止该会话，以便该配置文件可以被复用。

验证持久化

你可以在一个会话中写入 localStorage 值并停止该会话，然后在第二个使用相同配置文件的会话中读取该值，以此测试持久化，而无需依赖真实的登录流程。

cURL

# 会话 1：写入浏览器状态并保存
RESPONSE=$(curl -s -X POST "https://api.firecrawl.dev/v2/scrape" \
  -H "Authorization: Bearer $FIRECRAWL_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://example.com",
    "formats": ["markdown"],
    "profile": { "name": "profile-validation", "saveChanges": true }
  }')

SCRAPE_ID=$(echo "$RESPONSE" | jq -r ".data.metadata.scrapeId")

curl -s -X POST "https://api.firecrawl.dev/v2/scrape/$SCRAPE_ID/interact" \
  -H "Authorization: Bearer $FIRECRAWL_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "code": "await page.evaluate(() => { localStorage.setItem(\"firecrawlProfileCheck\", \"saved\"); document.cookie = \"firecrawl_profile_check=saved; path=/; max-age=3600\"; return localStorage.getItem(\"firecrawlProfileCheck\"); });"
  }'

curl -s -X DELETE "https://api.firecrawl.dev/v2/scrape/$SCRAPE_ID/interact" \
  -H "Authorization: Bearer $FIRECRAWL_API_KEY"

# 会话 2：以只读模式加载相同配置文件并验证值
RESPONSE=$(curl -s -X POST "https://api.firecrawl.dev/v2/scrape" \
  -H "Authorization: Bearer $FIRECRAWL_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://example.com",
    "formats": ["markdown"],
    "profile": { "name": "profile-validation", "saveChanges": false }
  }')

SCRAPE_ID=$(echo "$RESPONSE" | jq -r ".data.metadata.scrapeId")

curl -s -X POST "https://api.firecrawl.dev/v2/scrape/$SCRAPE_ID/interact" \
  -H "Authorization: Bearer $FIRECRAWL_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "code": "await page.evaluate(() => ({ localStorage: localStorage.getItem(\"firecrawlProfileCheck\"), cookie: document.cookie.includes(\"firecrawl_profile_check=saved\") }));"
  }'

curl -s -X DELETE "https://api.firecrawl.dev/v2/scrape/$SCRAPE_ID/interact" \
  -H "Authorization: Bearer $FIRECRAWL_API_KEY"

第二个交互响应应显示 localStorage 为 "saved"，cookie 为 true。

通过 API 创建的 Profiles 可能暂时还不会显示在 Dashboard > Interact > Profiles 中。Dashboard 目前尚未提供通过 API 创建的持久化 Profiles 的完整列表。

何时使用什么

使用场景	推荐	原因
网页搜索	Search	专用搜索端点
从 URL 获取干净内容	Scrape	一次 API 调用，无需会话
在页面上点击、输入、导航	交互 (prompt)	只需用英文描述即可
提取交互后的数据	交互 (prompt)	无需选择器
复杂的抓取逻辑	交互 (code)	完整的 Playwright 控制能力

交互与浏览器沙箱：交互构建在与浏览器沙箱相同的基础设施之上，但针对最常见的使用模式提供了更好的界面——先抓取页面，再进一步深入。当你需要一个不绑定到特定抓取任务的独立浏览器会话时，浏览器沙箱更合适。

定价

仅代码 (无 prompt) — 每个会话分钟 2 个额度
使用 AI prompts — 每个会话分钟 7 个额度
抓取 — 单独计费 (每次抓取 1 个额度，外加任何特定格式的费用) 。

API 参考

执行交互 — POST /v2/scrape/{scrapeId}/interact
停止交互 — DELETE /v2/scrape/{scrapeId}/interact

请求体 (POST)

字段	类型	默认值	描述
`prompt`	`string`	—	提供给 AI 代理的自然语言任务。若未设置 `code`，则此项必填。最多 10,000 个字符。
`code`	`string`	—	要执行的代码 (Node.js、Python 或 Bash) 。若未设置 `prompt`，则此项必填。最多 100,000 个字符。
`language`	`string`	`"node"`	`"node"`、`"python"` 或 `"bash"`。仅在使用 `code` 时生效。
`timeout`	`number`	`30`	超时时间，单位为秒 (1–300) 。
`origin`	`string`	—	用于活动追踪的调用方标识符。

响应

字段	描述
`success`	如果执行已完成且未出现错误，则为 `true`
`liveViewUrl`	浏览器会话的只读实时视图 URL
`interactiveLiveViewUrl`	交互式实时视图 URL (查看者可控制浏览器)
`output`	代理对你的 `prompt` 给出的自然语言回答。仅在使用 `prompt` 时返回。
`stdout`	代码执行的标准输出
`result`	sandbox 的原始返回值。对于 `code`：最后一个求值的表达式。对于 `prompt`：代理用于生成 `output` 的原始页面快照。
`stderr`	标准错误输出
`exitCode`	退出码 (`0` = 成功)
`killed`	如果执行因超时而终止，则为 `true`

有反馈或需要帮助？请发送邮件至 help@firecrawl.com，或通过 Discord 联系我们。

AI prompts

代码执行

Live view

​工作原理

​快速开始

​通过 prompt 交互

​保持 prompt 简短且聚焦

​运行代码

​Node.js (Playwright)

​Python

​Bash (agent-browser)

​实时视图

​交互式实时视图

​会话生命周期

​创建

​复用

​清理

​使用 Scrape + 交互 的持久化配置文件

​验证持久化

​何时使用什么

​定价

​API 参考

​请求体 (POST)

​响应

工作原理

快速开始

通过 prompt 交互

保持 prompt 简短且聚焦

运行代码

Node.js (Playwright)

Python

Bash (agent-browser)

实时视图

交互式实时视图

会话生命周期

创建

复用

清理

使用 Scrape + 交互的持久化配置文件

验证持久化

何时使用什么

定价

API 参考

请求体 (POST)

响应