Skip to main content

Playwright

Playwright adds rendered web page extraction for pages that need browser execution before their content can be read.

Tools

ToolPurpose
web-page-content-extractorFetch a web page and extract its main text content.

web-page-content-extractor Inputs

InputRequiredDescription
urlyesHTTP or HTTPS URL to fetch.

Properties

Required Playwright properties:

VariableDescription
LIGHTFLARE_TOOLS_PLAYWRIGHT_ENABLEDSet to true.
LIGHTFLARE_TOOLS_PLAYWRIGHT_ENABLED_TOOLSComma-separated tool allowlist. Use web-page-content-extractor.

Optional Properties

VariableDefaultDescription
LIGHTFLARE_TOOLS_PLAYWRIGHT_ENABLED_TOOLS[]Exposed Playwright tools. Empty exposes no Playwright tools.
LIGHTFLARE_TOOLS_PLAYWRIGHT_WORKER_COUNT2Playwright worker count.
LIGHTFLARE_TOOLS_PLAYWRIGHT_MAX_PENDING_TASKS_PER_WORKER8Max queued Playwright tasks per worker.
LIGHTFLARE_TOOLS_PLAYWRIGHT_QUEUE_ACQUIRE_TIMEOUT_MS1000Playwright queue acquire timeout.
LIGHTFLARE_TOOLS_PLAYWRIGHT_PAGE_TIMEOUT_MS10000Playwright page timeout.
LIGHTFLARE_TOOLS_PLAYWRIGHT_HEADLESStrueRun Playwright headless.
LIGHTFLARE_TOOLS_PLAYWRIGHT_ACCEPT_DOWNLOADSfalseAllow Playwright downloads.
LIGHTFLARE_TOOLS_PLAYWRIGHT_BLOCK_PRIVATE_NETWORK_TARGETStrueBlock private network browser targets.
LIGHTFLARE_TOOLS_PLAYWRIGHT_ALLOWED_SCHEMEShttp,httpsAllowed URL schemes.
LIGHTFLARE_TOOLS_PLAYWRIGHT_ALLOWED_HOSTS[]Host allowlist. Empty allows all hosts permitted by the URL policy.