Script.It Documentations

WebData

Extract Web Data


Last updated 1 year ago

This tool scrapes the HTML or text content of a URL into the workflow. It can be used to extract data from any website.

Introduction

Select the WebData API by entering /WebData.

The WebData tool can be used to extract data from any website and return either the text or the original HTML.
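To make the difference between the two result types concrete, here is a minimal Python sketch. It is not Script.It's implementation: the function name `render_result` and the helper class are hypothetical, and the sketch assumes the page's HTML has already been downloaded. It uses only the standard library.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects visible text, skipping the contents of <script> and <style>."""

    SKIPPED_TAGS = {"script", "style"}

    def __init__(self):
        super().__init__()
        self._chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside skipped tags.
        if self._skip_depth == 0 and data.strip():
            self._chunks.append(data.strip())

    def text(self):
        return " ".join(self._chunks)


def render_result(html, result_type="Text"):
    """Hypothetical helper: return the raw HTML, or only its visible text."""
    if result_type == "HTML":
        return html
    if result_type == "Text":
        parser = _TextExtractor()
        parser.feed(html)
        parser.close()
        return parser.text()
    raise ValueError(f"unknown result type: {result_type!r}")


page = (
    "<html><head><style>p {color: red}</style></head>"
    "<body><h1>Title</h1><p>Hello <b>world</b></p></body></html>"
)
print(render_result(page, "Text"))  # Title Hello world
```

Choosing HTML keeps tags and attributes for later parsing, while Text is usually easier to pass to a following step that only needs the page's wording.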

Examples

The supported query is:

go to some_url

Like any other tool, WebData can be applied to a list of URLs stored in a column of the table in the previous cell:

go to @col_name

The tool has the following options:

  • Model: there are two models to choose from, GPT-3 and GPT-4.

  • Browser: there are two options to choose from:

    • Static pages (faster): for basic webpages that need no actions such as clicking or filtering.

    • Dynamic pages (slower): for webpages that require actions, such as LinkedIn or eBay.

  • Result type: choose HTML or Text.

Let's start with this example:

Extract the text from website @1.link

For this run, I chose the GPT-3 model, the Dynamic browser, and HTML as the Result type, so the tool returned the page's HTML.
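A query that references a column, such as @1.link above, can be thought of as running the same extraction once per row. The Python sketch below illustrates that idea only. It is an assumption about the behavior, not Script.It's code: `apply_to_column`, the row dictionaries, and the stand-in `fake_extract` function are all hypothetical.

```python
def apply_to_column(rows, col_name, extract):
    """Run `extract` on the URL found in `col_name` of every row.

    A failure on one URL is recorded in that row's result instead of
    stopping the whole run.
    """
    results = []
    for row in rows:
        url = row[col_name]
        try:
            results.append({"url": url, "result": extract(url), "error": None})
        except Exception as exc:
            results.append({"url": url, "result": None, "error": str(exc)})
    return results


def fake_extract(url):
    """Stand-in for a real page fetch, so the sketch runs offline."""
    if not url.startswith("http"):
        raise ValueError("not a URL")
    return f"<html>content of {url}</html>"


table = [{"link": "https://example.com/a"}, {"link": "broken"}]
for item in apply_to_column(table, "link", fake_extract):
    print(item["url"], "->", item["result"] or item["error"])
```

Recording errors per row is a design choice for this sketch: one bad link in a long list then does not discard the results already collected.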
