🌐WebData

Extract Web Data

This tool scrapes the HTML or text content of a URL into the workflow. It can be used to extract data from any website.

Introduction

Select the WebData API by entering /WebData.

The WebData tool extracts data from any website and returns either the text or the original HTML.

Examples

The supported query is:

go to some_url

Like any other tool, WebData can be applied to a list of URLs stored in a table column in the previous cell:

go to @col_name
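
To make the column reference concrete, here is a minimal Python sketch of how a query such as `go to @col_name` could be expanded into one query per table row. It only illustrates the idea and is not the tool's actual implementation; `expand_query`, the sample rows, and the example URLs are all hypothetical.

```python
import re

def expand_query(template, rows):
    """Return one query per row, replacing each @name with that row's value.

    Hypothetical helper: the real tool resolves @col_name internally.
    """
    def fill(row):
        # Swap every "@name" token for the value stored under that column.
        return re.sub(r"@(\w+)", lambda m: str(row[m.group(1)]), template)
    return [fill(row) for row in rows]

# Assumed shape of the previous cell's table: one dict per row.
rows = [
    {"col_name": "https://example.com"},
    {"col_name": "https://example.org"},
]

queries = expand_query("go to @col_name", rows)
for q in queries:
    print(q)
```

Each row yields its own query, so the tool would visit every URL in the column in turn.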

The tool has three settings:

  • Model: There are two models to choose from: GPT-3 and GPT-4.

  • Browser: There are two options to choose from:

    • Static pages - Faster: for basic webpages that need no actions such as clicking or filtering.

    • Dynamic pages - Slower: for webpages that require actions, such as LinkedIn or eBay.

  • Result type: Choose HTML or Text for the returned content.
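
The difference between the two result types can be sketched in Python using only the standard library. This is an illustration under assumptions, not how WebData works internally: `TextExtractor`, `to_result`, and the sample page are hypothetical names introduced here.

```python
from html.parser import HTMLParser

class TextExtractor(HTMLParser):
    """Collect visible text, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip = 0  # depth inside script/style elements

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        # Keep only non-empty text that sits outside script/style.
        if not self._skip and data.strip():
            self.parts.append(data.strip())

def to_result(html, result_type="Text"):
    """Return the page as raw HTML or as extracted text (hypothetical helper)."""
    if result_type == "HTML":
        return html
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.parts)

sample = (
    "<html><head><style>p{color:red}</style></head>"
    "<body><h1>Title</h1><p>Hello <b>world</b>.</p></body></html>"
)

print(to_result(sample, "HTML"))  # markup is returned unchanged
print(to_result(sample, "Text"))  # tags and style rules are dropped
```

HTML keeps the page structure for later parsing, while Text is usually easier to pass on to a language model or a summary step.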

Let's start with this example:

Extract the text from website @1.link

I chose the GPT-3 model and the Dynamic browser, and set the Result type to HTML.
