WebData
Extract Web Data
Last updated
Extract Web Data
Last updated
This tool allows scraping the HTML or text content of an URL into the workflow. It can be used to extract data from any website.
Select WebData API by entering /WebData
The WebData tool can be used to extract data from any website and return the text or the original HTML. The supported query is:
The supported query is:
Like any other tools, WebData can be applied to a list of URLs stored in the table in the previous cell by doing:
Like any other tools, web data can be applied to a list of URLs stored in the table in the previous cell by doing:
Model: "These are two models to choose from: GPT-3 and GPT-4."
Browser: "These are two options you can choose from:
Static pages - Faster: for basic webpages without actions such as clicking, filtering, etc.
Dynamic pages - Slower: used for webpages that require actions, like LinkedIn, eBay, etc."
Result type: You can choose HTML or Text for your work.
Let's start with this example:
I chose the GPT-3 model and the Dynamic browser, then the Result type returned HTML.