Skip to content

Open Page

Naibo Wang edited this page Feb 13, 2023 · 6 revisions

"Open Page" is the first operation of a workflow, which means to open a web page.

Define Operation (at the Task Design stage)

This operation will be automatically added to the workflow when we click the "Start Design" button when designing a task.

image

Operation Properties

This section shows the available properties of "Open Page" operation in the Workflow Manager.

Option Name: the option name, click the "Confirm" button to refresh the name after modification.

Use link inside the Loop: whether open webpages with links set in the "Loop" operation instead of the "All links filled" defined in this operation. This option will only appear when "Open Page" operation is inside the "Loop" operation. E.g., we can use this option to open different web pages with more than one (such as 3) links to collect 3 different product information, details will be provided later.

url: default URL of the operation, only serve as a description and will not be used when executing the task.

All links filled: links will be opened, every link takes one line. We can set many URLs inside this option, then when invoking the task, EasySpider will repeat the whole workflow n times if we put n URLs into this option.

After executed, whether scroll down: Yes or No, means when the web page is loaded, whether or not to scroll the web page to the bottom. This is used when some web page requires user to scroll down to get all available contents, such as twitter where users will view posts one by one by scroll their mouses down.

Scroll Times: when set "Yes" for the scroll down option above, how many times will EasySpider automatically scroll down to the bottom, because in some web pages, we often need to scroll many times to make all contents loaded.

Seconds after executed: how long should EasySpider wait after automatically opens a web page.

Execute Operation (at the Task Invocation stage)

Every "Open Page" operation will be mapped as an input parameter of the task, which can be modified when invoking the task.

image

User can modify the value freely when invoking the task:

image

Then EasySpider will:

  1. Open URL: https://www.google.com/search?q=1, then executing all subsequent operations in the task workflow, until the whole workflow is finished.

  2. Open URL: https://www.google.com/search?q=2, then executing all subsequent operations in the task workflow, until the whole workflow is finished.

  3. Open URL: https://www.google.com/search?q=3, then executing all subsequent operations in the task workflow, until the whole workflow is finished.

I.e., for n links, EasySpider will execute the whole task workflow n times with n different URLs.

Clone this wiki locally