Webscraping and RPA

click Click

Description

Click Node does a mouse click on a given coordinates.

Parameters

Two parameters, a pair of coordinates count in pixels from upper left corner, are required:

Parameter

Type

Description

X (required)

integer

A horizontal coordinate count in pixels from left edge of the display.

Y (required)

integer

A vertical coordinate count in pixels from the upper edge of the display.

click_id Click Id Element

Description

Clicks on the website element specified by its ID.

Parameters

Parameter

Type

Description

Id

string

The ID of the element to be clicked upon.

click_image Click Image

Description

Finds provided image on currently scraped page and clicks at point that has coordinates (image_pos_X + X, image_pos_Y + Y). In other words icon finds the coordinates of image on page and clicks at point (X, Y), relatively to found image.

Parameters

Parameter

Type

Description

Image path (required)

string

Path to image

Click offset X (required)

int/float

Relative to image offset on axis X

Click offset Y (required)

int/float

Relative to image offset on axis Y

click_name Click Name Element

Description

Clicks on the website element containing the given text.

Parameters

Parameter

Type

Description

Text (required)

string

The text to be searched on the website and clicked upon.

click_xpath Click XPath Element

Description

Clicks on the website element specified by its XPath.

Parameters

Parameter

Type

Description

XPath (required)

string

The XPath of the element to be clicked upon, e.g. //div[@class=”media-select__input-content”]//button.

get_current_url Close Browser

Description

Closes browser after all scraping processes are done.

download_image Download Image

Description

Downloads image from URL.

Parameters

Parameter

Type

Description

Image URL (required)

string

URL with image

Output filename (required)

string

Name of image to be saved

download_images_xpath Download Images XPath

Description

Downloads image using XPath. Can be used for downloading multiple images at once.

Parameters

Parameter

Type

Description

XPath (required)

string

XPath of image element

Output filename (required)

string

Name of image to be saved. In case of downloading multiple images will be used as a directory name.

page_source Extract Page Source

Description

Extracts the HTML source of currently scraped web page and saves it to new variable inside platform.

Parameters

Parameter

Type

Description

Output variable (required)

string

Name of variable to be created inside platform

get_current_url Get Current URL

Description

Saves the URL of web page, which is being currently scraped, to new variable inside platform.

Parameters

Parameter

Type

Description

Output variable (required)

string

Name of variable to be created inside platform

load_website Load Website

Description

Loads website with a given URL.

Parameters

One parameter is required:

Parameter

Type

Description

URL (required)

string

URL of the loaded website, e.g. https://forloop.ai/.

Take screenshot

boolean

If screenshot is needed (e.g. for using in Browser View)

open_browser Open Browser

Description

Opens the system default web browser (for further usage such as Load Website etc.). Allows to choose driver to use.

For now available: Firefox (Geckodriver), Chrome.

Note: To use a driver an appropriate web browser should be installed. Driver itself will be installed automatically.

Parameters

Parameter

Type

Description

Driver

driver (combobox)

Driver type (Firefox, Chrome)

Show browser

boolean

Show browser GUI

page_source Refresh Page Source

Description

Refreshes page source of currently scraped page.

scan_web_page Scan Web Page

Description

Searches for different elements on page and highlights them in Browser View.

Available types of elements:

  • Tables

  • Bullet lists

  • Texts

  • Headlines

  • Links

  • Images

  • + elements with custom XPath

Parameters

Parameter

Type

Description

Tables

boolean

Include tables in search

Bullet lists

boolean

Include bullet lists in search

Texts

boolean

Include texts in search

Headlines

boolean

Include headlines lists in search

Links

boolean

Include links in search

Images

boolean

Include images in search

Custom XPath (optional)

string

Include elements with custom XPath

scroll_web_page Scroll Web Page

Description

Scrolls web page by X pixels in directions Up/Down.

Parameters

Parameter

Type

Description

Scroll to

string (combobox)

The direction of scrolling (Up/Down)

Scroll by (required)

int

Length of scrolling in pixels

Scroll max

boolean

Scroll to the top / end of page

wait Wait

Description

Wait Node creates a pauses between pipeline steps. It waits (and does nothing, obviously) for a specified amount of ms and then lets the next step in a pipeline to proceed.

Parameters

Two parameters are required:

Parameter

Type

Description

Miliseconds (required)

int/float

Waiting time interval in miliseconds, e.g. entry: 1000 1000 ms (1 s) waiting time before another step.

Add random ms

int/float

Adds a random real picked from a uniform distribution defined on the interval (- entered value, + entered value).

wait_until_element_is_located Wait Until Element Is Located

Description

Creates a pause until a certain element is located. Can be useful for instance when dealing with popup windows on page.

Parameters

Parameter

Type

Description

XPath (required)

string

XPath of element that should appear on page

write Write

Description

Types the text given via the entry box as if it were typed on the regular keyboard.

Parameters

Parameter

Type

Description

Text (required)

string

The text to be typed by the write icon