Webscraping and RPA¶
Click¶
Description¶
The Click node performs a mouse click at the given coordinates.
Parameters¶
Two parameters are required, a pair of coordinates measured in pixels from the upper-left corner of the display:
Parameter | Type | Description |
---|---|---|
X (required) | integer | Horizontal coordinate, in pixels from the left edge of the display. |
Y (required) | integer | Vertical coordinate, in pixels from the upper edge of the display. |
Click Id Element¶
Description¶
Clicks on the website element specified by its ID.
Parameters¶
Parameter | Type | Description |
---|---|---|
Id | string | The ID of the element to be clicked upon. |
Click Image¶
Description¶
Finds the provided image on the currently scraped page and clicks at the point with coordinates (image_pos_X + X, image_pos_Y + Y). In other words, the node locates the image on the page and clicks at the point (X, Y) relative to the found image.
Parameters¶
Parameter | Type | Description |
---|---|---|
Image path (required) | string | Path to the image. |
Click offset X (required) | int/float | Offset along the X axis, relative to the found image. |
Click offset Y (required) | int/float | Offset along the Y axis, relative to the found image. |
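The offset arithmetic from the description can be sketched in a few lines of Python. This is only an illustration: `click_point` is a hypothetical helper, not part of the platform, and the sketch assumes the reported image position is a single (x, y) reference point, which the description does not specify further.

```python
def click_point(image_pos, offset):
    """Return the absolute click position for a found image.

    image_pos: (image_pos_X, image_pos_Y), where the image was found.
    offset:    (Click offset X, Click offset Y), relative to image_pos.
    """
    image_x, image_y = image_pos
    offset_x, offset_y = offset
    # The click lands at (image_pos_X + X, image_pos_Y + Y).
    return (image_x + offset_x, image_y + offset_y)


# Image found at (100, 200), offsets X=10 and Y=5.
print(click_point((100, 200), (10, 5)))  # (110, 205)
```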
Click Name Element¶
Description¶
Clicks on the website element containing the given text.
Parameters¶
Parameter | Type | Description |
---|---|---|
Text (required) | string | The text to be searched on the website and clicked upon. |
Click XPath Element¶
Description¶
Clicks on the website element specified by its XPath.
Parameters¶
Parameter | Type | Description |
---|---|---|
XPath (required) | string | The XPath of the element to be clicked upon, e.g. //div[@class="media-select__input-content"]//button. |
Download Image¶
Description¶
Downloads an image from a URL.
Parameters¶
Parameter | Type | Description |
---|---|---|
Image URL (required) | string | URL of the image. |
Output filename (required) | string | Name of the image file to be saved. |
Download Images XPath¶
Description¶
Downloads an image located by an XPath. Can be used to download multiple images at once.
Parameters¶
Parameter | Type | Description |
---|---|---|
XPath (required) | string | XPath of the image element. |
Output filename (required) | string | Name of the image file to be saved. When multiple images are downloaded, it is used as the directory name. |
Extract Multiple XPaths¶
Description¶
Finds multiple HTML elements using XPath and exports them.
Parameters¶
Parameter | Type | Description |
---|---|---|
Extraction setup file path (required) | string | Path to a .txt file with XPaths (one per row). |
Output variable (required) | string | Name of the variable to be created inside the platform. |
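The setup file format (one XPath per row) can be illustrated with a short Python sketch. `read_xpaths` is a hypothetical helper, not the platform's own loader; stripping whitespace and skipping blank rows are assumptions made here for robustness, and the XPaths in the demo file are made up.

```python
import os
import tempfile
from pathlib import Path


def read_xpaths(setup_file):
    """Read one XPath per row; blank rows are skipped (an assumption)."""
    lines = Path(setup_file).read_text(encoding="utf-8").splitlines()
    return [line.strip() for line in lines if line.strip()]


# Demo: write a small setup file and read it back.
with tempfile.TemporaryDirectory() as tmp_dir:
    setup_path = os.path.join(tmp_dir, "setup.txt")
    Path(setup_path).write_text(
        "//h1\n//div[@id='main']//a\n", encoding="utf-8"
    )
    print(read_xpaths(setup_path))  # ['//h1', "//div[@id='main']//a"]
```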
Extract Page Source¶
Description¶
Extracts the HTML source of the currently scraped web page and saves it to a new variable inside the platform.
Parameters¶
Parameter | Type | Description |
---|---|---|
Output variable (required) | string | Name of the variable to be created inside the platform. |
Extract Table XPath Element¶
Description¶
Finds a table using XPath and exports it.
The XPath of an element can also be loaded from Browser View into the first parameter (XPath).
Parameters¶
Parameter | Type | Description |
---|---|---|
XPath Rows (required) | string | XPath of a table row. |
XPath Columns (required) | string | XPath of a table column. |
Output variable (required) | string | Name of the variable to be created inside the platform. |
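How a row XPath and a column XPath can together describe a table is sketched below with Python's standard `xml.etree.ElementTree`, which supports only a limited XPath subset. The source does not say how the node combines the two parameters; this sketch assumes the column XPath is evaluated relative to each matched row. `extract_table` and the sample page are hypothetical.

```python
import xml.etree.ElementTree as ET

# A tiny, well-formed sample page (made up for this illustration).
PAGE = """
<html><body>
  <table id="prices">
    <tr><td>apple</td><td>1.20</td></tr>
    <tr><td>pear</td><td>0.95</td></tr>
  </table>
</body></html>
""".strip()


def extract_table(root, xpath_rows, xpath_columns):
    """Return one list per matched row, one cell text per matched column."""
    table = []
    for row in root.findall(xpath_rows):
        # Assumption: the column XPath is relative to the current row.
        table.append([cell.text for cell in row.findall(xpath_columns)])
    return table


root = ET.fromstring(PAGE)
print(extract_table(root, ".//table[@id='prices']/tr", "./td"))
# [['apple', '1.20'], ['pear', '0.95']]
```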
Extract XPath Element¶
Description¶
Finds an HTML element using XPath and exports it.
The XPath of an element can also be loaded from Browser View into the first parameter (XPath).
Parameters¶
Parameter | Type | Description |
---|---|---|
XPath (required) | string | XPath of the element to be extracted. |
Output variable (required) | string | Name of the variable to be created inside the platform. |
Output filename (required) | string | Name of the file to be created. |
Write in file mode | string (combobox) | |
Get Current URL¶
Description¶
Saves the URL of the web page currently being scraped to a new variable inside the platform.
Parameters¶
Parameter | Type | Description |
---|---|---|
Output variable (required) | string | Name of the variable to be created inside the platform. |
Load Website¶
Description¶
Loads the website at a given URL.
Parameters¶
One parameter is required:
Parameter | Type | Description |
---|---|---|
URL (required) | string | URL of the website to load, e.g. https://forloop.ai/. |
Take screenshot | boolean | Whether to take a screenshot (e.g. for use in Browser View). |
Open Browser¶
Description¶
Opens the system default web browser (for further use by nodes such as Load Website). The driver to use can be chosen.
Currently available: Firefox (Geckodriver), Chrome.
Note: To use a driver, the corresponding web browser must be installed. The driver itself is installed automatically.
Parameters¶
Parameter | Type | Description |
---|---|---|
Driver | driver (combobox) | Driver type (Firefox, Chrome) |
Show browser | boolean | Show browser GUI |
Scan Web Page¶
Description¶
Searches for different types of elements on the page and highlights them in Browser View.
Available types of elements:
Tables
Bullet lists
Texts
Headlines
Links
Images
Elements with a custom XPath
Parameters¶
Parameter | Type | Description |
---|---|---|
Tables | boolean | Include tables in search |
Bullet lists | boolean | Include bullet lists in search |
Texts | boolean | Include texts in search |
Headlines | boolean | Include headlines in search |
Links | boolean | Include links in search |
Images | boolean | Include images in search |
Custom XPath (optional) | string | Include elements with a custom XPath |
Scroll Web Page¶
Description¶
Scrolls the web page up or down by a given number of pixels.
Parameters¶
Parameter | Type | Description |
---|---|---|
Scroll to | string (combobox) | The direction of scrolling (Up/Down). |
Scroll by (required) | int | Scrolling distance in pixels. |
Scroll max | boolean | Scroll to the top or the end of the page. |
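One way to read the interplay of the three parameters is sketched below in Python. This is an interpretation, not the platform's implementation: `target_scroll_offset` is a hypothetical helper, and the sketch assumes that Scroll max overrides Scroll by and that the result is clamped to the scrollable range.

```python
def target_scroll_offset(current, direction, scroll_by, scroll_max, max_offset):
    """Return the vertical scroll offset (in pixels) after one scroll step.

    current:    current offset from the top of the page
    direction:  "Up" or "Down" (the "Scroll to" parameter)
    scroll_by:  distance in pixels (the "Scroll by" parameter)
    scroll_max: if True, jump to the top or end of the page (assumed to
                take precedence over scroll_by)
    max_offset: largest possible offset, i.e. the end of the page
    """
    if direction not in ("Up", "Down"):
        raise ValueError("direction must be 'Up' or 'Down'")
    if scroll_max:
        return 0 if direction == "Up" else max_offset
    step = -scroll_by if direction == "Up" else scroll_by
    # Assumption: scrolling stops at the page boundaries.
    return min(max(current + step, 0), max_offset)


print(target_scroll_offset(100, "Down", 250, False, 1000))  # 350
print(target_scroll_offset(100, "Up", 250, False, 1000))    # 0
print(target_scroll_offset(100, "Down", 250, True, 1000))   # 1000
```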
Wait¶
Description¶
The Wait node creates a pause between pipeline steps. It waits (doing nothing) for a specified number of milliseconds and then lets the next step in the pipeline proceed.
Parameters¶
Two parameters are available; the first is required:
Parameter | Type | Description |
---|---|---|
Milliseconds (required) | int/float | Waiting time in milliseconds, e.g. entering 1000 gives a 1000 ms (1 s) wait before the next step. |
Add random ms | int/float | Adds a random real number drawn from a uniform distribution on the interval (-entered value, +entered value). |
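The combination of a fixed wait and a uniform random offset can be sketched as follows. This is a minimal illustration in Python, not the node's actual code: `wait_seconds` is a hypothetical helper, and clamping the result at zero is an assumption the source does not state.

```python
import random


def wait_seconds(milliseconds, add_random_ms=0.0, rng=random):
    """Return the total waiting time in seconds.

    milliseconds:  base waiting time in ms
    add_random_ms: half-width of the uniform jitter interval in ms
    """
    # Jitter is drawn uniformly between -add_random_ms and +add_random_ms.
    jitter = rng.uniform(-add_random_ms, add_random_ms)
    # Assumption: a negative total is treated as no wait at all.
    total_ms = max(0.0, milliseconds + jitter)
    return total_ms / 1000.0


# 1000 ms with up to 200 ms of jitter: between 0.8 s and 1.2 s.
delay = wait_seconds(1000, 200)
print(f"waiting {delay:.3f} s")
# A pipeline step would then pause with time.sleep(delay).
```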