Free trial: As part of the 7-day free trial, you’re entitled to 1,000-page loads
Update available: A new version of the scraper is available. If there is no update button, you have the latest version.
Properties: This is where you’ll see all the scraper properties. Learn more
Delivery preferences: Choose your desired file format, delivery method, and notification settings. Learn more
Output configuration/ Schema: Here, you can go back to edit your output definitions. Learn more
Limitations - Our collectors have a limitation of 100 parallel-running jobs. When more than 100 jobs are triggered, the additional jobs are placed in a queue and wait until the earlier ones finish.
Initiate scraperTo start collecting the data, you have three options:A. Initiate by API
B. Initiate manually
C. Schedule a scraperGet collection resultsOnce the data collection is completed, click the “three dots” icon and select “Statistics” to access the results and download the data.
Realtime job input and output cannot be downloaded since it is not stored on our end
The statistics page presents essential information about the success of the data collection. Below is a list of all the terms included in the statistics table:Statistics actions menu
3 dots
Here you can perform different functions with the data collection job:
The statistics page presents essential information about the success of the data collection. Below is a list of all the terms included in the statistics table:
Job ID - The unique id of the collection
Trigger - The person who initiated the data collection and how (API, manually or scheduled)
Inputs - The number of inputs inserted into the collection
Records - The number of results collected
Failed - The number of pages failed to be crawled
Success rate - The percentage of the results that were successfully collected
Queued at - The queue timestamp
Started at - The date and time when the scraper began collecting
Finished at - The date and time when the scraper finished collecting
Job time - The length of time it took to complete
Estimated time left - The amount of time left until collection is complete
Queue - The name of the job given in the trigger behavior (Queue name)