Upload
promptcloud
View
2.229
Download
0
Tags:
Embed Size (px)
Citation preview
OUTSOURCING WEB CRAWLS?
READ THIS!
USE DATA OFTEN?
Have a DaaS Provider on board?
RELOOK YOUR SLA!
Scope, Quality, Responsibilities are agreed upon
POINTS TO PONDER…
SERVICE-LEVEL AGREEMENT DEFINES SERVICE
CRAWLABILITYCrawls must:
run smooth | mitigate roadblocks | be adaptive
CLICK HERE FOR ADAPTIVE CRAWLS
SCALABILITYProblem changes by an order of magnitude
Always opt a scalable solution
Ensure provider is agnostic to anticipated scale
DATA STRUCTURINGValidate how meticulous DaaS provider can get when extracting information
Always add quality checks at your end to avoid compromises
DATA COVERAGECrawls can end up being missed or skipped
Issues can be cured by keeping logs
Discuss tolerance levels to configure system accordingly
AVAILABILITYRight data at the right time is important!
Notify at the beginning the uptimes you want
ADAPTABILITYData requirements are subject to market dynamism
Check how easily provider can adapt (or not!) to changing data structures, sources or schema
MAINTENANCEMonitoring crawl & structuring of data is important!
Avoid hassles of maintaining crawl bugs & fixes by asking what’s covered in SLA
PROMPTCLOUD COVERS THIS & MORE…
CLICK TO VISIT US
CLICK FOR CUSTOM CRAWLING & DATA EXTRACTION
site-specific crawl
mass-scale crawl
Twitter crawl