Upload
aleph-archives
View
155
Download
0
Embed Size (px)
Citation preview
WEB ARCHIVING FOR
BUSINESS INTELLIGENCE
Making archives dynamic
• Some suggestions from the web archiving
community made us realize that those
digital web archives can be used for other
purposes other than preservation and/or
compliance.
• The Live Page Monitoring service was built
on top of CAMA to provide near real-time
notifications when user-defined keywords
show-up on certain web pages.
CAMA® monitoring service
CAMA® is part of the four main decisional steps in BI:
Data extraction (Web Scrapping): To obtain significant results, one must gather Web-based data wherever they remain. When connected to Web data sources, CAMA®
gathers relevant data and centralizes it in its distributed data warehouse.
Strengthening: Once centralized, data must be analyzed and distributed inside the data warehouse. This pre-processing makes it easier for CAMA® tools to access data, since data warehouses are automated.
Processing: From a request based upon dedicated search forms, the analysis tool collates related data to find relevant information.
Reporting: This step is about broadcasting and presenting information
CAMA® Monitors: daily monitoring of 20 online newspapers about subjects related to:Terrorism, UK royal wedding, European crisis and HightTech security (archives of 5/4/2011)
Define list of
keywords to track
in
any language
Add tags and
comments
to interesting pages
Create
monitors Modify
selected
pages
Export page in
PDF and PNG
Format
Navigate in
the page and
click on
interesting
links
Get instant
notifications
each time a
match is
found.
For More information visit our website
www.aleph-archives.com