21
ABI Lab Forum 2012 Tomás Pariente Lobo – Atos Spain FIRST European research for web information extraction and analysis for supporting financial decision making

First european research for web information extraction and analysis for supporting financial decision making v0.4

Embed Size (px)

DESCRIPTION

FIRST project introduction at the ABI Lab 2012 forum in Milano.

Citation preview

Page 1: First   european research for web information extraction and analysis for supporting financial decision making v0.4

ABI Lab Forum 2012Tomás Pariente Lobo – Atos Spain

FIRSTEuropean research for web information extraction

and analysis for supporting financial decision making

Page 2: First   european research for web information extraction and analysis for supporting financial decision making v0.4

Vision

Innovation

Tools

Motivation

Page 3: First   european research for web information extraction and analysis for supporting financial decision making v0.4

Why FIRST? - Motivations

3

The most reliable data sources today…

…also have their weakness!They do not consider unstructured data, rumors, market sentiments, etc.

Page 4: First   european research for web information extraction and analysis for supporting financial decision making v0.4

Why FIRST? - Motivations

4

Example: Apple iPhone 1 Announcement on 2007-01-09

Stock prices were skyrocketing after the announcement. However, the announcement could be sensed before…

Page 5: First   european research for web information extraction and analysis for supporting financial decision making v0.4

Why FIRST? - Motivations

5

Example: Market surveillance via FIRST (the Google news case)

September 2008: Google news announced “United Airlines bankruptcy”.

Within 12 minutes stock price decreased 75% wiped out US $ 1bn.

The “news” was actually 6 years old… Plausibility checking will help in identifying hoaxes: consistence with

regulatory news and other sources.

Page 6: First   european research for web information extraction and analysis for supporting financial decision making v0.4

Why FIRST? – MotivationsA growing universe of unstructured data

… how to separate the wheat from the chaff ?

6

Page 7: First   european research for web information extraction and analysis for supporting financial decision making v0.4

Vision

Innovation

Tools

Motivation

Page 8: First   european research for web information extraction and analysis for supporting financial decision making v0.4

FIRST Project

Project facts

Running from October 2010 until September 2013

9 partnersMore than 30 peoplePreliminary results availableMore to come...

Stay tuned (http://project-first.eu/)

8

European-funded research project

Page 9: First   european research for web information extraction and analysis for supporting financial decision making v0.4

Who is behind FIRST?

Industrial partners

SMEs

Academic/Research

Page 10: First   european research for web information extraction and analysis for supporting financial decision making v0.4

FIRST Vision

Visionis to make available the relevant information

of the entire financial information space (including unreliable, unstructured, sentiment sources)

to the decision maker in near-real time in an automated way

10

Page 11: First   european research for web information extraction and analysis for supporting financial decision making v0.4

FIRST Vision

Structured

UnstructuredBlog, analysis, bulletin boards… Unreliable, poor quality, noisy…

AUTOMATION

Acquisition AnalysisProcessing Decision support

Financial Resources

11

Page 12: First   european research for web information extraction and analysis for supporting financial decision making v0.4

Vision

Innovation

Tools

Motivation

Page 13: First   european research for web information extraction and analysis for supporting financial decision making v0.4

Mining the Web for financial texts

Data Acquisition pipeline: Web mining

Streaming

Cleaning

Natural Language preprocessing and entity

extraction

Financial terms, Companies,

Intruments …

Page 14: First   european research for web information extraction and analysis for supporting financial decision making v0.4

Data acquisition after one year

Some numbers176 Web sites2,671 RSS sources~40,000 documents per day>5,000,000 documents by end of 2011o And growing

Essential for future evaluation and analysis

14

Page 15: First   european research for web information extraction and analysis for supporting financial decision making v0.4

Analysing sentiments in Web texts

Document with

sentiment sentences

Aggregatedsentiments

SENTIMENTAGGREGATION

per object and feature

The Analytical Pipeline: Identify, extract, classify, aggregate

Documentwithbasic

annotations

SENTIMENTCLASSIFICATION

per object and feature

Sentiment Sentences

ObjectIndicators

Positive sentiment

15

Page 16: First   european research for web information extraction and analysis for supporting financial decision making v0.4

Supporting the decision making process

Qualitative Modeling

Knowledge Base

Outputs:

Forecasts of volatility or returns, Alert on pump and

dump, Reputation change of a counterpart

Signals,Charts,

Topic Spaces,Topic Trends,

Reports…

Machine Learning

Techniques

Visualization Techniques

FIRSTAcquisition &

Analytical Pipelines Forecasting

Models

The Decision Support techniques: Analysis and visualization

16

Page 17: First   european research for web information extraction and analysis for supporting financial decision making v0.4

Glassbox model

Document

sentencesObjectsFeatures

Sentiment

17

Drill down

Page 18: First   european research for web information extraction and analysis for supporting financial decision making v0.4

Vision

Innovation

Tools

Motivation

Page 19: First   european research for web information extraction and analysis for supporting financial decision making v0.4

The three FIRST use cases & their relevance for the industry

Market Surveillance Capital markets compliance can be automated today using structured data, but

the automation does not take unstructured data into account FIRST will

make use of large volumes of unstructured data into financial compliance; develop automated techniques to better detect market abuse/insider

trading..

Reputational Risk Management No off-the-shelf solutions or methodologies for reputational risk management. FIRST will

provide a sustainable tool for reputational risk monitoring; contribute to break new ground in this field of dramatically high impact in FSI.

Retail Brokerage Today, mainly based on quantitative analysis and key figures. FIRST will

use unstructured data to leverage both information for private investors and sophisticated tools for professional users.

19

Page 20: First   european research for web information extraction and analysis for supporting financial decision making v0.4

20Stay tuned (http://project-first.eu/)

Page 21: First   european research for web information extraction and analysis for supporting financial decision making v0.4

AcknowledgementThe research leading to these results has received funding from the

European Community's Seventh Framework Programme (FP7/2007-2013) under grant agreement n°257928.

THANKS