35
Discovering temporal hidden contexts in web sessions for user trail prediction Julia Kiseleva (Eindhoven University of Technology), Hoang Thanh Lam (Eindhoven University of Technology), Mykola Pechenizkiy (Eindhoven University of Technology), Toon Calders (Université Libre de Bruxelles)

Tempweb13

Embed Size (px)

Citation preview

Page 1: Tempweb13

Discovering temporal hidden contexts in web sessionsfor user trail prediction

Julia Kiseleva (Eindhoven University of Technology),Hoang Thanh Lam (Eindhoven University of Technology),Mykola Pechenizkiy (Eindhoven University of Technology),Toon Calders (Université Libre de Bruxelles)

Page 2: Tempweb13

User intention prediction

Search Refine Search

PaymentClick Product

ViewClick ?

What is next?

• Does exist any contextual information?• How we can discover it?• How we can utilize it?

Page 3: Tempweb13

Contextual Partitioning

• Approaches to create local models:o Horizontal partitions

Users from Europe

Users from South America

Session 1 Search

Refine Search

Click on Banner

Product View

Payment

Session 3 Product View

Payment

Session 3 Search

Refine Search

Refine Search

Click on Banner

Session 4 Search

Refine Search

Click on Banner

Product View

Payment

Session 5 Product View

Click on Banner

Search

Page 4: Tempweb13

User navigation graph

Search

Refine Search

Payment

Click on Banner

Product View

1.0 2/3

1/3

1/21/4

Drop out

3/4

1/4

1

1/4

Page 5: Tempweb13

Horizontal Partitioning

Page 6: Tempweb13

• Approaches to create local models:o Horizontal partitiono Vertical partition :

• Two types of behavior:o Ready to buy – (Product View, Payment)

Session 1

Search Refine Search

Click on Banner

Product View

Payment

Session 2

Product View

Payment

Session 3

Search Refine Search

Refine Search

Click on Banner

Session 4

Search Refine Search

Click on Banner

Product View

Payment

Session 5

Product View

Click on Banner

Search

Contextual Partitioning

Page 7: Tempweb13

• Approaches to create local models:o Horizontal partitiono This work is about Vertical partition:

• Two types of behavior:o Ready to by – (Product View, Payment)o Just browsing – (Search, Refine Search, Click on Banner)

Session 1

Search Refine Search

Click on Banner

Product View

Payment

Session 2

Product View

Payment

Session 3

Search Refine Search

Refine Search

Click on Banner

Session 4

Search Refine Search

Click on Banner

Product View

Payment

Session 5

Product View

Click on Banner

Search

Contextual Partitioning

Page 8: Tempweb13

• Approaches to create local models:o Horizontal partitiono Vertical partition:

• Two types of behavior:o Ready to buy – (Product View, Payment)o Just browsing – (Search, Refine Search, Click on Banner)

Session 1

Search Refine Search

Click on Banner

Product View

Payment

Session 2

Product View

Payment

Session 3

Search Refine Search

Refine Search

Click on Banner

Session 4

Search Refine Search

Click on Banner

Product View

Payment

Session 5

Product View

Click on Banner

Search

Contextual Partitioning

Page 9: Tempweb13

Problem Description

Timeline

Search Refine Search

PaymentClick Product

ViewSearch Click

Context ``Find information”

Context ``Buy product”

What is next?Probably user will change

intent?

Page 10: Tempweb13

Problem Description

Timeline

Search=A

Refine Search

=B

PaymentClick

=C

Product View

=D

Search Click

Context ``Find information”

Context ``Buy product”

Page 11: Tempweb13

Context-Awareness Example

Contextual Feature:User intent

DATA

Contextual Categories

Individual Models

Mapping G

Mapping H Context

Discovery

Ready to buy

Just browsing

Page 12: Tempweb13

Context-Awareness

………………

Contextual features

DATA Environment

Contextual Categories

Individual Learners

Mapping G

Mapping H

Context Discovery

Context-Awareness: (G,H)

Page 13: Tempweb13

Temporal Context-Awareness

G H

Temporal Context-Awareness: (G,H,ti)

……

..

G

G

H

H

Web

Sessio

n

S

Contextual features

Contextual Categories

Individual Models

Page 14: Tempweb13

Discovery hidden contexts

Web log

Train:To train

predictive models

Validation:To find Best

clusters

Test:To derive final

accuracy

To find a “Best”

clusters

Calculate final accuracy

To train local predictive

models

Page 15: Tempweb13

Optimization problem

Page 16: Tempweb13

a b c d abababababcdcdababcdcdcd

The number of true predictions = 0

a b c d

1.0 1.0 1.0

1.0

Hierarchical clustering

Page 17: Tempweb13

a b c d

ab

abababababcdcdababcdcdcd

The number of true predictions = 12

a b c d

1.0

1.0

1.0 1.0

Hierarchical clustering

Page 18: Tempweb13

a b c d

ab

abababababcdcdababcdcdcd

The number of true predictions = 20

a b c d

1.0

1.0

1.0

1.0

cd

Hierarchical clustering

Page 19: Tempweb13

a b c d

ab

abababababcdcdababcdcdcd

The number of true predictions = 20

a b c d

1.0

1.0

1.0

1.0

cd

abcd

Stop as long as no additional prediction benefit of merging

Hierarchical clustering

Page 20: Tempweb13

Mastersportal.eu - Homepage

Quick Search

Banner Click

Universities in the spotlight

Page 21: Tempweb13

Mastersportal.eu - Search

Refine Search

Click on Program is Search Result

Click on University

Click on Country

Page 22: Tempweb13

DatasetDate May 2012

#sessions

350.618

#requests

1.775.711

#sessions from Eu

159.991

Page 23: Tempweb13

Results

Page 24: Tempweb13

Explaining the results

eb C d

a1.0

1.0

0.9

0.1

1.0

1.0

Page 25: Tempweb13

Explaining the results

eb C d

a1.0

1.0

0.9

0.1

1.0

1.0

Explanation:1. Users’ behavior doesn’t follow Markov

property2. Ambiguous Transition Matrix

Page 26: Tempweb13

Explaining the results

University on nearby universities

Related Programs

0.315

0.299

Quick Search

0.306

Related Programs

Related Programs

Click on Program

0.283

Page 27: Tempweb13

Resulted ClustersId Summary Cluster

1 Intensive Search Basic Search, Refine Search, EmptySearch Result

2 Explore informationrelated toprogram

Program impression in search result,Banner click, Program click ,Click on university link

3 Start ofbrowsing

University Spotlight impression,Quick search

4 Explorecountry information

File view, Click on country link

5 Exploresearch result

Program impressions in search results,University impression onnearby universities

6 Explore program Program in landing page, Submitinquiry

7 Outlier Submit question, X-node

Page 28: Tempweb13

Site Map• Page is represented as set of possible actions

o Example: Homepage is (Quick Search, University Spot light impression, Question Submit)

o Calculate Jaccard similarity between Page and Cluster

Page 29: Tempweb13

Site map

Page 30: Tempweb13

Conclusion• We formulate the problem of temporal context

discovery as an optimization problem.

• A hierarchical clustering method is proposed to determine the optimal number of hidden contexts and mine temporal contexts.

• We show a real-world use case in which the contexts as we defined them do exist and are useful for prediction.

Page 31: Tempweb13

Future works• Testing our method on another datasets

• Introducing a mechanism for detecting context switching within a web-session

• Considering multidimensional contextualfeatures.

Page 32: Tempweb13

Thank you!• Context identification and integration

it into prediction models• Accurately predicting users’ desired

actions and understanding behavioral patterns of users in various web-applications

• Personalization and adaptation to diverse customer need and preferences

• Accounting for the practical needs within the considered application areas.

Page 33: Tempweb13

Conclusion• We formulate the problem of temporal context

discovery as an optimization problem.

• A hierarchical clustering method is proposed to determine the optimal number of hidden contexts and mine temporal contexts.

• We show a real-world use case in which the contexts as we defined them do exist and are useful for prediction.

Questions?

Page 34: Tempweb13

Outline• Problem description• Def. context-aw => temporal CA• Motivational example• Objectives• Method: HC• Data• Results• Summary• Conclusion• Future works

Page 35: Tempweb13

• Web sessions:• Type of events:• Web session is an

ordered sequence of events:

• Space of contextual features:

• Predictive model:• Evaluation function:• Contextual categories: