Recent advances in computational advertising

Recent advances in computational advertising: design and analysis of ad retrieval systems

Evgeniy Gabrilovichgabr@yahoo-inc.comg @y

What is “Computational Advertising”?

• A new scientific sub-discipline that provides the f d i f b ildi li d i l l ffoundation for building online ad retrieval platforms– To wit: given a certain user in a certain context,

find the most suitable ad

• At the intersection of Large scale text analysis– Large scale text analysis

– Information retrieval– Statistical modeling and machine learningStatistical modeling and machine learning– Optimization– Microeconomics

Computational Advertising at Yahoo! ResearchYahoo! Research

Online advertising spending

Textual advertising

1 Ads driven by search keywords –1. Ads driven by search keywords Sponsored Search (a.k.a. “keyword driven ads”, “paid search”, etc.), p , )

2. Ads directly driven by the content of a web page – Content Match (a k a “contextpage – Content Match (a.k.a. context driven ads”, “contextual ads”, etc.)

Textual advertising on the Web is strongly related

to NLP and information retrieval

Sponsored searchText based ads driven by a keyword searchText-based ads driven by a keyword search

Content match adsText-based ads driven by the page contentText-based ads driven by the page content

C t tContent match

Anatomy of an ad

Bid phrases: {SIGIR 2010,computational advertising

computational advertising, Evgeniy Gabrilovich, ...}Bid: $0.10

CreativeDisplay URLDisplay URL

Landing URL: http://research.yahoo.com/tutorials/sigir10_compadv

So when do advertising dollars actually change hands?actually change hands?

CPM t th d i i– CPM = cost per thousand impressions• Typically used for graphical/banner ads

(brand advertising)

– CPC = cost per clickp• Typically used for textual ads

CPT/CPA = cost per transaction/action– CPT/CPA = cost per transaction/action a.k.a. referral fees or affiliate fees

Beyond keyword matching

• Matching ads is relatively simple for explicitly bid keywordsWhat about queries on which there are no bids ?– Advertisers should be able to bid on “broad queries” and/or

“concept queries”– Advertisers need volume – the total amount of searches on bid

phrases is not enough !

• Suppose your ad is “Good prices on Seattle hotels”• Suppose your ad is Good prices on Seattle hotels• Naïve approach: bid on any query that contains the word Seattle• Problems

• “Seattle's Best Coffee Chicago”

• “Alaska cruises start point”

• Ideally: bid on any query related to Seattle as a travel destination

The old school: heuristic ad matchingheuristic ad matching

• Sponsored searchp– Exact match between the query and the bid phrase

of the ad (modulo simple normalization, e.g., stemming)stemming)

– Advertisers cannot possibly bid on all relevant queries (especially rare ones)

• Use advanced match (e.g., through query-to-query rewrites)

• Content match– Extract bid phrases from pages, thus reducing the

problem to exact match Both essentially perform record lookup

Both essentially perform record lookup

The old school (cont’d)

Query Abbey Road

Front end

Query rewriting moduleSimplistic

lyrics

QueryQuery rewrites

query expansion

Exact matchIgnoring (or underusing) the multitude of information

il bl Candidate ads

Revenue d i Ad slate

available

reordering Ad slate

The new approach: knowledge-based ad retrievalknowledge based ad retrieval

• Ad indexing and scoring based on all the information• Ad indexing and scoring based on all the information available (bid terms, title, creative, URL, landing page, ...)– Similar to document indexing in IR

• Use standard IR tools (text preprocessing – tokenization, stemming, entity extraction; inverted indexes etc.)

– Use multiple features of the query and the ad

• Elaborate query expansion

2nd l d i ( ki )• 2nd pass relevance reordering (re-ranking)– Using features not available to the 1st pass model (e.g., set-level

features, click history)

The new approach (cont’d)

Query Miele

Front end

Ad query<Miele, appliances, kitchen,“appliances repair” “appliance parts”Ad query

generation

Ad query

appliances repair , appliance parts ,Business/Shopping/Home/Appliances>Rich query

Ad search engineThe hidden parts of ads (bid phrases +

First pass retrieval

Relevance

landing pages) allow us to augment the ads (cf. query

Candidate ads

Revenue reordering Ad slate

Relevance reorderingexpansion)

Research questions

Should we show ads

How to select How to

index thequestions show ads at all?relevant

index the ad corpus?Can we generate bid

phrases (or even entire ad campaigns)

automatically?

Wh t i thWhat is the interplay between the organic and

sponsored results?

Competing for users’ attention:On the interplay between organic andOn the interplay between organic and

sponsored search results(WWW 2010 w Danescu Niculescu Mizil et al )(WWW 2010, w. Danescu-Niculescu-Mizil et al.)

The interplay between ads and organic resultsorganic results

“... in an information-rich world, the wealth of information means a dearth of something else: a scarcity of whatever it is thatdearth of something else: a scarcity of whatever it is that information consumes. What information consumes is rather obvious: it consumes the attention of its recipients. Hence a wealth of information creates a poverty of attention and a need to allocate that attention efficiently among theneed to allocate that attention efficiently among the overabundance of information sources that might consume it.”

-- Herbert Simon, “Designing Organizations for an Information-Rich World”, 1971.,

• Is there competition for clicks between ads and organic results ?• Do users prefer ads that are similar to the organic results, or do

they prefer diversity ?they prefer diversity ?

We found that the nature of this interplay depends on the type of the query

on the type of the query

Relation between the CTR of ads and the CTR of organic resultsand the CTR of organic results

• Negative correlation (competition)g ( p )– Users are only willing to spend limited time and effort on

each query

P iti l ti (d d th lit f• Positive correlation (depends on the quality of results)– Easy query (“online radio”) – decent ads and organicEasy query ( online radio ) decent ads and organic

results – clicks on both– Hard query (“who is giving this talk?”) – poor results on

both sides – no clicks on eitherboth sides no clicks on either

• Independence (null hypothesis)– Users consider ads and organic results as two

gindependent sources of information

Findings: competition + positive correlationcompetition + positive correlation

Decoupling the forces

• Users are willing to invest limited effort in geach query competition

• In order to single out the competition effect, we gtried to explicitly model the amount of effortthe user is willing to investL ff i i l i [B d 2002]• Low effort = navigational queries [Broder, 2002] (27% of queries)

“Pandora radio” “Bank of America”– Pandora radio , Bank of America

• High effort = non-navigational queries“Meaning of life” “academia vs industry”

– Meaning of life , academia vs. industry

Competition clearly exists for navigational queriesnavigational queries

We also examined differentWe also examined different degrees of navigationality:

the less navigational the query is, the less competition we

observed

Another viewpoint:Do users prefer ads that are more similar to the organic results or more diverse ads?the organic results or more diverse ads?

• Both have been argued for in prior workBoth have been argued for in prior work• Preference for similarity

– Ads are more likely to be relevant– This assumption is often made in query

i f d ti i [B d t l 2008]expansion for advertising [Broder et al., 2008]

• Preference of diversity– Diversity among organic search results has

often been shown to be desirable (e.g., entire i di it @ WWW 2010)

session on diversity @ WWW 2010)

We found evidence for users’ preferring both diversity and similaritybot d e s ty a d s a ty

So we need to dig deeper

again ...

Overlap measured using the Jaccard

coefficient

between titles of ads and organic

results

Let’s break down by navigationality againby navigationality again

Break down by navigationality (cont’d)(cont d)

Counterintuitive ?

Responsive and incidental ads

• Responsive ads directly address the user’sResponsive ads directly address the user s information need

Incidental ads are only somewhat related to the– More likely to be similar to the organic results

• Incidental ads are only somewhat related to the user’s information need– Unreasonable as organic results but ok for adsUnreasonable as organic results, but ok for ads

• Example: query = “free internet radio”

– More likely to be different from the organic results

• Example: query = free internet radio– Responsive: “Pandora Internet Radio”– Incidental: “Discount Bose Computer Speakers”

– Incidental: Discount Bose Computer Speakers

Now it all make sense ...

Using the featuresUsing the features that quantify this

interplay, we improved the accuracy of CTRaccuracy of CTR prediction by 5%

Summary

1 The financial scale is huge1. The financial scale is huge2. Advertising is a form of information3. Finding the “best ad” is an information

retrieval problem Multiple, possibly contradictory utility functions Classical IR needs significant adaptation

4. The optimal solution requires extensive use of external knowledge

Th k !Thank you!gabr@yahoo-inc.com

http://research.yahoo.com/~gabr

This talk is Copyright Yahoo! 2010.Y h ! d th A th t i ll i ht i l diYahoo! and the Author retain all rights, including

copyright and distribution rights. No publication or further distribution in full or in part is permitted

without explicit written permission.

The opinions expressed herein are the responsibilityThe opinions expressed herein are the responsibility of the author and do not necessarily reflect the

opinion of Yahoo! Inc.

This talk benefitted from the contributions of many colleagues and co-authors at Yahoo! and elsewhere.

Their help is gratefully acknowledged.

Recent advances in computational advertising

Documents

Modern Advances in Computational and Applied Mathematicsoneil/egfest2017/MACAMflyer.pdf · Modern Advances in Computational and Applied Mathematics ... mathematics of the work of

Challenges in Computational Advertising

Advances in the computational modeling of the … Webseite...Advances in the computational modeling of the gecko adhesion mechanism Roger A. Sauer1 Emmy-Noether Research Group on Computational

RECENT ADVANCES IN COMPUTATIONAL MODELING OF …

University of Toronto - MSRGmsrg.org/publications/presentations/2012/moMW12-Location... · 2012. 12. 3. · 1 Computational advertising (targeted advertising) 2 Computational nance

Display Advertising Landscape MS &E 239 Computational Advertising

Computational Advertising

Computational Advertising-The LinkedIn Way

Intro to Computational Advertising - Stanford University

Recent Advances in Computational

ADVANCES in ENVIRONMENT, COMPUTATIONAL CHEMISTRY

Current Advances in the Methodology and Computational ...web.gps.caltech.edu/classes/ge133/reading/ppv_preprints/sec2-1.pdf · Current Advances in the Methodology and Computational

Tutorial 11 (computational advertising)

Recent Computational Advances in Metagenomics (RCAM’15)maiage.jouy.inra.fr/sites/maiage.jouy.inra.fr/files/u43/booklet.pdf · Recent Computational Advances in Metagenomics (RCAM’15)

Advances in Computational & Experimental Engineering & Sciences

ADVANCES in MATHEMATICAL - wseas.org · ADVANCES in MATHEMATICAL and COMPUTATIONAL ... This year the 14th WSEAS International Conference on Mathematical and Computational Methods

Introduction to Computational Advertising - Stanford University

Deepak-Computational Advertising-The LinkedIn Way

Recent Computational Advances in Metagenomics (RCAM’17)maiage.jouy.inra.fr/sites/maiage.jouy.inra.fr/... · Recent Computational Advances in Metagenomics (RCAM’17) RCAM program

MODERN ADVANCES IN COMPUTATIONAL IMAGING AT MICROWAVE …