Transcript
Page 1: Concept search for e commerce with solr

CONCEPT SEARCH FOR ECOMMERCE WITH SOLR

Mikhail KhludnevPrincipal Engineer,eCommerce Search [email protected]

http://goo.gl/2OHk2

Page 2: Concept search for e commerce with solr

Agenda

● eCommerce Search is Special

● Subphrase Query Parser

● Staged Search

● Match Spotting

Page 3: Concept search for e commerce with solr

Part I eCommerce Search is Special

Page 4: Concept search for e commerce with solr
Page 8: Concept search for e commerce with solr

Plain Text Documents

Else Jeans Skinny Jeans, Colored Denim Dark Green-WashIn a dark green wash perfect for fall, these Else Jeans skinny jeans hit the colored-denim trend right on the mark! Lyocell cotton rayon polyester Lycra Machine washable Imported Low rise: approx. 8 inches Skinny fit Skinny leg Zipper fly with button closure 5-pocket style Dark green wash, colored denim Waistband with belt loops Inseam: approx. 30-1/2 inches

Lorem ipsum dolor sit amet, consectetur adipisicing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.

Page 9: Concept search for e commerce with solr

Plain Text Documents

Else Jeans Skinny Jeans, Colored Denim Dark Green-WashIn a dark green wash perfect for fall, these Else Jeans skinny jeans hit the colored-denim trend right on the mark! Lyocell cotton rayon polyester Lycra Machine washable Imported Low rise: approx. 8 inches Skinny fit Skinny leg Zipper fly with button closure 5-pocket style Dark green wash, colored denim Waistband with belt loops Inseam: approx. 30-1/2 inches

Lorem ipsum dolor sit amet, consectetur adipisicing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.

approx velit

Page 10: Concept search for e commerce with solr

BRAND: Calvin KleinGENDER: Women'sTYPE: JeansFIT: At WaistOCCASION: CasualLEG: Classic StraightWEIGHT: Super Skinny COLOR: Light Wash

Product Catalog Document

Page 11: Concept search for e commerce with solr

Polysemy Problem

Page 12: Concept search for e commerce with solr

Polysemy Problem

COLOR:"Pink" TYPE:"Sweater"

BRAND: "Tomas Pink" TYPE:"Sweater"BRAND: "Pink Lotus" TYPE:"Sweater"

Page 15: Concept search for e commerce with solr
Page 16: Concept search for e commerce with solr

pink jeans by google

https://www.google.com/search?hl=ru&tbm=shop&q=pink+sweater&oq=pink+sweater&gs_l=products-cc.3..0l7j0i5l3.4658.7244.0.7699.12.11.0.1.1.0.298.1153.5j4j1.10.0...0.0...1ac.1.denzbOMlNDg#q=pink+sweater&hl=ru&tbm=shop&ei=oDmIUPXgAoqC4gSylYGACg&start=20&sa=N&num=20&bav=on.2,or.r_gc.r_pw.r_qf.&fp=7898c08770da3440&bpcl=35466521&biw=1280&bih=660

Page 17: Concept search for e commerce with solr

pink jeans by google

https://www.google.com/search?hl=ru&tbm=shop&q=pink+sweater&oq=pink+sweater&gs_l=products-cc.3..0l7j0i5l3.4658.7244.0.7699.12.11.0.1.1.0.298.1153.5j4j1.10.0...0.0...1ac.1.denzbOMlNDg#q=pink+sweater&hl=ru&tbm=shop&ei=oDmIUPXgAoqC4gSylYGACg&start=20&sa=N&num=20&bav=on.2,or.r_gc.r_pw.r_qf.&fp=7898c08770da3440&bpcl=35466521&biw=1280&bih=660

Page 18: Concept search for e commerce with solr

right pink jeans

Page 19: Concept search for e commerce with solr

right pink jeans

Page 20: Concept search for e commerce with solr

Polysemy Problem No.2

BRAND:"Pink Rose" TYPE:"Sweater"

COLOR:"Pink","Rose" TYPE:"Sweater"

sweater

Page 21: Concept search for e commerce with solr

pink rose jeans problem

pink rose sweater

Page 22: Concept search for e commerce with solr

precise pink rose jeans

pink rose sweater

Page 23: Concept search for e commerce with solr

Ranking Can't Help

Page 24: Concept search for e commerce with solr

Ranking Can't Help

Page 25: Concept search for e commerce with solr

Precision is a MUST!

Page 27: Concept search for e commerce with solr
Page 29: Concept search for e commerce with solr

Wrap-Up

● Product Descriptions Differs to Plain Text

● No Query Syntax

● Polysemy Problem

● Ranking Can't Help When Precision is Low

eCommerce Search is Special!

Page 30: Concept search for e commerce with solr

Part II Subphrase Query Parser

Page 31: Concept search for e commerce with solr

Don't tokenize at index time

Page 32: Concept search for e commerce with solr

black and decker toaster oven"black"

"black and"

"black and decker"

... "and"

"and decker"

"and decker toaster"

... "decker"

"decker toaster"

"decker toaster oven"

"toaster oven"

Page 33: Concept search for e commerce with solr

black and decker toaster oven"black"

"black and"

"black and decker"

... "and"

"and decker"

"and decker toaster"

... "decker"

"decker toaster"

"decker toaster oven"

..."toaster oven"

ID: 123BRAND: "black and

decker"TYPE: "toaster

oven"COLOR: white

Page 34: Concept search for e commerce with solr

black and decker toaster oven"black"

"black and"

"black and decker"

... "and"

"and decker"

"and decker toaster"

... "decker"

"toaster""decker toaster oven"

... "oven"

ID: 456BRAND: "black and

decker"TYPE: "oven"COLOR: white

Page 35: Concept search for e commerce with solr

black and decker toaster oven"black"

"black and"

"black and decker"

... "and"

"and decker"

"and decker toaster"

... "decker"

"decker toaster"

"decker toaster oven"

..."toaster oven"

ID: 789BRAND: "decker

toaster"TYPE: "toaster

oven"COLOR: black

Page 36: Concept search for e commerce with solr

fee people sweater

Page 37: Concept search for e commerce with solr
Page 38: Concept search for e commerce with solr

Part III Staged Search

Page 39: Concept search for e commerce with solr

Precision and Recall

Expected

Page 40: Concept search for e commerce with solr

ExpectedActual

Page 41: Concept search for e commerce with solr

ExpectedActual

×× OK

True Positive

False Negative

False Positive

Page 42: Concept search for e commerce with solr

Precision =

ExpectedActual

Actual ∩ Expected

Actual

Page 43: Concept search for e commerce with solr

ExpectedActual

Recall =Actual ∩ Expected

Expected

Page 44: Concept search for e commerce with solr

Expected

Actual

100% Recall

Page 45: Concept search for e commerce with solr

ExpectedActual

Tuning Trade-Off

Page 46: Concept search for e commerce with solr

ExpectedActual

Tuning Trade-Off

Page 47: Concept search for e commerce with solr

ExpectedActual

Tuning Trade-Off

Page 48: Concept search for e commerce with solr

ExpectedActual

Tuning Trade-Off

Page 49: Concept search for e commerce with solr

defaultOperator

AND OR

numFound=100 numFound=1000

Page 50: Concept search for e commerce with solr

defaultSearchField / queryFields

BRAND BRAND TYPE STYLE text

numFound=100 numFound=1000

Page 51: Concept search for e commerce with solr

Stemming & Synonyms

text text_stemmed

numFound=100 numFound=1000

Page 52: Concept search for e commerce with solr

BRAND:(Silver AND Jeans)

BRAND:(Silver OR Jeans)

BRAND_stem:(Silver OR Jean) OR

text_stem:(Silver OR Jean)

ORDer by DESCending Precision

numFound=100

numFound=1000

Page 53: Concept search for e commerce with solr

BRAND:(Silver AND Jeans)

BRAND:(Silver OR Jeans)

BRAND_stem:(Silver OR Jean) OR

text_stem:(Silver OR Jean)

numFound>0

numFound>0

Page 54: Concept search for e commerce with solr

Searching Complexity

O(n log k)

It Depends™

num found

page size

Page 55: Concept search for e commerce with solr

Stages Ratios

50% 30%

20%

Compounds - Exact Match

Compounds - Omit Words

Text Match

Page 57: Concept search for e commerce with solr

Part IVMatch Spotting

Page 58: Concept search for e commerce with solr

Facet Counts

Page 59: Concept search for e commerce with solr

Explain Info15.110098 = (MATCH) sum of: 11.197777 = (MATCH) weight(BRAND:Free People in 28807), product of:

0.81810623 = queryWeight(BRAND:Free People), product of: 9.678479 = idf(docFreq=232, maxDocs=1368899) 0.08452839 = queryNorm

13.687436 = (MATCH) fieldWeight(BRAND:Free People in 28807), product of: 1.4142135 = tf(termFreq(BRAND:Free People)=2) 9.678479 = idf(docFreq=232, maxDocs=1368899) 1.0 = fieldNorm(field=BRAND, doc=28807) 3.912321 = (MATCH) weight(PRODUCT_TYPE:SWEATER in 28807), product of:

0.5750671 = queryWeight(PRODUCT_TYPE:SWEATER), product of: 6.8032427 = idf(docFreq=4130, maxDocs=1368899) 0.08452839 = queryNorm

6.8032427 = (MATCH) fieldWeight(PRODUCT_TYPE:SWEATER in 28807), product of: 1.0 = tf(termFreq(PRODUCT_TYPE:SWEATER)=1) 6.8032427 = idf(docFreq=4130, maxDocs=1368899) 1.0 = fieldNorm(field=PRODUCT_TYPE, doc=28807)

Page 60: Concept search for e commerce with solr

Match Spotting

ExplainInfo

FacetCounts

Page 61: Concept search for e commerce with solr

BRAND:"silver jeans" TYPE:"dress" STYLE:"white"

BRAND:"alfani" TYPE:"dress" STYLE:"silver","jeans"

BRAND:"chaloree" TYPE:"dress" STYLE:"silver"

BRAND:"style&co" TYPE:"jeans dress" STYLE:"silver"

BRAND:"silver jeans" TYPE:"dress" STYLE:"black"

BRAND:"silver jeans" TYPE:"dress" STYLE:"white"

BRAND:"silver jeans" TYPE:"jacket" STYLE: "black"

BRAND:"angie" TYPE:"dress" STYLE:"silver","jeans"

BRAND:"chaloree" TYPE:"jeans dress" STYLE:"silver"

BRAND:"silver jeans" TYPE:"dress" STYLE:"blue"

BRAND:"dotty" TYPE:"dress" STYLE:"silver","jeans"

BRAND:"chaloree" STYLE:"jeans" "dress"

Page 62: Concept search for e commerce with solr

BRAND:"silver jeans" TYPE:"dress" STYLE:"white"

BRAND:"alfani" TYPE:"dress" STYLE:"silver","jeans"

BRAND:"chaloree" TYPE:"dress" STYLE:"silver"

BRAND:"style&co" TYPE:"jeans dress" STYLE:"silver"

BRAND:"silver jeans" TYPE:"dress" STYLE:"black"

BRAND:"silver jeans" TYPE:"dress" STYLE:"white"

BRAND:"silver jeans" TYPE:"jacket" STYLE: "black"

BRAND:"angie" TYPE:"dress" STYLE:"silver","jeans"

BRAND:"chaloree" TYPE:"jeans dress" STYLE:"silver"

BRAND:"silver jeans" TYPE:"dress" STYLE:"blue"

BRAND:"dotty" TYPE:"dress" STYLE:"silver","jeans"

BRAND:"chaloree" STYLE:"jeans" "dress"

silver jeans dress

Page 63: Concept search for e commerce with solr

BRAND:"silver jeans" TYPE:"dress" STYLE:"white"

BRAND:"alfani" TYPE:"dress" STYLE:"silver","jeans"BRAND:"chaloree" TYPE:"dress" STYLE:"silver"

BRAND:"style&co" TYPE:"jeans dress" STYLE:"silver"BRAND:"silver jeans" TYPE:"dress" STYLE:"black"

BRAND:"silver jeans" TYPE:"dress" STYLE:"white"

BRAND:"silver jeans" TYPE:"jacket" STYLE: "black"

BRAND:"angie" TYPE:"dress" STYLE:"silver","jeans"BRAND:"chaloree" TYPE:"jeans dress" STYLE:"silver"BRAND:"silver jeans" TYPE:"dress" STYLE:"blue"

BRAND:"dotty" TYPE:"dress" STYLE:"silver","jeans"BRAND:"chaloree" STYLE:"jeans" "dress"

Page 64: Concept search for e commerce with solr

BRAND:"silver jeans" TYPE:"dress" TYPE:"dress" STYLE:"silver","jeans"

TYPE:"jeans dress" STYLE:"silver"BRAND:"silver jeans" TYPE:"dress" BRAND:"silver jeans" TYPE:"dress"

TYPE:"dress" STYLE:"silver","jeans"TYPE:"jeans dress" STYLE:"silver"

BRAND:"silver jeans" TYPE:"dress" TYPE:"dress" STYLE:"silver","jeans"

Page 65: Concept search for e commerce with solr

BRAND:"silver jeans" TYPE:"dress" TYPE:"dress" STYLE:"silver","jeans"

TYPE:"jeans dress" STYLE:"silver"BRAND:"silver jeans" TYPE:"dress"

BRAND:"silver jeans" TYPE:"dress"

TYPE:"dress" STYLE:"silver","jeans"TYPE:"jeans dress" STYLE:"silver"

BRAND:"silver jeans" TYPE:"dress" TYPE:"dress" STYLE:"silver","jeans"

Page 66: Concept search for e commerce with solr

BRAND:"silver jeans" TYPE:"dress" (4)TYPE:"dress" STYLE:"silver","jeans"

TYPE:"jeans dress" STYLE:"silver"

TYPE:"dress" STYLE:"silver","jeans"TYPE:"jeans dress" STYLE:"silver"

TYPE:"dress" STYLE:"silver","jeans"

Page 67: Concept search for e commerce with solr

BRAND:"silver jeans" TYPE:"dress" (4)TYPE:"dress" STYLE:"silver","jeans"

TYPE:"jeans dress" STYLE:"silver"

TYPE:"dress" STYLE:"silver","jeans"

TYPE:"jeans dress" STYLE:"silver"

TYPE:"dress" STYLE:"silver","jeans"

Page 68: Concept search for e commerce with solr

BRAND:"silver jeans" TYPE:"dress" (4)TYPE:"dress" STYLE:"silver","jeans" (3)

TYPE:"jeans dress" STYLE:"silver"

TYPE:"jeans dress" STYLE:"silver"

Page 69: Concept search for e commerce with solr

BRAND:"silver jeans" TYPE:"dress" (4)TYPE:"dress" STYLE:"silver","jeans" (3)

TYPE:"jeans dress" STYLE:"silver" (2)

Page 70: Concept search for e commerce with solr

BRAND: "Silver Jeans" - (156)

COLOR: "Silver" TYPE: "Jeans" - (34)

found 190

Page 72: Concept search for e commerce with solr

BRAND: "Silver Jeans" - (156)

COLOR: "Silver" TYPE: "Jeans" - (34)

found 190

Page 73: Concept search for e commerce with solr

Wrap-Up

● eCommerce Search is Special

● Subphrase Query Parser

● Staged Search

● Match Spotting

Page 74: Concept search for e commerce with solr

CONTACT Mikhail [email protected]

http://goo.gl/2OHk2


Recommended