SOLR 24 x 7: From Development To Production
people.apache.org/~sgoeschl/presentations/solr...SOLR...

Sunday, 14 April 2013

Source: http://www.oewa.at/index.php?id=2

More Facts & Numbers

• 1.61 million classified ads

• 12.9 million page impressions every day

• 24.4 full-text search requests every second

• 20.0 new classified ads every minute
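
To put the per-second and per-minute figures on the same scale as the daily page impressions, they can be converted to daily totals. The sketch below assumes the quoted rates are averages that hold around the clock, which the slide does not state; under that assumption the search rate works out to roughly 2.1 million requests per day.

```python
# Average rates quoted on the slide.
SEARCHES_PER_SECOND = 24.4
NEW_ADS_PER_MINUTE = 20.0

SECONDS_PER_DAY = 24 * 60 * 60
MINUTES_PER_DAY = 24 * 60


def per_day(rate, units_per_day):
    """Scale an average rate to a daily total (assumes a constant rate)."""
    return rate * units_per_day


searches_per_day = per_day(SEARCHES_PER_SECOND, SECONDS_PER_DAY)
new_ads_per_day = per_day(NEW_ADS_PER_MINUTE, MINUTES_PER_DAY)

print(f"{searches_per_day:,.0f} full-text searches per day")
print(f"{new_ads_per_day:,.0f} new classified ads per day")
```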

The Business Case

• FAST no longer supports Linux

• Current license limited to 200 requests per second

• Overall FAST performance decreased

• No FAST experts available

• Provide better user experience

Where is full-text search used?!

Everywhere

What Is Apache SOLR?

• A Java open source search engine

• Indexes content sources, processes query requests, and returns search results

• Based on Lucene, but adds full enterprise search server features and capabilities

• A web-based application that processes HTTP requests and returns HTTP responses
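
Because SOLR is driven entirely over HTTP, a search is just a URL. The sketch below builds a request against the standard `/select` handler using only the Python standard library. The host, the core name `ads`, and the field name `title` are assumptions made for illustration; they do not come from the slides.

```python
from urllib.parse import urlencode


def build_select_url(base_url, core, query, rows=10):
    """Build a SOLR /select URL for the given query string."""
    params = {
        "q": query,    # the search expression
        "rows": rows,  # number of results to return
        "wt": "json",  # response writer: ask for JSON output
    }
    return f"{base_url}/{core}/select?{urlencode(params)}"


# Hypothetical local instance and core name.
url = build_select_url("http://localhost:8983/solr", "ads", "title:fahrrad")
print(url)
```

Fetching that URL with any HTTP client returns the matching documents. No client library is strictly required, which makes it easy to experiment from a browser or the command line.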

The Road Ahead

• Create SOLR prototype

• Replace FAST with SOLR

• Improved search experience

• Provide 24 x 7 availability

SOLR Prototype

• Based on Solritas

• Uses Velocity template engine

• Gives you a head start

• Easy to customize

The Road Ahead

• Create SOLR prototype

• Replace FAST with SOLR

• Improved search experience

• Provide 24 x 7 availability

Willhaben FAST Integration

• FAST Search Server tightly integrated

• Configuration & metadata in database

• Hard to integrate SOLR in parallel

• FAST to SOLR query translation

‣ Sort of hack

‣ Custom recursive parser
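
The slides do not show the translator itself, so the following is only a sketch of what a recursive query translator can look like. It assumes a made-up prefix notation, `and(...)` and `or(...)` with comma-separated arguments, as a stand-in for the legacy syntax; the real FAST grammar is richer and is not reproduced here. All function names are hypothetical.

```python
import re

# A token is a bracket or comma, an optionally field-prefixed quoted phrase,
# or a bare term.
TOKEN = re.compile(r'\s*([(),]|(?:\w+:)?"[^"]*"|[^\s(),"]+)')
OPERATORS = {"and": " AND ", "or": " OR "}


def tokenize(text):
    text = text.strip()
    tokens, pos = [], 0
    while pos < len(text):
        match = TOKEN.match(text, pos)
        if not match:
            raise ValueError(f"unexpected input at position {pos}")
        tokens.append(match.group(1))
        pos = match.end()
    return tokens


def translate(tokens):
    """Consume one expression from the token list and return SOLR syntax."""
    if not tokens:
        raise ValueError("unexpected end of query")
    head = tokens.pop(0)
    if head in OPERATORS and tokens and tokens[0] == "(":
        tokens.pop(0)  # opening bracket
        args = [translate(tokens)]  # recurse into each argument
        while tokens and tokens[0] == ",":
            tokens.pop(0)
            args.append(translate(tokens))
        if not tokens or tokens.pop(0) != ")":
            raise ValueError("expected ')'")
        return "(" + OPERATORS[head].join(args) + ")"
    return head  # a plain term or phrase passes through unchanged


def prefix_to_solr(query):
    tokens = tokenize(query)
    result = translate(tokens)
    if tokens:
        raise ValueError(f"trailing input: {tokens}")
    return result


print(prefix_to_solr('and(title:"rotes fahrrad", or(wien, graz))'))
```

Recursion keeps nested operators simple: each call handles one operator and delegates its arguments to further calls, so arbitrarily deep nesting needs no extra bookkeeping.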

FAST2SOLR Translation

SOLR Performance

The Road Ahead

• Create SOLR prototype

• Replace FAST with SOLR

• Improved search experience

• Provide 24 x 7 availability

Implicit Wildcard Search

• Classified Ads are very diverse and short

• The German language is hard

• Willhaben users are not computer geeks

• Using “Implicit Wildcard Search”

‣ DictionaryCompoundWordTokenFilter
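
German compounds such as "Kinderwagen" hide the words users actually type. A dictionary-based decompounder indexes the parts next to the original token, so a search for "wagen" can match without an explicit wildcard. The sketch below illustrates that idea in plain Python. It is a simplification, not Lucene's implementation, and the parameter names and values are illustrative assumptions.

```python
def decompound(token, dictionary, min_subword=4, max_subword=15):
    """Return the lowercased token plus every dictionary word found inside it."""
    lowered = token.lower()
    parts = [lowered]  # the original token is always kept
    for start in range(len(lowered)):
        for length in range(min_subword, max_subword + 1):
            end = start + length
            if end > len(lowered):
                break  # no longer substrings are possible from this start
            candidate = lowered[start:end]
            if candidate in dictionary:
                parts.append(candidate)
    return parts


# Tiny illustrative dictionary; a real one would hold many thousands of words.
dictionary = {"kinder", "wagen", "dreirad"}
print(decompound("Kinderwagen", dictionary))
```

The minimum subword length matters in practice: very short dictionary entries match inside many unrelated words and add noise to the index.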

SOLR Custom Components

System Monitoring

Slow Query Log
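
The slide content for this component is not preserved, so the following is only a sketch of the general idea: pick out requests whose execution time exceeds a threshold. It assumes request log lines that carry `params={...}` and `QTime=<milliseconds>` fields, as SOLR's request logging does. The sample lines and the 500 ms threshold are invented for illustration, and the component described in the talk may well work inside SOLR rather than on log files.

```python
import re

# Capture the request parameters and the query time in milliseconds.
REQUEST = re.compile(r"params=\{(?P<params>.*)\}.*?QTime=(?P<qtime>\d+)")


def slow_queries(lines, threshold_ms=500):
    """Yield (qtime_ms, params) for every request at or above the threshold."""
    for line in lines:
        match = REQUEST.search(line)
        if match and int(match.group("qtime")) >= threshold_ms:
            yield int(match.group("qtime")), match.group("params")


# Made-up sample lines in the assumed format.
sample = [
    "INFO: [ads] webapp=/solr path=/select params={q=fahrrad&rows=10} hits=42 status=0 QTime=12",
    "INFO: [ads] webapp=/solr path=/select params={q=sofa*&rows=500} hits=9120 status=0 QTime=1830",
]

for qtime, params in slow_queries(sample):
    print(qtime, params)
```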

Frequent Search Terms
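
Counting frequent search terms comes down to normalizing each query and tallying the results. The sketch below shows one minimal way to do that. The normalization (trim and lowercase) and the sample queries are assumptions, since the slides do not describe how the component works.

```python
from collections import Counter


def top_terms(queries, n=3):
    """Return the n most frequent queries after trimming and lowercasing."""
    counts = Counter(q.strip().lower() for q in queries if q.strip())
    return counts.most_common(n)


# Invented sample input.
queries = ["Fahrrad", "fahrrad ", "iPhone", "fahrrad", "iphone", "sofa", "  "]
print(top_terms(queries))
```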

Best User Search Terms

• gümnastikband

• Auti Q5

• Mitzbuichi

• Star Woars

• kopjuter

• kinderdreirad mit 3 breiten gummireifen aus den 60er jahren (children's tricycle with 3 wide rubber tires from the 1960s)

Questions & Answers
