GraphConnect Europe 2016 - Tuning Your Cypher - Petra Selmer, Mark Needham

Tuning Cypher

Mark Needham @markhneedham

Petra Selmer @Aethelraed

Why do we need to tune?

‣ No query planner is ever perfect
‣ You know your domain better than the database

The Cost planner

‣ Introduced in 2.2.0
‣ It uses the statistics service in Neo4j to assign costs to various query execution plans, picking the cheapest one

‣ All queries use this by default

Cypher query execution

‣ http://neo4j.com/docs/snapshot/execution-plans.html
‣ http://neo4j.com/blog/introducing-new-cypher-query-optimizer

How do I view a query plan?

‣ EXPLAIN
• shows the execution plan without actually executing it or returning any results.
‣ PROFILE
• executes the statement and returns the results along with profiling information.
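As a minimal sketch of the difference, the same query can be prefixed with either keyword. The query is only an illustration, assembled from the Person and ACTS_IN names that appear later in this deck.

```cypher
// EXPLAIN: compiles the query and shows the plan with estimated rows,
// but does not run it, so no results and no db hits are reported.
EXPLAIN
MATCH (p:Person {name: "Tom Hanks"})-[:ACTS_IN]->(m)
RETURN m.title;

// PROFILE: runs the query, returns its results, and annotates each
// operator in the plan with the rows and db hits it actually produced.
PROFILE
MATCH (p:Person {name: "Tom Hanks"})-[:ACTS_IN]->(m)
RETURN m.title;
```

Because PROFILE really executes the statement, EXPLAIN is the safer first step for queries that write data or that may run for a long time.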

Neo4j’s longest plan (so far…)

What is our goal?

At a high level, the goal is simple: get the number of db hits down.

What is a database hit?

“an abstract unit of storage engine work.”

Execution plan operators

‣ Operators to look out for
• All nodes scan: expensive
• Label scan: cheaper
• Node index seek: cheapest
• Node index scan: used for range queries
‣ http://neo4j.com/docs/3.0.0-RC1/execution-plans.html

Our data set

Finding The Matrix

MATCH (movie {title: "The Matrix"})

RETURN movie


Tip: Use labels

MATCH (movie:Movie {title: "The Matrix"})
RETURN movie


Finding The Matrix

MATCH (movie {title: "The Matrix"})
RETURN movie

MATCH (movie:Movie {title: "The Matrix"})
RETURN movie

Tip: Use indexes and constraints

‣ Indexes for non-unique values
‣ Constraints for unique values

CREATE INDEX ON :Movie(title)

CREATE INDEX ON :Person(name)

CREATE CONSTRAINT ON (g:Genre)

ASSERT g.name IS UNIQUE

How does Neo4j use indexes?

‣ Indexes are only used to find the starting point for queries.

Relational: use index scans to look up rows in tables and join them with rows from other tables.

Graph: use indexes to find the starting points for a query.

Tip: Use indexes and constraints

MATCH (movie:Movie {title: "The Matrix"})
RETURN movie

Finding The Matrix

(no index)
MATCH (movie:Movie {title: "The Matrix"})
RETURN movie

(index)
MATCH (movie:Movie {title: "The Matrix"})
RETURN movie
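The progression across these slides can be replayed with PROFILE, roughly as sketched below. The operator names in the comments are the leaf operators each query would be expected to start from. The actual plan depends on the Neo4j version and on the statistics of the data, so treat them as assumptions to check against your own output.

```cypher
// 1. No label: nothing narrows the search, so expect an AllNodesScan
//    followed by a filter on title.
PROFILE
MATCH (movie {title: "The Matrix"})
RETURN movie;

// 2. Label but no index: expect a NodeByLabelScan over :Movie nodes,
//    still followed by a filter on title.
PROFILE
MATCH (movie:Movie {title: "The Matrix"})
RETURN movie;

// 3. Create the index (syntax as used in these slides), wait for it to
//    come online, then profile the same query again: expect a NodeIndexSeek.
CREATE INDEX ON :Movie(title);

PROFILE
MATCH (movie:Movie {title: "The Matrix"})
RETURN movie;
```

Comparing the total db hits reported for the three runs shows how much work each tip saves on a given data set.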

Actors who appeared together

MATCH (a:Person {name:"Tom Hanks"})

-[:ACTS_IN]->()<-[:ACTS_IN]-

(b:Person {name:"Meg Ryan"})

RETURN COUNT(*)


Tip: Enforce index usage

MATCH (a:Person {name:"Tom Hanks"})
-[:ACTS_IN]->()<-[:ACTS_IN]-
(b:Person {name:"Meg Ryan"})
USING INDEX a:Person(name)
USING INDEX b:Person(name)
RETURN COUNT(*)


Tom Hanks’ colleagues’ movies

MATCH (p:Person {name:"Tom Hanks"})

-[:ACTS_IN]->(m1)<-[:ACTS_IN]-

(coActor)-[:ACTS_IN]->(m2)

RETURN distinct m2.title


Tip: Reduce cardinality of WIP

MATCH (p:Person {name:"Tom Hanks"})
-[:ACTS_IN]->(m1)<-[:ACTS_IN]-
(coActor)
WITH DISTINCT coActor
MATCH (coActor)-[:ACTS_IN]->(m2)
RETURN distinct m2.title
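To judge whether deduplicating the work in progress is worthwhile, it can help to count how many rows the first pattern produces against how many distinct co-actors there really are. The query below is a hypothetical diagnostic, not one from the slides, and the aliases patternRows and distinctCoActors are made up for illustration.

```cypher
// Every (m1, coActor) pair is one row, so an actor who shared several
// movies with Tom Hanks shows up several times in patternRows.
MATCH (p:Person {name: "Tom Hanks"})-[:ACTS_IN]->(m1)<-[:ACTS_IN]-(coActor)
RETURN count(coActor) AS patternRows,
       count(DISTINCT coActor) AS distinctCoActors;
```

If patternRows is much larger than distinctCoActors, the second MATCH is being repeated for the same co-actor many times, which is the case where WITH DISTINCT should pay off.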


USING INDEX: forces the use of a specific index

MATCH (a:Person {name:"Tom Hanks"})-[:ACTS_IN]->()
USING INDEX a:Person(name)
RETURN count(*)

USING SCAN: forces a label scan on lower-cardinality labels

MATCH (a:Actor)-->(m:Movie:Comedy)
USING SCAN m:Comedy
RETURN count(distinct a)

Even more tips...

Use parameters

MATCH (p:Person {name: {name}})
-[:ACTS_IN]->(m)
RETURN m.title
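The point of the tip is that a parameterised query has the same text on every call, which gives Neo4j the chance to reuse a cached execution plan. The sketch below contrasts it with an assumed literal version, which is not shown in the slides. The {name} form is the parameter syntax of the Neo4j versions this deck targets; later releases write the same parameter as $name.

```cypher
// Literal value: every different name produces a different query string.
MATCH (p:Person {name: "Tom Hanks"})-[:ACTS_IN]->(m)
RETURN m.title;

// Parameterised: the query string is identical for every name. The value
// of {name} is supplied separately by the client when the query is run.
MATCH (p:Person {name: {name}})-[:ACTS_IN]->(m)
RETURN m.title;
```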

Avoid Cartesian products

‣ Easy to do this inadvertently:
MATCH (a:Actor), (m:Movie)
RETURN count(a), count(m)

‣ This is correct, and performs better:
MATCH (a:Actor)
WITH count(a) as a_count
MATCH (m:Movie)
RETURN a_count, count(m)
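Why the first form is wrong as well as slow can be seen on a toy example that needs no data. The sketch below uses UNWIND over made-up lists to stand in for three actors and two movies; the lists and values are purely illustrative.

```cypher
// Cartesian product: 3 x 2 = 6 rows, so both counts come back as 6.
UNWIND [1, 2, 3] AS a
UNWIND [10, 20] AS m
RETURN count(a), count(m);

// Aggregating first collapses the actors to a single row, so the second
// UNWIND yields only 2 rows and the result is a_count = 3, count(m) = 2.
UNWIND [1, 2, 3] AS a
WITH count(a) AS a_count
UNWIND [10, 20] AS m
RETURN a_count, count(m);
```

The same arithmetic applies to the Actor and Movie labels above: the comma-separated MATCH builds one row per actor and movie pair before anything is counted.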

Watch out for those warnings!

Cardinalities

Watch those rows!

Only RETURN what you need

‣ This is not recommended:
MATCH (a:Actor)
RETURN a

‣ Use this instead:
MATCH (a:Actor)
RETURN a.name, a.birthdate, a.height

‣ View query plans with EXPLAIN and PROFILE
‣ Use labels
‣ Index your starting points
‣ Reduce work in progress
‣ Remember the hints

Thanks for coming

‣ And don’t forget, if the tips aren’t working, ask us for help on Stack Overflow!

Mark Needham @markhneedham Petra Selmer @Aethelraed
