Cypher no loops, no double paths

Question

I am currently modelling a database with over 50,000 nodes, and every node has 2 directed relationships. For one input node (the root node), I am trying to get all nodes that are connected to it by one relationship, all so-called children of these nodes, and so on, until every node connected directly or indirectly to the root node has been reached.

String query =
  "MATCH (m {title:{title},namespaceID:{namespaceID}})-[:categorieLinkTo*..]->(n) " +
  "RETURN DISTINCT n.title AS Title, n.namespaceID " + 
  "ORDER BY n.title";

Result result = db.execute(query, params);
String infos = result.resultAsString();

I have read that the runtime is most likely in O(n^x), but I cannot find any command that excludes, for example, loops or multiple paths to the same node. As a result, the query simply takes over 2 hours, which is not acceptable for my use case.

There is no GROUP BY operator in Cypher. Did you mean ORDER BY? — Gabor Szarnyas
Hey, yes, my bad. That was just an attempt to see if something like that worked here, and I forgot to remove it. It didn't work with ORDER BY either. — DanDo

Gabor Szarnyas · Accepted Answer · 2016-12-04T18:12:54

For simple relationship expressions, Cypher excludes multiple relationships automatically by enforcing uniqueness:

While pattern matching, Neo4j makes sure to not include matches where the same graph relationship is found multiple times in a single pattern.

The documentation is not entirely clear on whether this works for variable-length paths, so let's design a small experiment to confirm it:

CREATE
  (n1:Node {name: "n1"}),
  (n2:Node {name: "n2"}),
  (n3:Node {name: "n3"}),
  (n4:Node {name: "n4"}),
  (n1)-[:REL]->(n2),
  (n2)-[:REL]->(n3),
  (n3)-[:REL]->(n2),
  (n2)-[:REL]->(n4)

This results in a graph in which n1 points to n2, n2 and n3 point to each other (forming a loop), and n2 also points to n4.

Query with:

MATCH (n:Node {name:"n1"})-[:REL*..]->(m)
RETURN m

The result is:

╒══════════╕
│m         │
╞══════════╡
│{name: n2}│
├──────────┤
│{name: n3}│
├──────────┤
│{name: n2}│
├──────────┤
│{name: n4}│
├──────────┤
│{name: n4}│
└──────────┘

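As you can see, n2 and n4 are each included twice, because each can be reached both by avoiding the loop and by going through it. The loop itself is traversed only once, since no relationship may repeat within a single path. You can check the execution with PROFILE.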

So we should use DISTINCT to get rid of the duplicates:

MATCH (n:Node {name:"n1"})-[:REL*..]->(m)
RETURN DISTINCT m

The result is:

╒══════════╕
│m         │
╞══════════╡
│{name: n2}│
├──────────┤
│{name: n3}│
├──────────┤
│{name: n4}│
└──────────┘
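
The two result tables above can be reproduced outside Neo4j with a small simulation. The following Java sketch is an illustration only and does not show how Neo4j is implemented: the class name RelationshipUniquenessDemo and the method matchVariableLength are hypothetical, and the graph is hard-coded from the CREATE statement. It enumerates every path from n1 without reusing a relationship within a path (one row per path), then removes duplicate end nodes the way DISTINCT does.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical demo class; it only mimics the matching semantics discussed above.
class RelationshipUniquenessDemo {
    // The four :REL relationships from the CREATE statement, as {start, end} pairs.
    static final String[][] RELS = {
        {"n1", "n2"}, {"n2", "n3"}, {"n3", "n2"}, {"n2", "n4"}
    };

    // Returns the end node of every path that starts at `start`, one row per path,
    // never using the same relationship twice within a single path.
    static List<String> matchVariableLength(String start) {
        List<String> rows = new ArrayList<>();
        expand(start, new boolean[RELS.length], rows);
        return rows;
    }

    private static void expand(String node, boolean[] used, List<String> rows) {
        for (int i = 0; i < RELS.length; i++) {
            if (used[i] || !RELS[i][0].equals(node)) {
                continue; // already on the current path, or does not start at this node
            }
            used[i] = true;                 // the relationship joins the current path
            rows.add(RELS[i][1]);           // every extended path yields one result row
            expand(RELS[i][1], used, rows);
            used[i] = false;                // backtrack so other paths may use it
        }
    }

    public static void main(String[] args) {
        List<String> rows = matchVariableLength("n1");
        Set<String> distinct = new LinkedHashSet<>(rows); // plays the role of DISTINCT
        System.out.println(rows);
        System.out.println(distinct);
    }
}
```

Running it prints [n2, n3, n2, n4, n4] followed by [n2, n3, n4]. The row order comes from the depth-first order of this sketch; Cypher does not guarantee any row order without ORDER BY.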

Again, you can check the execution with PROFILE.
