neo4j Cypher Grouping

Question

I have a database of tweet nodes and each tweet has a userId field with relationships based on if the tweet was replied to. I am trying to write a query where I group all of the tweets by user while preserving tweet relationships to see the relationship between users.

So far I have

match (n:Tweet) return distinct(n.userId), n

but this does not work because the relationships are not preserved. Does anybody know how to do this?

What is your data model? Show the nodes (and node labels), the relationship types, and how the nodes are connected by those relationship types. — cybersam

Brian Underwood Brian Underwood · Accepted Answer · 2016-02-19T08:27:56

I'm not quite sure what you're asking... You're talking about relationships but you haven't matched on any.

Speaking generally when you use an aggregate function (such as sum, count, collect, etc...) in a RETURN (or a WITH) clause Neo4j will automatically group by the other columns in the clause. Here's an example of something that you might do:

MATCH (source_tweet:Tweet {userId: 1234})<-[:RETWEET_OF]-(retweet:Tweet)
RETURN source_tweet:Tweet.id, source_tweet:Tweet.text, count(retweet)

This will give you one line for each tweet userId #1234 has made and a count of the number of retweets for each of those tweets.

neo4j Cypher Grouping

2 Answers