I’m trying to import 20 million nodes and 250 million relationships to Neo4j using the Batch-importer . I have 8GB of RAM.
Here are my current settings in batch.properties:
use_memory_mapped_buffers=false
neostore.nodestore.db.mapped_memory=500M
neostore.relationshipstore.db.mapped_memory=2000M
neostore.relationshipgroupstore.db.mapped_memory=10M
neostore.propertystore.db.mapped_memory=1G
neostore.propertystore.db.strings.mapped_memory=200M
neostore.propertystore.db.arrays.mapped_memory=0M
neostore.propertystore.db.index.keys.mapped_memory=15M
neostore.propertystore.db.index.mapped_memory=50M
batch_import.node_index.users=exact
batch_import.csv.quotes=false
cache_type=none
It took around 7 minutes to import the 20 million nodes.
It seems that for the relationships, it takes 13 minutes to import 10 million (as per the output on the console).
Meaning it will take around 6 hours (250 / 10 * 13) to import all the relationships. Can we improve this?