I have a single XML file that I want to index using Lucene.NET. The file is basically a large collection of logs. It is larger than 5 GB, and I am developing on a system with 2 GB of RAM. How can I index it, given that I am not parsing the file and am not creating any fields other than "text", which will contain the file data?
I am using some code from CodeClimber and am currently not sure what the best approach is for indexing such a large single file.
Is there a way to pass the file data to the index in chunks? Below is the code that creates the text field and adds the associated file data:
Document doc = new Document();
doc.Add(new Field("Body", text, Field.Store.YES, Field.Index.TOKENIZED));
writer.AddDocument(doc);
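To make the "chunks" idea concrete, this is roughly what I have in mind: an untested sketch, not something I know to be the recommended approach. The class name `ChunkedIndexer`, the method `IndexInChunks`, and the `chunkChars` parameter are hypothetical names of my own; the `Field` constructor is the same one used above. It reads the file in fixed-size character blocks and adds each block as a separate document, so only one block is held in memory at a time.

```csharp
using System.IO;
using Lucene.Net.Documents;
using Lucene.Net.Index;

static class ChunkedIndexer
{
    // Hypothetical helper: adds one document per fixed-size block of characters.
    // Assumption: splitting the file at arbitrary offsets is acceptable, even
    // though a block boundary may fall in the middle of a word or log entry.
    public static void IndexInChunks(IndexWriter writer, string path, int chunkChars)
    {
        char[] buffer = new char[chunkChars];
        using (StreamReader reader = new StreamReader(path))
        {
            int read;
            // ReadBlock fills the buffer until it is full or the file ends,
            // and returns 0 once there is nothing left to read.
            while ((read = reader.ReadBlock(buffer, 0, buffer.Length)) > 0)
            {
                string text = new string(buffer, 0, read);
                Document doc = new Document();
                doc.Add(new Field("Body", text, Field.Store.YES, Field.Index.TOKENIZED));
                writer.AddDocument(doc);
            }
        }
    }
}
```

With this, a search hit would point to a block rather than to the whole file, and I am not sure whether that trade-off is reasonable.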
Thank you for the guidance