I am working with Hadoop 0.20, and wish to use the NLinesInputFormat, but this functionality isn't present?
Is there an alternative?
Here's what I'm trying to do:
Records in the data span multiple lines, so I shall set NLinesInputFormat to a multiple of 'k', where k is the number of lines each record takes.
Any help would be appreciated.