0
votes

What are some methods for finding X data ranges in Hadoop so that one can use these ranges as partitions in the reducer step?

1

1 Answers

0
votes

Looks like you need something like TotalOrderPartitioner, which allows a total order by reading split points from an externally generated source. You might find this link useful : http://chasebradford.wordpress.com/2010/12/12/reusable-total-order-sorting-in-hadoop/.

Don't know if this is exactly what you need? Apologies if I have get it wrong.