I'm working on text clustering. I want to select specific documents (as vectors) to be the initial centroids for k-means.
I have created TF-IDF vectors for my dataset using Mahout, and I would like to choose the initial clusters from those TF-IDF vectors.
Does anyone have an idea how I can specify the initial centroids in Mahout?