KNN when using a precomputed affinity matrix in Scikit's spectral clustering?

Question

I have a similarity matrix that I have calculated between a large number of objects, and each object can have a non-zero similarity with any other object. I generated this matrix for another task, and would now like to cluster it for a new analysis.

It seems like scikit's spectral clustering method could be a good fit, because I can pass in a precomputed affinity matrix. I also know that spectral clustering typically uses some number of nearest neighbors when building the affinity matrix, and my similarity matrix does not have that same constraint.

If I pass in a matrix that allows any number of edges between nodes in the affinity matrix, will scikit limit each node to having only a certain number of nearest neighbors? If not, I guess I will have to make that change to my pre-computed affinity matrix.

Thomas Reynaud · Accepted Answer · 2016-10-20T13:41:47

You don't have to compute the affinity yourself to do spectral clustering; sklearn can do that for you.

When you call sc = SpectralClustering(), the affinity parameter lets you choose the kernel used to compute the affinity matrix. rbf is the default kernel and doesn't use a particular number of nearest neighbours. However, if you choose affinity='nearest_neighbors', you can specify that number with the n_neighbors parameter.
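As a rough sketch of the two options described above, the snippet below fits the same estimator once with the default rbf kernel and once with a k-nearest-neighbours graph. The toy data from make_blobs and the values n_clusters=3 and n_neighbors=10 are assumptions chosen for illustration only.

```python
from sklearn.cluster import SpectralClustering
from sklearn.datasets import make_blobs

# Toy feature matrix (an assumption for illustration): 60 points in 3 blobs.
X, _ = make_blobs(n_samples=60, centers=3, cluster_std=0.5, random_state=0)

# Default kernel: a dense RBF affinity, no neighbour limit applied.
sc_rbf = SpectralClustering(n_clusters=3, affinity="rbf", random_state=0)
labels_rbf = sc_rbf.fit_predict(X)

# k-nearest-neighbours graph: n_neighbors controls how many edges each point gets.
sc_knn = SpectralClustering(
    n_clusters=3, affinity="nearest_neighbors", n_neighbors=10, random_state=0
)
labels_knn = sc_knn.fit_predict(X)

# The affinity actually used is stored on the fitted estimator.
print(sc_rbf.affinity_matrix_.shape, sc_knn.affinity_matrix_.shape)
```

With well-separated blobs the k-nearest-neighbours graph may not be fully connected, in which case scikit-learn emits a warning but still returns labels.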

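You can then use sc.fit_predict(your_matrix) to compute the clusters. If your_matrix is already a similarity matrix, as in the question, construct the estimator with affinity='precomputed'; scikit-learn then uses the matrix as given and does not limit each node to a number of nearest neighbours.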
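For the precomputed case in the question, one possible approach is to apply the nearest-neighbour limit yourself before clustering. The sketch below is only an illustration: knn_sparsify is a hypothetical helper (not part of scikit-learn), and the toy similarity matrix S and the choice k=25 are assumptions.

```python
import numpy as np
from sklearn.cluster import SpectralClustering


def knn_sparsify(S, k):
    """Hypothetical helper: keep each row's k largest off-diagonal
    similarities, then symmetrize by keeping an edge if either endpoint
    selected it."""
    S = np.asarray(S, dtype=float).copy()
    np.fill_diagonal(S, 0.0)  # ignore self-similarity when ranking
    # Column indices of the k largest entries in every row.
    idx = np.argpartition(-S, k - 1, axis=1)[:, :k]
    mask = np.zeros(S.shape, dtype=bool)
    mask[np.arange(S.shape[0])[:, None], idx] = True
    mask |= mask.T  # union of both directions keeps the matrix symmetric
    return np.where(mask, S, 0.0)


# Toy dense similarity matrix (assumption): Gaussian similarity of 2-D points.
rng = np.random.default_rng(0)
points = np.vstack([rng.normal(c, 0.3, size=(20, 2)) for c in (0.0, 3.0, 6.0)])
sq_dists = ((points[:, None, :] - points[None, :, :]) ** 2).sum(axis=-1)
S = np.exp(-sq_dists / 2.0)

# k is larger than a cluster here so the graph stays connected in this toy case.
A = knn_sparsify(S, k=25)

# With affinity="precomputed", the matrix is used as the affinity as given.
sc = SpectralClustering(n_clusters=3, affinity="precomputed", random_state=0)
labels = sc.fit_predict(A)
```

Passing S instead of A to fit_predict would cluster on the full dense similarities, so whether to sparsify is a modelling choice rather than something the estimator requires.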
