I have Dask to handle big array of vectors that cant fit in memory and using scikit-learn cosine_similarity to compute cosine similarity between those vector i.e:
import dask.array as da
from sklearn.metrics.pairwise import cosine_similarity
vectors = da.from_array(vectors, 10000)
sims_mat = cosine_similarity(vectors)
Works fine but I am not sure if in this way I have any benefits of using Dask or should I look for cosine similarity function for dask arrays