Implementing incremental clustering using Carrot2 DCS

Question

Carrot2 accepts XML input that includes 'clusters' from its own earlier export on other documents. Now, if I want to implement incremental clustering, i.e. introduce new documents alongside the previous clusters, I have to keep the older documents in the input too. That makes the input grow linearly as we progress.

Is there a way to extract clusters along with document features for the respective clusters so as to solve this incremental/online clustering problem?

Stanislaw Osinski · Accepted Answer · 2017-02-16T15:23:26

Incremental clustering is currently only available in the Lingo3G algorithm (commercial add-on to Carrot2). In Carrot2, the only option for now is re-clustering the whole enlarged document set.
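
For the re-clustering route, a minimal sketch of the bookkeeping might look like the following. It only accumulates documents on the client side and rebuilds the full XML input for each request; it does not call the DCS itself. The `DocumentStore` class and its method names are hypothetical, and the `searchresult` / `document` / `title` / `url` / `snippet` layout is an assumption about the Carrot2 XML input format that should be checked against the documentation for your DCS version.

```python
import xml.etree.ElementTree as ET


class DocumentStore:
    """Hypothetical helper: keeps every document seen so far so that the
    whole enlarged set can be sent for re-clustering each time."""

    def __init__(self):
        self._docs = []

    def add_batch(self, docs):
        """Append new documents; each is a dict with 'title', 'snippet'
        and an optional 'url'."""
        self._docs.extend(docs)

    def to_carrot2_xml(self, query=""):
        """Serialize ALL accumulated documents (old and new).

        Element names are an assumption about the Carrot2 XML input format.
        """
        root = ET.Element("searchresult")
        ET.SubElement(root, "query").text = query
        for doc_id, doc in enumerate(self._docs):
            node = ET.SubElement(root, "document", id=str(doc_id))
            ET.SubElement(node, "title").text = doc.get("title", "")
            ET.SubElement(node, "url").text = doc.get("url", "")
            ET.SubElement(node, "snippet").text = doc.get("snippet", "")
        return ET.tostring(root, encoding="unicode")


store = DocumentStore()
store.add_batch([
    {"title": "First doc", "snippet": "text of the first document"},
    {"title": "Second doc", "snippet": "text of the second document"},
])
first_request = store.to_carrot2_xml(query="example")

# A later batch arrives: the next request contains the old documents too.
store.add_batch([{"title": "Third doc", "snippet": "text of the third document"}])
second_request = store.to_carrot2_xml(query="example")
```

Each call to `to_carrot2_xml` produces the complete input, so the request size still grows linearly with the number of documents, as noted in the question. The sketch only keeps that growth in one place.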
