Sentiment Classification using Doc2Vec

Question

I am confused about how to use Doc2Vec (via Gensim) on the IMDB sentiment classification dataset. I have obtained the Doc2Vec embeddings by training on my corpus and built a Logistic Regression model on top of them. How do I use it to make predictions for new reviews? sklearn's TF-IDF has a transform method that can be applied to test data after fitting on the training data. What is the equivalent in Gensim's Doc2Vec?

chefhose · Accepted Answer · 2019-12-27T14:00:40

To get a vector for an unseen document, use vector = model.infer_vector(["new", "document"]). Then feed vector into your classifier: preds = clf.predict([vector]).
