In NLTK, using a naive bayes classifier, I know from examples its very simply to use a "bag of words" approach and look for unigrams or bigrams or both. Could you do the same using two completely different sets of features?
For instance, could I use unigrams and length of the training set (I know this has been mentioned once on here)? But of more interest to me would be something like bigrams and "bigrams" or combinations of the POS that appear in the document?
Is this beyond the power of the basic NLTK classifier?
Thanks Alex