I have a term doc matrix (16,977 terms, 29,414 documents):
Non-/sparse entries: 355000/499006478
Sparsity : 100%
Maximal term length: 7
Weighting : term frequency (tf)
For further analysis I got to restrict the term number to 2,425. How can I generate a new term doc matrix by including terms with freq over and above 20 for instance?
Since the matrix is large, traditional method as.matrix
cannot be applied.