Can somebody explain to me what the argument ngram_window in the CreateDtm function actually means? I know the documentation describes it as "A numeric vector of length 2. The first entry is the minimum n-gram size; the second entry is the maximum n-gram size. Defaults to c(1, 1)." and I know what an n-gram is in general terms, but I do not understand its practical meaning in this specific function. I am analyzing a text file made up of abstracts, and I need to understand what value is appropriate for the maximum n-gram size. Thanks in advance for the help!
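To make the question concrete, here is a minimal sketch of how I currently read the documentation: every n-gram whose size falls between the two entries becomes a candidate column of the matrix. The helper `ngrams_in_window` is hypothetical (it is not part of textmineR), and the "_" separator and the example sentences are assumptions for illustration only. The actual CreateDtm call at the end also applies its own default preprocessing (lowercasing, stop word removal, and so on), so its column names may differ from what the helper returns.

```r
# Hypothetical helper, NOT part of textmineR: list every n-gram of a token
# vector for each size from ngram_window[1] to ngram_window[2].
# Joining tokens with "_" is an assumption made for this illustration.
ngrams_in_window <- function(tokens, ngram_window = c(1, 1)) {
  sizes <- seq(ngram_window[1], ngram_window[2])
  out <- character(0)
  for (n in sizes) {
    if (n > length(tokens)) next  # no n-gram of this size fits
    starts <- seq_len(length(tokens) - n + 1)
    grams <- vapply(
      starts,
      function(i) paste(tokens[i:(i + n - 1)], collapse = "_"),
      character(1)
    )
    out <- c(out, grams)
  }
  out
}

tokens <- c("latent", "dirichlet", "allocation")

# Window c(1, 1): single words only.
unigrams <- ngrams_in_window(tokens, c(1, 1))
print(unigrams)

# Window c(1, 2): single words plus every pair of adjacent words.
uni_and_bigrams <- ngrams_in_window(tokens, c(1, 2))
print(uni_and_bigrams)

# The corresponding call on real text, run only if textmineR is installed.
if (requireNamespace("textmineR", quietly = TRUE)) {
  abstracts <- c(
    doc1 = "Latent Dirichlet allocation is a topic model.",
    doc2 = "Topic models summarize large collections of abstracts."
  )
  dtm <- textmineR::CreateDtm(doc_vec = abstracts, ngram_window = c(1, 2))
  print(colnames(dtm))
}
```

If this reading is right, raising the second entry from 1 to 2 or 3 would add multi-word terms as extra columns and make the matrix much wider, which is the trade-off I am trying to judge for my abstracts.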