I want to use weka for text classification for Persian text. But I have a problem.
Tokenizer, stoplist and stemmer in Persian is different from these in English. So I should use my stemmer, tokenizer and stoplist in weka's interface there is a soulution to use my own stoplist but there is no way to change stemmer and tokennizer.
I want to know is there anyway to change them without modify weka's source code?
Because I am new in java and I don't know how I should modify weka source code.