How does WEKA treat nominal attributes v/s numerical attributes?

Question

If one of my columns in the data set has just 3 possible values, i.e. 0, 1 and 2, how differently does WEKA treat them if I declare them as nominal vs. numerical?

Also, if a column has a large number of possible nominal values, is there an easy way to declare such a nominal attribute?

Has QUIT--Anony-Mousse · Accepted Answer · 2012-08-07T04:10:39

Roughly speaking (it depends on the actual algorithm):

When treated as numeric, the difference between 1 and 3 will be roughly twice as big as the difference between 1 and 2 (given that there are no other attributes).

When treated as strings, they are probably both equally different, as '1' != '2' and '1' != '3'. (However, the result may also depend on the frequency of the values; common dissimilarity measures for categorical data involve relative frequencies.)
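
To make the contrast concrete, here is a minimal sketch in plain Python. It is not WEKA's actual code: the two helper functions are hypothetical, and they assume the simplest possible measures (absolute difference for numeric values, a 0/1 mismatch for nominal values). A real algorithm may normalize, weight, or use frequencies instead.

```python
def numeric_distance(a, b):
    # Numeric treatment: values lie on a scale, so magnitude matters.
    return abs(a - b)


def nominal_distance(a, b):
    # Nominal treatment (assumed simple mismatch): values are just labels,
    # so they are either the same (0) or different (1).
    return 0 if a == b else 1


pairs = [(1, 2), (1, 3)]

numeric = [numeric_distance(a, b) for a, b in pairs]
nominal = [nominal_distance(str(a), str(b)) for a, b in pairs]

print(numeric)  # [1, 2] -> 1 vs. 3 is twice as far apart as 1 vs. 2
print(nominal)  # [1, 1] -> both pairs are simply "different"
```

With the numeric treatment an algorithm can exploit the ordering of the values; with the nominal treatment, any ordering is ignored.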
