I have a question about machine learning and decision tree. I work in computational biology (long RNA secondary structure prediction).
I have a program which predicts the accuracy of a predicted RNA secondary structure. The input argument to the program are
- stem length (L) - values from 3,4,5,6,7 and 8
- gap size (G) - values from from 0,1,2,3,4,5,6,7,and 8
- chunk length (c) - values from from 60,70,80,90,100,120,130,140, and 150
I want to know, for a given RNA sequence of length (S), which L,G,C combination gives a maximum accuracy.
I have a training data set of 50 sequence files with sequence lengths S and for each these sequence files, the L,G,C input parameter combinations which gives maximum accuracy output are already known.
Is there a way that we can know which specific L, G, and C parameters to use in order to find maximum accuracy with out all the L,G, and C range values?