I'm using the Scikit learn pipeline object because I have a sequence of tasks to perform (upsampling, feature selection, classification). My upsampling method is a custom one, that means I have to implement a custom transformer for the pipeline.
A transformer must have a transform and fit method. Of course I only want to upsample the training data but not the test data. Does this mean that I only have to implement the fit method but not the transform method (upsampling the dataset passed to the fit method)? As I understand, the transform method is applied to both the training and test set...