According to the documentation quoted below, it seems that restarting the processor resets the maximum column value I provided, so the processor would start fetching data from the beginning again.
A comma-separated list of column names. The processor will keep track of the maximum value for each column that has been returned since the processor started running.
- However, I tested this behavior, and even after restarting the processor I get only an incremental load. Is there a mistake in the documentation, or have I missed something?
- What would happen if I re-deploy the job, that is, delete the job and re-create it from the template?
- The code says that the value is stored with Scope.CLUSTER. Could someone please explain what that is, and under which conditions the state is cleared?
@Stateful(scopes = Scope.CLUSTER, description = "After performing a query on the specified table, the maximum values for "
        + "the specified column(s) will be retained for use in future executions of the query. This allows the Processor "
        + "to fetch only those records that have max values greater than the retained values. This can be used for "
        + "incremental fetching, fetching of newly added rows, etc. To clear the maximum values, clear the state of the processor "
        + "per the State Management documentation")
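
To make the question concrete, here is a minimal sketch of the behavior the annotation's description seems to imply. It is not NiFi's code: `StateStore` and `IncrementalFetcher` are hypothetical names, and the sketch assumes the retained maximum lives in a store outside the processor instance, so a "restart" (a new instance over the same store) still sees the old maximum and only an explicit clear resets it.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class MaxValueStateSketch {

    /** Hypothetical stand-in for state that is persisted outside the processor. */
    static class StateStore {
        private final Map<String, Long> maxValues = new HashMap<>();

        Long get(String column) { return maxValues.get(column); }
        void put(String column, long value) { maxValues.put(column, value); }
        void clear() { maxValues.clear(); }
    }

    /** Hypothetical processor: it keeps no maximum in its own fields, only in the store. */
    static class IncrementalFetcher {
        private final StateStore store;
        private final String column;

        IncrementalFetcher(StateStore store, String column) {
            this.store = store;
            this.column = column;
        }

        /** Returns only the values greater than the retained maximum, then updates it. */
        List<Long> fetch(List<Long> columnValues) {
            Long retained = store.get(column);
            Long newMax = retained;
            List<Long> fetched = new ArrayList<>();
            for (Long v : columnValues) {
                if (retained == null || v > retained) {
                    fetched.add(v);
                    if (newMax == null || v > newMax) {
                        newMax = v;
                    }
                }
            }
            if (newMax != null) {
                store.put(column, newMax);
            }
            return fetched;
        }
    }

    public static void main(String[] args) {
        StateStore store = new StateStore();
        List<Long> ids = new ArrayList<>(List.of(1L, 2L, 3L));

        // First run: nothing retained yet, so every row is fetched.
        System.out.println(new IncrementalFetcher(store, "id").fetch(ids));

        // "Restart": a new instance over the same store fetches only the new row.
        ids.add(4L);
        System.out.println(new IncrementalFetcher(store, "id").fetch(ids));

        // Clearing the state is what triggers a full reload in this sketch.
        store.clear();
        System.out.println(new IncrementalFetcher(store, "id").fetch(ids));
    }
}
```

If the real processor behaves like this sketch, it would explain why my restart test still produced an incremental load, but I would like confirmation of where the state actually lives and what happens to it when the processor is deleted and re-created.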