I'm trying Flink for basic aggregation of (sorted) timestamped events loaded from a CSV file.
I tell Flink to use Event Time:
env.setStreamTimeCharacteristic(TimeCharacteristic.EventTime)
and then I use a time window on a KeyedStream
val distances = signals
.assignAscendingTimestamps(_.ts)
.map(s => (s.mmsi, s.ts, getPortDistance(s)))
.keyBy(0)
.window(TumblingEventTimeWindows.of(Time.seconds(1)))
.sum(2).print()
The problem is that changing the window to like 10 minutes actually prints results after that amount of time has passed!
My understanding was that by explicitly telling Flink to use the timestamp field as the Event Time, the operation wouldn't be dependant on real time on the machine. Am I missing something here?