I am trying to understand the uses and limitation of outer joins in tableau (tableau online in this case). I have found the beavaiour of tableau to be not what I have expected. I have provided as detailed a description to my problems below, to avoid any ambiguotity and since I don't know where to start anymore. I hope I have not gone overboard(edits welcome).
Specifics of my use case
I am creating a join between two .csv files that have logged natural data at specific time intervals. One set has hourly time intervals, the other at intervals of minutes (which is variable due to various factors).
'Rain' data set(1):
Date and Time
| Rain
01/01/2018 00:00 | 0
01/01/2018 01:00 | 0.4
01/01/2018 02:00 | 1.4
01/01/2018 03:00 | 0.4
'Fill' data set (2):
Date and Time
| Fill
24/04/2018 06:04 | 78
24/04/2018 12:44 | 104
24/04/2018 18:51 | 96
25/04/2018 00:20 | 84
Unsurprisingly, I have many nulls in the data (which is not a problem to me) as:
- 'Rain' has a longer time series
- In either data set, the majority of date times do not have an exact equivalent in the other
What I am trying to achieve
I am trying to graph the two data sets in such a way that that I can compare the full data sets against each other, in all of the following ways:
- Monthly or Yearly aggregation (average)
- Hourly aggregation (average)
- Exact times
Problems (and my limited assumptions)
- Once graphed in tableau some values had
'null'
DateTime values*
. - Once graphed in tableau it appears as if many points are simply missing
**
Graphing using 'Fill' time series
Graphing using 'Rain' time series - I had assumed (giving the full outer join of
'Date and Time(s)'
) tableau would join the data sets in chronological order with a common date time series
*
I had assumed it impossible for the join conditions to have 'null'
values without throwing an error. Also, the data is clean and uniform**
And this is when aggregating monthly, which I assumed would not be affected by any (if any) hourly/minute mismatches
So, finally the question #
In my reading of the online help documentation I am struggling to find a functionality that is native to tableau that can help me achieve these specific goals. I am reaching the worrying conclusion that tableau was not built for this type of 'visual analytics'.
Is there a functionality native to tableau that will allow me to combine the data in the way I described above?
Approaches I have considered
Since I have two .csv
files I could combine both set so that I have the full, granular 'Date and Time'
fields in one tall list.
However, I would like to find a method that is natural to tableau (online) because in future, at least some of the data wil come from a database (postgres) connection but others will likely have to remain as upload as a .csv
or excel
files.
Again I ask
What am I overlooking in regards how (and why) to use tableau? I am not looking for a complete solution, but what tools could I use to achieve this?
Many thanks for any help