From https://en.wikipedia.org/wiki/Box_plot
The whisker of the box plot has the following possible definitions:
- the minimum and maximum of all of the data[1]
- the lowest datum still within 1.5 IQR of the lower quartile, and the highest datum still within 1.5 IQR of the upper quartile
- one standard deviation above and below the mean of the data
- the 9th percentile and the 91st percentile
- the 2nd percentile and the 98th percentile.
I am wondering in the pandas:
df['data'].plot(kind = 'box', sym='bD')
which definition is the whisker using?
Also, for the matplotlib library:
ax.boxplot(dfa.duration)
which definition is the whisker using?
Thanks!