0
votes

Can someone please tell me whether it is correct to compute the correlation between a binary dependent variable (0 or 1) and continuous independent variables using pandas df.corr()?

I do get correlation output when I use it, but I want to understand whether it is statistically valid to compute the Pearson correlation (using df.corr()) between a binary categorical output and continuous input variables.
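For concreteness, here is a minimal sketch of the setup being described. The data is synthetic and the column names (x1, x2, target) are made up for illustration; they are not from a real dataset.

```python
import numpy as np
import pandas as pd

# Synthetic, purely illustrative data: two continuous inputs and a 0/1 target.
rng = np.random.default_rng(0)
n = 200
x1 = rng.normal(size=n)
x2 = rng.normal(size=n)

# Assumption for the example: the target depends on x1 only; x2 is unrelated noise.
y = (x1 + rng.normal(scale=0.5, size=n) > 0).astype(int)

df = pd.DataFrame({"x1": x1, "x2": x2, "target": y})

# Pearson is the default method of DataFrame.corr(); it is spelled out here for clarity.
# Keep only the correlations of the inputs with the binary target.
corr_with_target = df.corr(method="pearson")["target"].drop("target")
print(corr_with_target)
```

This runs without complaint because pandas treats the 0/1 column as an ordinary numeric column.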

1
Why don't you start with the documentation for df.corr? - cs95
I did go through it, but I want to understand whether it is statistically correct to compute the Pearson coefficient in this case. - Harikrishnan Balachandran
This might help: 1, 2, 3. - Nilesh Ingle

1 Answer

0
votes

Pearson correlation is intended for continuous data. If one variable is categorical (here, binary) and the other is continuous, you should use ANOVA to examine the relationship between the variables (reference).
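As a rough illustration of the ANOVA suggestion, here is a minimal sketch assuming SciPy is installed. The data is made up and the names group and value are hypothetical. It runs a one-way ANOVA on the continuous variable split by the binary one, and also computes Pearson's r on the 0/1 coding for comparison. Pearson's r with a binary variable is the point-biserial correlation, and with exactly two groups its significance test gives the same p-value as the one-way ANOVA, so in this two-group case both approaches test the same hypothesis.

```python
import numpy as np
from scipy import stats

# Made-up data: a binary variable and a continuous variable whose mean
# shifts slightly with the binary one (assumption for the example).
rng = np.random.default_rng(1)
n = 300
group = rng.integers(0, 2, size=n)            # values are 0 or 1
value = rng.normal(loc=0.3 * group, size=n)   # continuous measurements

# One-way ANOVA: compares the mean of `value` across the two groups.
f_stat, p_anova = stats.f_oneway(value[group == 0], value[group == 1])

# Pearson's r on the 0/1 coding, and SciPy's point-biserial helper.
r, p_pearson = stats.pearsonr(group, value)
r_pb, p_pb = stats.pointbiserialr(group, value)

print(f"ANOVA:          F = {f_stat:.4f}, p = {p_anova:.6f}")
print(f"Pearson:        r = {r:.4f}, p = {p_pearson:.6f}")
print(f"Point-biserial: r = {r_pb:.4f}, p = {p_pb:.6f}")
```

With more than two categories this equivalence no longer applies, and ANOVA (or another method suited to multi-level categorical data) is the natural choice.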