All coefficients turn to zero in logistic regression using scikit-learn

Question

I am working on logistic regression using scikit-learn in Python. The data file can be downloaded via the following link.

link for data

Below is my code for the machine learning part.

from sklearn.linear_model import Lasso
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.metrics import roc_auc_score
import pandas as pd
scaler = StandardScaler()

data = pd.read_csv('data.csv')
dataX = data.drop('outcome',axis =1).values.astype(float)
X     = scaler.fit_transform(dataX)
dataY = data[['outcome']]
Y = dataY.values

X_train,X_test,y_train,y_test = train_test_split (X,Y,test_size = 0.25, random_state = 33)
lasso = Lasso(alpha=.3)
lasso.fit(X_train,y_train)
print("MC learning completed")
print(lasso.score(X_train,y_train))
print(lasso.score(X_test,y_test))
print(lasso.coef_)

When I print the coefficients, they are all zero. Can anyone advise me on that?

Let me explain my objective a little. This looks like a classification problem, since y_train and y_test contain only 0 or 1. As a simple example, 0 can be read as a missed shot and 1 as a scored shot. What I am trying to do is compute the probability of scoring for each event in which a shot is taken.

Thanks in advance.

Regards,

Zep

Hi Kumar, thanks for the reply. I have attached the data file as well; just click on the link to download it. — Zephyr
I'm seeing a Lasso model being used instead of a logistic regression. Lasso is used for regression rather than classification. — Scratch'N'Purr
Hi Kumar, I am working on regression, not classification. Using the coefficients, I should be able to predict the probability of the outcome. Thanks — Zephyr
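
The distinction raised in the comments above can be illustrated with a minimal sketch. It uses synthetic data from make_classification as a stand-in for the asker's data.csv (an assumption, since the file is not reproduced here), and fits scikit-learn's LogisticRegression with its default settings, which apply L2 regularization. Unlike Lasso, a classifier of this kind exposes predict_proba, which returns a value between 0 and 1 for each sample. The variable names are illustrative only.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the shot data (assumption: not the asker's data.csv).
X, y = make_classification(n_samples=500, n_features=8, n_informative=4,
                           random_state=33)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=33)

# Fit the scaler on the training split only, then apply it to both splits.
scaler = StandardScaler().fit(X_train)
X_train_s = scaler.transform(X_train)
X_test_s = scaler.transform(X_test)

# Logistic regression models P(outcome = 1 | features) directly.
clf = LogisticRegression()
clf.fit(X_train_s, y_train)

# Column 1 of predict_proba is the estimated probability of class 1 ("scored").
proba_scored = clf.predict_proba(X_test_s)[:, 1]

print(clf.coef_)
print(proba_scored[:5])
```

Here y is passed as a 1-D array, which is the shape scikit-learn classifiers expect for a single target column.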

chuzz chuzz · Accepted Answer · 2018-07-18T09:20:02

1 vote

I just changed alpha in Lasso. My result:
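
Why the value of alpha matters can be sketched on synthetic data (again a stand-in, not the asker's file, and not the result the answer refers to). scikit-learn's Lasso minimizes (1 / (2 * n_samples)) * ||y - Xw||^2 + alpha * ||w||_1, so every coefficient is exactly zero once alpha reaches max_j |X_j^T (y - mean(y))| / n_samples, called alpha_max below. For features standardized to unit variance and a 0/1 target, that threshold cannot exceed 0.5, so alpha=0.3 already demands a fairly strong feature-target correlation before any coefficient survives. The names alpha_max and nonzero are illustrative.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import Lasso
from sklearn.preprocessing import StandardScaler

# Synthetic binary-target data (assumption: stands in for the real data set).
X, y = make_classification(n_samples=500, n_features=8, n_informative=4,
                           random_state=33)
X = StandardScaler().fit_transform(X)

# Smallest alpha at which the Lasso solution is all zeros.
y_centered = y - y.mean()
alpha_max = np.max(np.abs(X.T @ y_centered)) / len(y)
print("alpha_max:", alpha_max)

# Count surviving coefficients just above and well below that threshold.
nonzero = {}
for label, alpha in [("above", 1.01 * alpha_max), ("below", 0.1 * alpha_max)]:
    model = Lasso(alpha=alpha).fit(X, y)
    nonzero[label] = int(np.count_nonzero(model.coef_))
    print(label, alpha, nonzero[label])
```

Computing alpha_max on the training data gives a quick check of whether a chosen alpha is simply too large for the problem.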

3 Answers

EDIT 1: After taking into account the last paragraph that you just added in your question, use this: