How to identify empty cells in a CSV file using pandas

Question

I am reading a column from a CSV file into an array using pandas. However, many of the cells are empty and are stored in the array as 'nan'. I want to either identify the empty cells so I can skip them, or remove them all from the array afterwards. Something like the following pseudo-code:

if df.row(column number) == nan
    skip

or

if df.row(column number) != nan
    do stuff

Basically, how do I identify whether a cell from the CSV file is empty?

sacuL · Accepted Answer · 2018-10-01T19:12:52

The simplest approach is to drop the rows that contain NaN after loading the file, using boolean indexing:

df = df[df['column_to_check'].notnull()]
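As a minimal end-to-end sketch of that idea, the snippet below reads a small CSV, keeps only the rows whose cell in one column is non-empty, and pulls that column out as an array. The CSV contents and the column names `name` and `score` are made up for illustration, and an in-memory buffer stands in for a real file path.

```python
import io

import pandas as pd

# Hypothetical CSV contents; bob's score cell is empty.
csv_text = "name,score\nalice,1.5\nbob,\ncarol,3.0\n"

# pandas reads empty cells as NaN by default.
df = pd.read_csv(io.StringIO(csv_text))

# Keep only the rows where 'score' is not NaN.
filtered = df[df["score"].notnull()]

# Extract the cleaned column as a NumPy array.
scores = filtered["score"].to_numpy()
```

With a real file, `pd.read_csv("your_file.csv")` would replace the `io.StringIO` call.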

For example, to drop the rows that have NaN in column 3 of the following dataframe:

>>> df
     0    1    2    3    4
0  1.0  1.0  NaN  1.0  1.0
1  1.0  NaN  1.0  1.0  1.0
2  NaN  NaN  NaN  NaN  NaN
3  NaN  1.0  1.0  NaN  NaN
4  1.0  NaN  NaN  1.0  1.0

>>> df[df[3].notnull()]
     0    1    2    3    4
0  1.0  1.0  NaN  1.0  1.0
1  1.0  NaN  1.0  1.0  1.0
4  1.0  NaN  NaN  1.0  1.0
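If you would rather test individual cells while looping, as in the question's pseudo-code, something along these lines should work. It is only a sketch: the `values` series is a made-up stand-in for your column, and the doubling is a placeholder for "do stuff". Note that comparing with `== nan` will not work, because NaN never compares equal to anything, including itself; `pd.isna` is the check to use.

```python
import numpy as np
import pandas as pd

# Hypothetical column with one empty (NaN) cell.
values = pd.Series([1.0, np.nan, 3.0])

processed = []
for value in values:
    if pd.isna(value):
        continue  # empty cell: skip it
    processed.append(value * 2)  # placeholder for the real work

# Alternatively, drop the NaN entries in one step and get an array.
cleaned = values.dropna().to_numpy()
```

For large columns, filtering first with `dropna()` or `notnull()` is usually preferable to checking each cell in a Python loop.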
