Problems loading .csv file into RStudio. EOF within quoted string

Question

when I load this file Chicago_Crimes_2005_to_2007.csv (Link https://www.kaggle.com/currie32/crimes-in-chicago) into RStudio I am always getting an error (Warnmeldung: In scan(file = file, what = what, sep = sep, quote = quote, dec = dec,: EOF in Zeichenkette / English: EOF within quoted string) , and not all obervations are included. Do you know how to solve the problem? With the other 3 files I had no problems. I am using this code:

c2 = read.csv("Chicago_Crimes_2005_to_2007.csv", header = TRUE)

I tried to fix it with this code:

c2 = read.csv("Chicago_Crimes_2005_to_2007.csv", header = TRUE, quote = "", row.names = NULL, stringsAsFactors = FALSE).

Didnt work out. I tried all answers here in stackoverflow with the same error. Nothing helped. Since 1 week no success. Hope somebody can help me. Working with R in RStudio.

try reading the file using data.table::fread()... my experience it that it sometimes will automatically "repair" strange errors in the source files — Wimpel
@Wimpel thanks for your help. Tried it but getting this error: In data.table::fread("Chicago_Crimes_2005_to_2007.csv", header = TRUE) : Stopped early on line 533719. Expected 23 fields but found 46. Consider fill=TRUE and comment.char=. First discarded non-empty line: — S002
<<537288,5601758,HN409865,06/16/2007 08:15:00 PM,020XX E 94TH ST,1330,CRIMINAL TRESPASS,TO LAND,OTHER RAILROAD PROP / TRAIN DEPOT,False,False,413,4.0,8.0,48.0,26,1191237.0,1843038.0,2007,04/15/2016 08:55:02 AM,41.724300463,-87.575094193,"(41.724300463, -87.5,ID,Case Number,Date,Block,IUCR,Primary Type,Description,Location Description,Arrest,Domestic,Beat,District,Ward,Community Area,FBI Code,X Coordinate,Y Coordinate,Year,Updated On,Latitude,Longitude,Location>> — S002
Please edit your question instead of adding comments. Also note that "Zeichenkette" is just string, not quoted string. Clarify the error description --- "doesn't work" is not useful. — Robert

JayJay81 JayJay81 · Accepted Answer · 2018-12-09T20:18:06

Here you go:

require(tidyverse)
df <- readr::read_csv("Chicago_Crimes_2005_to_2007.csv")

You may decide to clean up the column names as some have spaces in them, if so:

colnames(df) <- c("rowNo",
                   "ID",
                   "Case.Number",
                   "Date",
                   "Block",
                   "IUCR",
                   "Primary.Type",
                   "Description",
                   "Location.Description",
                   "Arrest",
                   "Domestic",
                   "Beat",
                   "District",
                   "Ward",
                   "Community.Area",
                   "FBI.Code",
                   "X.Coordinate",
                   "Y.Coordinate",
                   "Year",
                   "Updated.On",
                   "Latitude",
                   "Longitude",
                   "Location")

Problems loading .csv file into RStudio. EOF within quoted string

2 Answers