I am making strings of unpredictable character sets into table, with expected number of columns. I am having a troublesome time of choosing a proper separator.
For instance, a sample table might look like:
FILENAME: foo.txt
SEPARATOR: "\u00AA"
ROW1,COL1: foo
ROW1,COL2: b,ar
ROW1,COL3: fo;obar
ROW1,COL4: bo\tt
And on.
In R I would give
read.table('foo.txt', sep="\u00AA")
and get
invalid 'sep' value: must be one byte
What separator should I use to avoid conflict with the unpredictable strings? Unicode is accepted up to \u007F, but R interprets anything higher to be multi-byte. Why?
,and include a quote character like"after you escape all instances of"in your strings? the command line toolsedis super handy for this kind of thing. - Justin?read.table). That error may be because of some other malformation in the data. You may inspect that line on the shell usingsed -n 72373p filename.txt. - asb