I am trying to use a regex SerDe to create a hive table from a text file. I am starting easy and just want to parse each word in the text document into a row. Each row has one column and that is the word.
The regex I am using is ([a-zA-z]+)
And this is the later half of the create table command I am issuing to hive,
ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.RegexSerDe' WITH SERDEPROPERTIES ( "input.regex" = "([a-zA-Z]+)", "output.format.string" = "%1$s" ) STORED AS TEXTFILE;
Currently the table contains almost all NULL.
Any help would be awesome, Thanks!