0
votes

Hello i am trying to skip the first line of a csv file when i import it to Kettle Pentaho PDI 8.1.0.

The first line has the separator declaration

sep=;

The second line has the Headers. Cause of the first line the get fields button read only two variables. The first is the sep= and the second one that does not set a name.

I tried to set that header lines are 2 ,also to escape sep= also to use the Document header lines set to 1 in order to escape the first line but the get fields button does not recognize the headers.

Is there any other idea?

1

1 Answers

1
votes

Get fields will always look at the first line. You will need to enter the field list by hand.

You were on the right track, set headers to 2 and it will read the data correctly.

If you need to parse the separator declaration you will need to parse the file once to determine its structure, then use metadata injection to read a 2nd time for the data.