Print several lines between patterns (first pattern not unique)

Question

Need help with sed/awk/grep/whatever could solve my task. I have a large file and I need to extract multiple sequential lines from it.

I have start pattern: <DN>

and end pattern: </GR>

and several lines in between, like this:

<DN>234</DN>
<DD>sdfsd</DD>
<BR>456456</BR>
<COL>6575675 sdfsd</COL>

<RAC>456464</RAC>
<GR>sdfsdfsFFFDd</GR>

I've tried this:

sed -n '/\<DN\>/,/\<\/GR\>/p'

and several other ones (using awk and sed). It works okay, but the problem is that the source file may contain lines starting with <DN> and without </GR> in the end of the bunch of lines, and then starts a part with another and normal in the end:

<DN>234</DN> - unneded DN
<AB>sdfsd</AB>
<DC>456456</DC>
<EF>6575675 sdfsd</EF>
....really large piece of unwanted text here....

<DN>234</DN>
<DD>sdfsd</DD>
<BR>456456</BR>
<COL>6575675 sdfsd</COL>

<RAC>456464</RAC>
<GR>sdfsdfsFFFDd</GR>
<RAC>456464</RAC>
<GR>sdfsdfsFFFDd</GR>

How can I extract only needed lines and ignore garbage pieces of log, containing <DN> without ending </GR>?

And next, I need to convert a multiline pieces from <DN> to </GR> to a file with single lines, starting with <DN> and ending with </GR>. Any help would be appreciated. I'm stuck

potong potong · Accepted Answer · 2014-10-14T02:47:21

This might work for you (GNU sed):

sed -n '/<DN>/{h;b};x;/./G;x;/<\/GR/{x;/./p;z;x}' file

Use the hold space to store lines between <DN> and </GR>.

Print several lines between patterns (first pattern not unique)

5 Answers