String replacement with awk based on positions in source and target

Question

Assume a multi-line text file file1, where some lines contain the keyword "keyw".

$ cat file1
foo
bar keyw
baz
keyw qux
quux

Further assume a single-line text file file2 that contains as many strings as keyword occurrences in file1. The strings in file2 are separated by single whitespaces.

$ cat file2
string1 string2

I would like to append each string of file2 to a keyword-containing line of file1 based on the respective positions:

The first string in file2 appended to the first line in file1 that contains the keyword.
The second string in file2 appended to the second line in file1 that contains the keyword.
etc.

Here is the sought output:

$ awk ... file1 file2
foo
bar keyw string1
baz
keyw qux string2
quux

What awk-code would you use to conduct this replacement?

What did you try? I am sure some of your previous awk questions lead to interesting code that can help on this topic! — fedorqui 'SO stop harming'
That is lucky that within 25 mins of asking your question you got the best best possible answer and so were able to accept it rather than waiting to see if a better answer would be posted. — Ed Morton

Akshay Hegde Akshay Hegde · Accepted Answer · 2017-10-31T15:07:21

Below one gives desired o/p shown above,

Using awk

awk '
     FNR==NR{split($0,strarr);next}
     /keyw/{$0 = $0 OFS strarr[++i]}1
    ' file2 file1

Since you said,

Further assume a single-line text file file2 that contains as many strings as keyword occurrences in file1. The strings in file2 are separated by single whitespaces.

Explanation

split($0,strarr); is used, so that it will split record by default FS single space, and elements are saved in array strarr
So whenever records matches regexp /keyw/ of file1, we print array element, and variable i will incremented, and go to next line/record
+1 at the end does default operation that is print current/record/row, print $0. To know how awk works try, awk '1' infile, which will print all records/lines, whereas awk '0' infile prints nothing. Any number other than zero is true, which triggers the default behaviour.

Test Results:

$ cat file1
foo
bar keyw
baz
keyw qux
quux

$ cat file2
string1 string2

$ awk 'FNR==NR{split($0,strarr);next}/keyw/{$0 = $0 OFS strarr[++i]}1' file2 file1
foo
bar keyw string1
baz
keyw qux string2
quux

String replacement with awk based on positions in source and target

3 Answers