Split 1 argument into 2 arguments using regexp in a bash script

Question

Here's my situation. Currently, I have a script that accepts two arguments: book name and chapter name. For example:

$ myscript book1 chap1

Now, for reasons that would take a long time to explain, I would prefer my script to be able to take a single argument of the following format: {book name}.{chapter name}. For example:

$ myscript book1.chap1

The difficulty for me is that I do not know how to take a string $1=abc.xyz and turn it into two separate variables, $var1=abc and $var2=xyz. How can I do this?

smocking smocking · Accepted Answer · 2012-07-10T15:02:09

If it's just two tags you can use a bash expression

arg=$1
beforedot=${arg%.*}
afterdot=${arg#*.}

It's faster than cut because it's a shell builtin. Note that this puts everything before the ~~first~~ last dot into beforedot and everything after into afterdot.

EDIT:

There's also a substitution/reinterpretation construct if you want to split by an arbitrary number of tokens:

string=a.b.c.d.e
tokens=(${string//\./ })

You're replacing dots by spaces and then that gets interpreted as an array declaration+definition because of the parentheses around it.

However I've found this to be less portable to bash' siblings and offspring. For example, it doesn't work in my favourite shell, zsh.

Arrays need to be dereferenced with braces and are indexed from 0:

echo "Third token: ${tokens[2]}"

You can loop through them as well by dereferencing the whole array with [@]:

for i in ${tokens[@]}
do
    # do stuff
done

Split 1 argument into 2 arguments using regexp in a bash script

6 Answers

Pattern Subsitution with Shell Parameter Expansion