Python splitting string by parentheses

Question

I asked a question a little while ago (Python splitting unknown string by spaces and parentheses) which worked great until I had to change my way of thinking. I have still not grasped regex so I need some help with this.

If the user types this:

new test (test1 test2 test3) test "test5 test6"

I would like it to look like the output to the variable like this:

["new", "test", "test1 test2 test3", "test", "test5 test6"]

In other words if it is one word seperated by a space then split it from the next word, if it is in parentheses then split the whole group of words in the parentheses and remove them. Same goes for the quotation marks.

I currently am using this code which does not meet the above standard (From the answers in the link above):

>>>import re
>>>strs = "Hello (Test1 test2) (Hello1 hello2) other_stuff"
>>>[", ".join(x.split()) for x in re.split(r'[()]',strs) if x.strip()]
>>>['Hello', 'Test1, test2', 'Hello1, hello2', 'other_stuff']

This works well but there is a problem, if you have this:

strs = "Hello Test (Test1 test2) (Hello1 hello2) other_stuff"

It combines the Hello and Test as one split instead of two.

It also doesn't allow the use of parentheses and quotation marks splitting at the same time.

@möter Do you have a link to lead me to a tutorial? Most everything I find are questions about it that don't really help me and I can't read the python docs to well. If that's all that's left it will have to do. — TrevorPeyton
Sorry, I misread the question. But here's a link to the official tutorial: docs.python.org/2/library/re.html — XORcist

TrevorPeyton TrevorPeyton · Accepted Answer · 2013-06-28T20:26:01

The answer was simply:

re.findall('\[[^\]]*\]|\([^\)]*\)|\"[^\"]*\"|\S+',strs)

Python splitting string by parentheses

5 Answers