python - Easiest way to replace a string using a dictionary of replacements?

Question

Consider..

dict = {
'Спорт':'Досуг',
'russianA':'englishA'
}

s = 'Спорт russianA'

I'd like to replace all dict keys with their respective dict values in s.

This might not be so straightforward. You should probably have an explicit tokenizer (for example {'cat': 'russiancat'} and "caterpillar"). Also overlapping words ({'car':'russiancar', 'pet' : 'russianpet'} and 'carpet'). — Joe
Also see code.activestate.com/recipes/81330-single-pass-multiple-replace — ChristopheD
As an aside: I think dict is best avoided as a variable name, because a variable of this name would shadow the built-in function of the same name. — jochen

Max Shawabkeh Max Shawabkeh · Accepted Answer · 2010-03-08T10:20:39

Using re:

import re

s = 'Спорт not russianA'
d = {
'Спорт':'Досуг',
'russianA':'englishA'
}

pattern = re.compile(r'\b(' + '|'.join(d.keys()) + r')\b')
result = pattern.sub(lambda x: d[x.group()], s)
# Output: 'Досуг not englishA'

This will match whole words only. If you don't need that, use the pattern:

pattern = re.compile('|'.join(d.keys()))

Note that in this case you should sort the words descending by length if some of your dictionary entries are substrings of others.

python - Easiest way to replace a string using a dictionary of replacements?

7 Answers