remove ascii character and replace with non-ascii

Question

I want to remove one ASCII character and then I want replace it with non-ASCII. My code is :

sed -e 's/[\d100\d130]/g'

To explain: I want to replace "100" (in ASCII ,decimal ) with "135" (in ASCII, decimal.) In short, I want to replace 2 letters and one of them will remove. This code is valid?

What does It doesn't work mean? (do you get an error? aren't the d replaced?) — gniourf_gniourf
(Apart from the obvious typo—it should be tr '\144' '\207'—see Thomas Dickey's answer). This is not going to edit your file… is this what you're expecting? — gniourf_gniourf

Thomas Dickey Thomas Dickey · Accepted Answer · 2015-10-28T09:08:27

This is not a valid sed command:

sed -e 's/[\d100\d135]/g'

Perhaps something like

sed -e 's/[\d100]/[\d135]/g'

In a quick test, this "works":

echo 'd' | sed -e 's/[\d100]/[\d135]/g'

The suggested tr command is close, but 135 translates to octal 207, e.g,

tr '\144' '\207'

In a UTF-8 system, you likely will run into problems with 135, since it is not a valid single-byte code as such. The corresponding UTF-8 encoding for 135 uses two bytes, e.g., \302\207

echo 'd' | sed -e 's/\d100/\d194\d135/g'

might be what OP intended. With my locale en_US.UTF-8, it produces a UTF-8 encoded 135 (which shows up in vi-like-emacs as \u0087: this happens to be valid UTF-8, but not a printable character since it is actually a control character in Unicode). Given more information about what OP intended for the output, better advice can be offered.

remove ascii character and replace with non-ascii

2 Answers

UTF-8

Conclusion