python - How can I escape latex code received through user input?

Question

I read in a string from a GUI textbox entered by the user and process it through pandoc. The string contains latex directives for math which have backslash characters. I want to send in the string as a raw string to pandoc for processing. But something like "\theta" becomes a tab and "heta".

How can I convert a string literal that contains backslash characters to a raw string...?

Edit:

Thanks develerx, flying sheep and unutbu. But none of the solutions seem to help me. The reason is that there are other backslashed-characters which do not have any effect in python but do have a meaning in latex.

For example '\lambda'. All the methods suggested produce

\\lambda

which does not go through in latex processing -- it should remain as \lambda.

Another edit:

If i can get this work, i think i should be through. @Mark: All three methods give answers that i dont desire.

a='\nu + \lambda + \theta'; 
b=a.replace(r"\\",r"\\\\"); 
c='%r' %a; 
d=a.encode('string_escape');
print a

u + \lambda +   heta
print b

u + \lambda +   heta
print c
'\nu + \\lambda + \theta'
print d
\nu + \\lambda + \theta

Are you sure the string really contains \\lambda and is not just doubling up when you print it? Try printing mystring[1:] and see if there is still a \ in it. There should be some consistency - if \t is converting to tab then \\ should convert to \ . — Mark Ransom
Can you post the repr of the string as received from the GUI textbox, and show the code you are using to process it through pandoc? — unutbu
Your test is unrealistic. You aren't getting it from a textbox, you're setting it with a string literal, and Python has already converted it in an inconsistent manner by the time it's assigned to a. It is impossible to get your original text back at that point. — Mark Ransom
My apologies. I was doing a silly error in reading the text from the GUI. The problem is now solved. Thanks for your comments and sorry for troubling you. — Vijay Murthy
Note that this question isn't exactly about raw strings; it's about escaping latex code. The OP mistakenly believed them to be the same thing. For a question that's actually about converting special characters into escape sequences, see here. — Aran-Fey

flying sheep flying sheep · Accepted Answer · 2011-08-31T20:23:22

Python’s raw strings are just a way to tell the Python interpreter that it should interpret backslashes as literal slashes. If you read strings entered by the user, they are already past the point where they could have been raw. Also, user input is most likely read in literally, i.e. “raw”.

This means the interpreting happens somewhere else. But if you know that it happens, why not escape the backslashes for whatever is interpreting it?

s = s.replace("\\", "\\\\")

(Note that you can't do r"\" as “a raw string cannot end in a single backslash”, but I could have used r"\\" as well for the second argument.)

If that doesn’t work, your user input is for some arcane reason interpreting the backslashes, so you’ll need a way to tell it to stop that.

python - How can I escape latex code received through user input?

5 Answers