Best method/practice to solve Cross Site Scripting (XSS) in php?

Question

I have a php webpage that uses a URL Parameter to set a variable which is then displayed within that page. URL: webaddress.com/page.php?id=someCity

We take the $_GET['id'] and assign it as a variable ($city) which is then used on the page reconstructing static text in a somewhat dynamic method.

For instance:

Welcome to our page about Somecity. We can help you find products related to someCity because we have vast experience in Somecity. Obviously this would be achieved using <?php echo $city; ?>

My client is being told he is open to a Cross Site Scripting (XSS) vulnerability. My research shows that an iFrame can then be used to steal cookies and do malicious things. The recommended solution is to use the PHP Function htmlspecialchars() which changes characters to "HTML entities". I don't understand how this is more secure than simply removing all the tags by using strip_tags().

So, I use both as well as a string replace and capitalization as this is also needed.

 $step1 = str_replace('_', ' ', $_GET['id']); // Remove underline replace with space
 $step2 = strip_tags($step1); // Strip tags
 $step3 = htmlspecialchars($step2); // Change tag characters to HTML entities
 $city = ucwords($step3);

QUESTION: Is this sufficient to prevent XSS and is it true that there would be additional benefit to htmlspecialchars() over strip_tags()? I understand the difference based on other submissions of similar questions but would like to know how each function (especially htmlspecialchars() ) prevents XSS.

Does this answer your question? Should I use both striptags() and htmlspecialchars() to prevent XSS? — Wesley Smith
The others are similar but don't provide the "why" htmlspecialchars() is sure-fire over strip_tags() which seems the most corrective. — Burndog
Are you sure? The accepted answer there does explain why pretty well — Wesley Smith
@WesleySmith the suggested similar question is not the same in that it reference two cases (either / or). A closer review of that answer and my case shows that using both in sequence IS the best method which answers my question and hopefully helps others in similar cases. — Burndog

Rob Ruchte Rob Ruchte · Accepted Answer · 2020-09-23T22:14:56

The best method is to use a mature and trusted library like HTMLPruifier to sanitize anything that's coming from an untrusted source. Simply running strip_tags is not gonna cut it, there are a lot of creative and insidious XSS attacks out there. I recommend taking a look at the OWASP recommendations for mitigating XSS. It's worth taking the time to be careful about this kind of thing and actually test for vulnerabilities during development.

If you're new to this, I think it's also worth looking into some white-hat capture the flag style infosec training (there are tons of free resources available) so you get get an idea of how these kind of attacks work in the real world. It's pretty eye-opening to see how clever they can get.

Best method/practice to solve Cross Site Scripting (XSS) in php?

4 Answers