Normalization in DOM parsing with java - how does it work?

Question

I saw the line below in code for a DOM parser at this tutorial.

doc.getDocumentElement().normalize();

Why do we do this normalization ?
I read the docs but I could not understand a word.

Puts all Text nodes in the full depth of the sub-tree underneath this Node

Okay, then can someone show me (preferably with a picture) what this tree looks like ?

Can anyone explain me why normalization is needed?
What happens if we don't normalize ?

Irrespective of your question, please read the note on the example: "DOM Parser is slow and will consume a lot of memory when it loads an XML document which contains a lot of data. Please consider SAX parser as solution for it, SAX is faster than DOM and use less memory.". — wulfgarpro
@wulfgar.pro - I understand what you said. But, I want to understand the stuff I asked in the question. I will also do SAX parsing soon. — Apple Grinder
Searching google for "normalize xml" gave some results that seem useful. It looks like its similar to normalization in databases. — Apple Grinder
@EJP - umm...its still not clear because i don't know xml in depth and i only read a few introductory pages on it. BTW, dont get me wrong, you did exactly what the author of the doc did - using complex words instead of plain english (plain as a pike staff = easy to understand). Simple words first and jargon later works better for me. — Apple Grinder
As of this writing the referenced website is referencing this SO post. My brain just threw a dependency error. — chessofnerd

JB Nizet JB Nizet · Accepted Answer · 2012-12-09T13:07:41

The rest of the sentence is:

where only structure (e.g., elements, comments, processing instructions, CDATA sections, and entity references) separates Text nodes, i.e., there are neither adjacent Text nodes nor empty Text nodes.

This basically means that the following XML element

<foo>hello 
wor
ld</foo>

could be represented like this in a denormalized node:

Element foo
    Text node: ""
    Text node: "Hello "
    Text node: "wor"
    Text node: "ld"

When normalized, the node will look like this

Element foo
    Text node: "Hello world"

And the same goes for attributes: <foo bar="Hello world"/>, comments, etc.

Normalization in DOM parsing with java - how does it work?

3 Answers