I'm currently working on a way to convert a wordml-xml (or rather the body-part) into a valid xhtml/html format. The reason for that is that there are a bunch of breaks, paragraphs and so on that I want to display properly in my WebForms-Application.
I've been searching for ways to do this for the past few hours and the only thing I found that somewhat resembles my issues is the following Blog (https://msdn.microsoft.com/en-us/library/ff628051(v=office.14).aspx#XHtml_Using). The problem is that the transformation is based on .docx and not on XML. I could try to convert the XML into a docx and work with that, but that wouldn't really be an effective way to deal with it. not to mention that I'd have to find a way to convert the XML into docx first.
I really hope that somebody out there can help me with this, because I'm somewhat out of ideas.
Thanks in advance, snap.
Example: The w:body-Element inside of the XML looks like this:
<w:body xmlns:w="http://schemas.microsoft.com/office/word/2003/wordml">
<wx:sect xmlns:wx="http://schemas.microsoft.com/office/word/2003/auxHint">
<w:p wsp:rsidR="00FF5F75" wsp:rsidRDefault="00626E80" xmlns:wsp="http://schemas.microsoft.com/office/word/2003/wordml/sp2">
<w:r wsp:rsidRPr="00EA67E2">
<w:rPr>
<w:rFonts w:fareast="Times New Roman" />
<w:sz w:val="26" />
<w:sz-cs w:val="26" />
<w:lang w:fareast="JA" />
</w:rPr>
<w:t>Leider können wir die Kosten für die Impfung gegen %</w:t>
</w:r>
<w:r wsp:rsidRPr="00EA67E2">
<w:rPr>
<w:rFonts w:fareast="Times New Roman" />
<w:sz w:val="26" />
<w:sz-cs w:val="26" />
<w:highlight w:val="yellow" />
<w:lang w:fareast="JA" />
</w:rPr>
<w:t>XY</w:t>
</w:r>
<w:r wsp:rsidRPr="00EA67E2">
<w:rPr>
<w:rFonts w:fareast="Times New Roman" />
<w:sz w:val="26" />
<w:sz-cs w:val="26" />
<w:lang w:fareast="JA" />
</w:rPr>
<w:t>% nicht übernehmen.</w:t>
</w:r>
<w:r wsp:rsidRPr="00EA67E2">
<w:rPr>
<w:rFonts w:fareast="Times New Roman" />
<w:sz w:val="26" />
<w:sz-cs w:val="26" />
<w:lang w:fareast="JA" />
</w:rPr>
<w:br />
</w:r>
<w:r wsp:rsidRPr="00EA67E2">
<w:rPr>
<w:rFonts w:fareast="Times New Roman" />
<w:sz w:val="26" />
<w:sz-cs w:val="26" />
<w:lang w:fareast="JA" />
</w:rPr>
<w:br />
<w:t>Die DAK-Gesundheit zahlt Ihnen die Impfungen, die in den Schutzimpfungs-Richtlinien des Gemeinsamen Bundesausschusses genannt sind. Die Impfung gegen %</w:t>
</w:r>
....
In a regular word-document, where this thing is part of an Add-in word Displays as a break etc. What I want is to convert these elements to proper HTML/XHTML.