Apologies in advance if this question is not appropriate for this website.
I have written some documents in Microsoft Word which I need to also display on a website as HTML. To do this I need to enter the content of these documents into a database with HTML tags. So for example this is what I need to put in the database:
<h1>Document Title</h1>
<p>This is the introduction paragraph for the document</p>
<ol>
<li>This is a summary point</li>
</ol>
My problem is that saving the Microsoft Word as a HTML page adds so much extra markup (mainly presentational with inline CSS) that its hard for me to strip it out to its basic HTML structure like in my example above.
So how does one keep offline and online content in sync? I wanted to avoid making two versions of the same document (one in Word and one in HTML) because keeping them in sync would be difficult.
Can MS Word be setup to save as HTML without any presentational formatting? Or is there a different piece of software I should be using?