Word to HTML without pain?
Via Jeff Zeldman comes word of a nifty-sounding cleaner for those Word docs that you have to post as HTML:
"We must put all our Word documents online." Eight words that strike fear into the hearts of web professionals everywhere. Microsoft Word is the default tool of businesses and organizations, and it includes a button to convert documents into HTML. But the HTML Word generates is littered with invalid, proprietary tags and attributes that are included, not to facilitate web publishing, but to ensure that Word docs and Word-generated HTML cannot be edited by non-Microsoft programs and will not display correctly in non-Microsoft browsers. It is the bane of web agencies, freelancers, and in-house professionals charged with the maintenance of large content sites with low budgets.
Textism's Word HTML Cleaner cleans up the junk HTML that Microsoft Word generates, removing proprietary crud while leaving basic formatting and typographic entities (like curly quotes) intact. Created by Textism's Dean Allen, the online tool is available for your use absolutely free. This product is a life saver, and although it is offered free of charge, you might want to slip a buck or two into Mr Allen's virtual tip jar.
I'll have to have a look at that.
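In the meantime, here is a rough sketch of the kind of cleanup a tool like this might do. To be clear, this is not how Textism's cleaner works (I haven't seen its code); it is just my guess at the general idea, written in Python with a hypothetical function name, clean_word_html. It assumes the usual Word debris: conditional comments, namespaced Office tags such as <o:p>, and class, style and lang attributes. It uses regular expressions rather than a real HTML parser, so it can be fooled by body text that happens to look like an attribute.

```python
import re

def clean_word_html(html: str) -> str:
    """Strip common Word-specific markup, keeping basic tags and text.

    Hypothetical helper for illustration only; the patterns below are
    assumptions about typical Word output, not a complete cleaner.
    """
    # Remove conditional comments, e.g. <!--[if gte mso 9]> ... <![endif]-->
    html = re.sub(r"<!--\[if.*?<!\[endif\]-->", "", html,
                  flags=re.DOTALL | re.IGNORECASE)
    # Remove any remaining ordinary comments
    html = re.sub(r"<!--.*?-->", "", html, flags=re.DOTALL)
    # Remove namespaced Office tags such as <o:p>, </o:p>, <w:...>, <v:...>
    html = re.sub(r"</?[a-z]+:[^>]*>", "", html, flags=re.IGNORECASE)
    # Remove class, style and lang attributes (quoted or unquoted values)
    html = re.sub(
        r"""\s+(?:class|style|lang)\s*=\s*(?:"[^"]*"|'[^']*'|[^\s>]+)""",
        "", html, flags=re.IGNORECASE)
    # Unwrap <span> tags, keeping the text inside them
    html = re.sub(r"</?span[^>]*>", "", html, flags=re.IGNORECASE)
    return html


if __name__ == "__main__":
    messy = ("<p class=MsoNormal style='margin:0'><span lang=EN-US>"
             "\u201cHello\u201d <b>world</b><o:p></o:p></span></p>")
    print(clean_word_html(messy))
```

Basic formatting like <b> and the curly quotes pass through untouched, which is the behaviour the description above promises. For anything beyond a quick experiment, a proper HTML parser would be the safer choice.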


