
Convert MS Word to XML/HTML on Linux

I need to convert an MS Word file to XML or HTML while preserving the structure of the file (mainly tables). I happened to find Tika, which is quite powerful in extracting text from

Solution 1:

Install the OpenOffice SDK; it offers a powerful API for working with all kinds of documents, including conversions.

http://www.oooforum.org/forum/viewtopic.phtml?t=7242
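
As a rough illustration of driving the office suite for a conversion, here is a minimal Python sketch. It does not use the SDK API itself. It assumes instead that a LibreOffice-style `soffice` binary is on the PATH and accepts the `--headless`, `--convert-to` and `--outdir` options; an older OpenOffice install may not. The helper names `build_convert_command` and `convert` are made up for this example.

```python
import shutil
import subprocess
from pathlib import Path


def build_convert_command(source, out_dir, target_format="html", binary="soffice"):
    """Return the argument list for a headless conversion (does not run it).

    Assumption: `binary` is a LibreOffice-style soffice that understands
    --headless, --convert-to and --outdir.
    """
    return [
        binary,
        "--headless",
        "--convert-to", target_format,
        "--outdir", str(out_dir),
        str(source),
    ]


def convert(source, out_dir, target_format="html", binary="soffice"):
    """Run the conversion and return the path where the output is expected."""
    if shutil.which(binary) is None:
        raise FileNotFoundError(f"{binary!r} not found on PATH")
    command = build_convert_command(source, out_dir, target_format, binary)
    subprocess.run(command, check=True)
    # Assumes the output keeps the source file's base name with the new extension.
    return Path(out_dir) / (Path(source).stem + "." + target_format)
```

Calling `convert("report.doc", "out")` would then be expected to leave `out/report.html` behind. Whether tables survive intact depends on the document and the export filter, so check the result on a sample file first.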
