I need to convert MSWord file into XML or HTML, while preserving the structure of the file (mainly tables). I happened to find tika, which is quite powerful in extracting text from
Solution 1:
Install OpenOffice SDK, it offers powerfull API for all kinds of documents (including conversions).
Post a Comment for "Convert Msword To Xml/html On Linux"