This is the mail archive of the
xsl-list@mulberrytech.com
mailing list .
You can convert a Word document to HTML using
- From: Micky Allen <fisi at salamander dot co dot uk>
- To: xsl-list at lists dot mulberrytech dot com
- Date: Wed, 06 May 1987 23:40:42 +0100
- Subject: [xsl] You can convert a Word document to HTML using
- Reply-to: xsl-list at lists dot mulberrytech dot com
Hi there XML'ers
There was an IBM tip on this topic recently, I enclose the
summary and the link to the tip below
------------------------------------------------------------------
SUMMARY
We've just explored how to convert Word documents directly into
sdocbook XML using MajiX. Once you have converted a set of Word
documents into sdocbook XML files and stored them in a directory, you
can easily write a program that parses all these sdocbook XML files,
looks for the <title> tags, and generates a Web page that contains
links to all these sdocbook XML files. To display the sdocbook XML
files as HTML on the fly, an XSLT processor such as Xalan can be used
to reformat the sdocbook XML files into HTML using sdocbook
stylesheets.
::: Full text of this tip on the Web :::
http://www-106.ibm.com/developerworks/library/x-tipword.html/?nx-4232
----------------------------------------------------------------------------
---------
cheers
Micky
<www.contaminatedland.co.uk>
>Subject: RE: [xsl] One texdocument in and several xmldocuments out?
>
> File / Save As... and
>selecting HTML or Filtered HTML. The difference between these two is
>HTML preserves all of Word's information such as <span> tags to mark
>spelling and grammar issues, whereas Filtered HTML drops the
>Word-specific tags. Then follow the advice already provided here (e.g.,
>Tidy) to ensure that the HTML is well-formed XML.
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list