This is the mail archive of the
xsl-list@mulberrytech.com
mailing list .
Re: any good solution for covert html file into xml file.
- From: Larry Garfield <lgarfiel at students dot depaul dot edu>
- To: xsl-list at lists dot mulberrytech dot com
- Date: Thu, 15 Aug 2002 06:45:38 -0500
- Subject: Re: [xsl] any good solution for covert html file into xml file.
- Organization: DePaul University
- References: <200208150116.VAA08759@biglist.com>
- Reply-to: xsl-list at lists dot mulberrytech dot com
First question, is the HTML document sufficiently structured already
that you can extract the needed information? If not, then you're
looking at a headache no matter what you do. :-)
First things first, convert the html files into xhtml files, to make
them XSLT parsable. HTML Tidy now does XHTML, I think. If the files
are already clean HTML and not something that happens to produce the
right output in IE by sheer luck, then a Perl script could probably do
it for you as well.
After that, figure out what the mapping is between the elements and
structure of your XHTML file and the target DTD, then go at it a piece
at a time in XSLT, divide-and-conquer style.
I really can't be more specific without knowing more about your project,
but hopefully that should point you in the right general direction.
stoneson wrote:
>
> Hello all:
>
> I have to translate all html file into xml source with given formate and dtd. Is anyone have any good solution??
> = = = = = = = = = = = = = = = = = = = =
--
Larry Garfield AIM: LOLG42
lgarfiel@students.depaul.edu ICQ: 6817012
-- "If at first you don't succeed, skydiving isn't for you." :-)
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list