This is the mail archive of the
xsl-list@mulberrytech.com
mailing list .
RE: XSLT match with regex what's the best current solution?
- From: "Steven Noels" <stevenn at outerthought dot org>
- To: <xsl-list at lists dot mulberrytech dot com>
- Date: Tue, 15 Jan 2002 00:08:44 +0100
- Subject: RE: [xsl] XSLT match with regex what's the best current solution?
- Reply-to: xsl-list at lists dot mulberrytech dot com
Hi Gunther,
as you can read in the regular expression thread
http://www.biglist.com/lists/xsl-list/archives/200201/msg00488.html and
further on, we are working on a tool which might be helpful for your
purposes.
It is some mixture between regexes and an XSLT-like language, and we
have called it regexslt. It is implemented in Java, and could be some
sort of pre-processor to transform your non-XML documents into XML and
use XSLT thereafter. It is by no means as sophisticated as Omnimark and
the like, but you would be welcome to give it a try if you would be
interested.
You'll find some code samples in the thread I referred to - and we will
announce the first dot-oo release on this list (which would coincide
with a new release of our website). If we can gather enough community
around it, we would consider it becoming a Sourceforge project.
Regards,
Steven Noels
http://outerthought.org/
(+32)478 292900
> -----Original Message-----
> From: owner-xsl-list@lists.mulberrytech.com
> [mailto:owner-xsl-list@lists.mulberrytech.com]On Behalf Of Gunther
> Schadow
> Sent: maandag 14 januari 2002 23:46
> To: xsl-list@lists.mulberrytech.com
> Subject: [xsl] XSLT match with regex what's the best current solution?
>
>
> Hi,
>
> [for once this has nothing to do with dynamic scoping :-)]
>
> I am working on a suite of scripts that induce structure in
> free text and eventually capture fine grained medical information.
> I have been using AWK so far, but I am thinking about making
> this a process largely of XML transformations. However, since I
> must induce XML structure from semi-structured free text I need
> some more parsing support. First, regular expressions. I know
> there is EXSLT but are regex matches and replaces supported
> in SAXON (I love SAXON, so I would prefer using it.)
>
> Also, any ideas of additional parsing tools and their integration
> into XSLT would be appreciated. Is there a way of running XSLT
> in line-mode and have every line matched against regular
> expressions? Well, I suppose so, with a simple sed script I could
> first wrap each line into a <line>...</line> tag and then use regex
> match on the text node of each <line> element.
>
> Is SAXON easy to extend? I suppose there is some documentation
> of SAXON that tells me how to write extensions in Java, right?
> Any reason why it would be better to use something other than
> SAXON if my platform is Java and I'm not interested in Web stuff
> (in which case I would look into the Apache work.)
>
> thanks for your ideas,
> -Gunther
>
> --
> Gunther Schadow, M.D., Ph.D.
> gschadow@regenstrief.org
> Medical Information Scientist Regenstrief Institute for
> Health Care
> Adjunct Assistant Professor Indiana University School
> of Medicine
> tel:1(317)630-7960
> http://aurora.regenstrief.org
>
>
>
> XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list