This is the mail archive of the xsl-list@mulberrytech.com mailing list.



Re: PDF to FO tool?


jeremyf@isscorp.com wrote:
> 
> I have been earnestly trying to find a tool that
> converts a .PDF file to .FO or some close
> approximation of that.

This problem is Hard in the general case. I have had some success with pdftohtml <http://pdftohtml.sourceforge.net/> in XML mode. Last time I did this, pdftohtml didn't create FO, and actually needed some help to generate well-formed XML. The output was useful, though.
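
As a rough illustration of the kind of "help" the XML output may need, here is a minimal Python sketch. It assumes the breakage comes from control characters that XML 1.0 forbids, which is only one possible cause. The helper names (run_pdftohtml_xml, strip_invalid_chars, is_well_formed) are made up for this example, and the assumption that pdftohtml writes its result to the output base name plus ".xml" should be checked against your version of the tool.

```python
import re
import subprocess
import xml.etree.ElementTree as ET

# Control characters that XML 1.0 does not allow (everything below 0x20
# except tab, line feed and carriage return).
_INVALID_XML_CHARS = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f]")


def run_pdftohtml_xml(pdf_path, out_base):
    """Run pdftohtml in XML mode (hypothetical wrapper).

    Assumption: the tool is on PATH and writes out_base + ".xml".
    """
    subprocess.run(["pdftohtml", "-xml", pdf_path, out_base], check=True)
    return out_base + ".xml"


def strip_invalid_chars(text):
    """Remove characters that can never appear in an XML 1.0 document."""
    return _INVALID_XML_CHARS.sub("", text)


def is_well_formed(text):
    """Return True if the text parses as XML, False otherwise."""
    try:
        ET.fromstring(text)
        return True
    except ET.ParseError:
        return False
```

Typical use would be to read the generated file, call strip_invalid_chars on its contents, and only hand the result to an XSLT processor once is_well_formed returns True. Other problems, such as unescaped markup or overlapping tags, would need their own fixes.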

Beware of multicolumn output, weird character encoding and/or weird character subsetting. Here be dragons. Big ones.

I'm told that a company here in Toronto founded by one of the SoftQuad people is working on a commercial solution to this with good AI.

 Stewart



 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list

