This is the mail archive of the docbook-apps@lists.oasis-open.org mailing list .


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

[docbook-apps] dsssl/openjade: Trouble with half-width kanji characters


I am having trouble processing an XML document which contains a mix of
standard-width and half-width kanji characters.

When I process the document to RTF with OpenJade and the DocBook DSSSL
print stylesheets, the half-width kanji characters appear as empty boxes
because the font chosen for them is Arial.  If I select these characters
in Word and manually change the font to a kanji-aware font (such as
mincho or ms gothic) they display correctly.

When I process this document into HTML with Saxon and the DocBook XSLT
stylesheets, everything works as expected.  Similarly, going to PDF with
FOP and the DocBook FO stylesheets works correctly.

My theory is that the half-width characters are single-byte instead of
double-byte and the stylesheets assume that they are western characters
instead of kanji.  As such, they apply a western font.

This is 100% reproducible.  I'll try to include the XML file below, but
I'm not sure that the file's encoding will be preserved at every step in
the mail toolchain.  The last 4 characters in the <title> are
half-width.

Is there anything I can do to fix this situation?

thanks

-karl critz

-----------------------------------------------

<?xml version="1.0" encoding="shift_jis"?>
    <!DOCTYPE book PUBLIC '-//OASIS//DTD DocBook V4.2//EN'>    
<book lang="ja">
 
<title>&#12524;&#12509;&#12540;&#12488;&#12539;&#12472;&#12455;&#12493;&
#12524;&#12540;&#12479;&#12398;&#65409;&#65386;&#65391;&#65400;</title>
   <bookinfo>
 
<title>&#12524;&#12509;&#12540;&#12488;&#12539;&#12472;&#12455;&#12493;&
#12524;&#12540;&#12479;&#12398;&#65409;&#65386;&#65391;&#65400;</title>
 
<subtitle>(2)&#12473;&#12490;&#12483;&#12503;&#12471;&#12519;&#12483;&#1
2488;&#12398;&#65409;&#65386;&#65391;&#65400;</subtitle>
      <author>
 
<firstname>&#12469;&#12452;&#12496;&#12493;&#12483;&#12488;&#65404;&#654
05;&#65411;&#65425;</firstname>
      </author>
      <pubdate>10-Jan-2005 21:45:15</pubdate>
      <abstract>
 
<para>&#12371;&#12398;&#12501;&#12449;&#12452;&#12523;&#12399;&#12473;&#
12490;&#12483;&#12503;&#12471;&#12519;&#12483;&#12488;&#12398;&#34920;&#
31034;&#12434;&#65409;&#65386;&#65391;&#65400;&#12375;&#12414;&#12377;&#
12290;</para>
      </abstract>
      <legalnotice>
 
<para>&#12522;&#12540;&#12460;&#12523;&#12539;&#65417;&#65392;&#65411;&#
65384;&#65405;</para>
      </legalnotice>
   </bookinfo>
   <chapter>
 
<title>&#12475;&#12463;&#12471;&#12519;&#12531;&#12539;&#65408;&#65394;&
#65412;&#65433;</title>
 
<para>&#12371;&#12428;&#12399;&#12497;&#12521;&#12464;&#12521;&#12501;&#
12398;&#65403;&#65437;&#65420;&#65439;&#65433;&#12391;&#12377;&#12290;</
para>
      <para>
         <blockquote>
 
<para>&#12471;&#12473;&#12486;&#12512;&#12539;&#12473;&#12490;&#12483;&#
12503;&#12471;&#12519;&#12483;&#12488;&#12398;&#65399;&#65388;&#65420;&#
65439;&#65404;&#65390;&#65437;</para>
         </blockquote>
         <blockquote>
 
<para>&#12473;&#12467;&#12540;&#12503;&#12539;&#12473;&#12490;&#12483;&#
12503;&#12471;&#12519;&#12483;&#12488;&#12398;&#65399;&#65388;&#65420;&#
65439;&#65404;&#65390;&#65437;</para>
         </blockquote>
      </para>
   </chapter>
</book>


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]