[xep-support] Re: thai line break issues

Vladyslav Sivyakov vsivyakov at renderx.com
Mon Jun 5 13:43:59 PDT 2017


Hello Mok,

We are aware about this problem. The newest versions of XEP rely on 
Java's native mechanism of word breaking, and it gives incorrect results 
in several cases:

* some Thai words are treated incorrectly; besides ถัดไป, I can recall 
แสัดง and เคาเตอร์ that break in the middle while it should not;

* Latin script (e.g., English words) staying within fo:block 
language="th" is also treated incorrectly, including commas, quote 
marks, and open/close parenthesis.

We keep this problem in mind and we hope to get it fixed at some time.

As a partial workaround the problem with Latin script, make sure that 
all English text is properly marked with "language" attribute.

Good bye,
Vladyslav Y. Sivyakov, RenderX.

On 05/06/17 14:33, Mok wrote:
> there seems to be some issues with line-breaking in thai.
> here is the simple fo file:
>
> <fo:root xmlns:fo="http://www.w3.org/1999/XSL/Format" language="th" 
> font-family="Kinnari, Arial">
>   <fo:layout-master-set>
>     <fo:simple-page-master page-height="297mm" page-width="210mm" 
> margin-left="21.09mm" master-name="proc">
>       <fo:region-body column-count="2"/>
>     </fo:simple-page-master>
>   </fo:layout-master-set>
>   <fo:page-sequence master-reference="proc" language="th">
>     <fo:flow flow-name="xsl-region-body">
>       <fo:block>thai word breaking issues</fo:block>
>       <fo:block>3. ไปที่ "เปิดใช้งานใบอนุญาตการผลิต (Activate 
> Production License)" เมื่อคุณล็อกอิน หน้าต่างจะปรากฏขึ้น คลิ๊กที่ภาพ 
> "เว็บไซต์สมาคมช่างเทคนิคมืออาชีพ (Professional Technician Society 
> website)"</fo:block>
>       <fo:block>4. คลิ๊กที่ "ถัดไป (Next)” เพื่อดำเนินการในขั้นตอนถัดไป</fo:block>
>       <fo:block>5. กรอกข้อมูลที่เกี่ยวข้องและคลิ๊กที่ "อัพเดต (Update)"</fo:block>
>       <fo:block>9. หน้าจอจะแสดงหน้าต่างป็อปอัพดังที่แสดงในหน้าด้านบน 
> เพื่อแจ้งว่าเปิดใช้งานได้สำเร็จ ให้คลิ๊ก “ปิด (Close)” IDS (Integrated 
> Diagnostic System) จะสามารถทำงานได้ตามปกติ</fo:block>
>     </fo:flow>
>   </fo:page-sequence>
> </fo:root>
>
> and the results of rendering in 4.16 and 4.25 (thai416.pdf and 
> thai425.pdf) with the issues highlighted in yellow.
>
> in the lines starting with "3.", "5." and "9.", you can see that in 
> 4.25 the lines are breaking in the at ( and ", when it did not in 4.16.
> (however, in 4.16 the ascendants markings are in the wrong place which 
> is fixed in 4.25)
>
> in the line starting with "4." the word เพื่อดำเนินการในขั้นตอนถัดไป 
> is breaking between 2 lines (both in 4.16 and 4.25) and thereby given 
> a different meaning to the word when it is kept together.
>
> please advise
>
>
>
>
>
>
>
> -- 
> Chee Kong Mok
> cheekongmok at gmail.com <mailto:cheekongmok at gmail.com>
> tel. +44 7908 716822
>
>
>
> _______________________________________________
> (*) To unsubscribe, please visit http://lists.renderx.com/mailman/options/xep-support
> (*) By using the Service, you expressly agree to these Terms of Service http://w
> ww.renderx.com/terms-of-service.html

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.renderx.com/pipermail/xep-support/attachments/20170605/b8ecf110/attachment.html>


More information about the Xep-support mailing list