Bug#366992: [debiandoc-sgml-pkgs] Bug#366992: debiandoc-sgml: [INTL:uk] Ukrainian language support

Jens Seidel jensseidel at users.sf.net
Wed May 17 20:00:37 UTC 2006


Hi Eugeniy,

On Fri, May 12, 2006 at 08:19:50PM +0300, Eugeniy Meshcheryakov wrote:
> Please, apply attached patch for Ukrainian language support. This patch
> also contains some fixes needed for UTF-8 support (not ideal, but at
> least useable).
> 
> The patch was made by Borys Yanovych and improved by me.
 
first of all I want to thank you and Borys (isn't Boris the latin
representation?) for this patch.
 
UTF-8 support is really a nice add on, even if it should not be
necessary to support Ukrainian language as Russian demonstrates and
considering the fact that Ukrainian shares the same alphabet as Russian
(except of course the additional i/I character).

The only problem I could imagine is that SGML will not or wrongly complain about
invalid characters. I have to check this.

> diff -urN debiandoc-sgml-1.1.95/sgml/dtd/debiandoc.dcl /home/eugen/borman/devel1/debiandoc-sgml+uk-1.1.95/sgml/dtd/debiandoc.dcl
> --- debiandoc-sgml-1.1.95/sgml/dtd/debiandoc.dcl	2001-04-18 03:07:10.000000000 +0300
> +++ /home/eugen/borman/devel1/debiandoc-sgml+uk-1.1.95/sgml/dtd/debiandoc.dcl	2006-05-07 17:43:31.000000000 +0300
> @@ -15,7 +15,7 @@
>            32 95 32
>           127  1 UNUSED
>  BASESET  "ISO Registration Number 100//CHARSET ECMA-94 Right Part of Latin Alphabet Nr. 1//ESC 2/13 4/1"
> -DESCSET  128 32 UNUSED
> +DESCSET  128 32 32
>           160 96 32
>  CAPACITY PUBLIC    "ISO 8879:1986//CAPACITY Reference//EN"
>  SCOPE    DOCUMENT
> @@ -23,10 +23,7 @@
>  SHUNCHAR CONTROLS   0   1   2   3   4   5   6   7   8   9
>                     10  11  12  13  14  15  16  17  18  19
>                     20  21  22  23  24  25  26  27  28  29
> -                   30  31                     127 128 129
> -                  130 131 132 133 134 135 136 137 138 139
> -                  140 141 142 143 144 145 146 147 148 149
> -                  150 151 152 153 154 155 156 157 158 159
> +                   30  31                     127 

A stupid question from my side, but could you please explain this?
That's Ardo's code and I'm not familiar with it.

> diff -urN debiandoc-sgml-1.1.95/tools/lib/Locale/Alias.pm /home/eugen/borman/devel1/debiandoc-sgml+uk-1.1.95/tools/lib/Locale/Alias.pm
> --- debiandoc-sgml-1.1.95/tools/lib/Locale/Alias.pm	2005-05-26 23:50:38.000000000 +0300
> +++ /home/eugen/borman/devel1/debiandoc-sgml+uk-1.1.95/tools/lib/Locale/Alias.pm	2006-05-07 14:55:27.000000000 +0300
> @@ -161,6 +161,10 @@
>  		   'tr_TR'			=> 'tr_TR.ISO8859-9',
>  		   'tr_TR.ISO8859-9'		=> 'tr_TR.ISO8859-9',
>  
> +		   'uk'				=> 'uk_UA.UTF-8',
> +		   'uk_UA'			=> 'uk_UA.UTF-8',
> +		   'uk_UA.UTF-8'		=> 'uk_UA.UTF-8',
> +		   

I wonder why you do not add a 8 bit encoding as well, but maybe it should
be autogenerated from UTF-8 mode/strings in debiandoc-sgml itself.

> +	   'after begin document' => '\\renewcommand{\\vpageref}[1]{на стор. \\pageref{#1}}',

Probably you want to add a non breakable space "~" in front of the number?

> +	   'pdfhyperref' => 'unicode'

If I remember correctly this is only supported in Acrobat to properly
show bookmarks. xpdf and other PDF viewer just display garbage
(independent of the unicode option).

Spasibo,
Jens




More information about the Debiandoc-sgml-pkgs mailing list