r17755 - in /branches/upstream/libmime-encwords-perl/current: Changes EncWords.pm META.yml README

Mon Mar 17 19:38:28 UTC 2008

Author: gregoa-guest
Date: Mon Mar 17 19:38:28 2008
New Revision: 17755

URL: http://svn.debian.org/wsvn/?sc=1&rev=17755
Log:
[svn-upgrade] Integrating new upstream version, libmime-encwords-perl (1.005)

Modified:
    branches/upstream/libmime-encwords-perl/current/Changes
    branches/upstream/libmime-encwords-perl/current/EncWords.pm
    branches/upstream/libmime-encwords-perl/current/META.yml
    branches/upstream/libmime-encwords-perl/current/README

Modified: branches/upstream/libmime-encwords-perl/current/Changes
URL: http://svn.debian.org/wsvn/branches/upstream/libmime-encwords-perl/current/Changes?rev=17755&op=diff
==============================================================================

--- branches/upstream/libmime-encwords-perl/current/Changes (original)
+++ branches/upstream/libmime-encwords-perl/current/Changes Mon Mar 17 19:38:28 2008
@@ -1,6 +1,13 @@
 2008-03-16	Hatuka*nezumi - IKEDA Soji	<hatuka at nezumi.nu>
 
-	* Release 1.004.
+	* Release 1.005.
+	* Fix: Injected bug on _UNICODE_ conversion.
+	* Fix: decode_mimewords(): line folding of encoded text
+	  is preserved in the result.
+
+2008-03-16	Hatuka*nezumi - IKEDA Soji	<hatuka at nezumi.nu>
+
+	* Release 1.004 - withdrawn.
 	* By this release we require OO interface of MIME::Charset
 	  1.001 or later.
 	* Fix: encode_mimewords(): Newlines were encoded when

Modified: branches/upstream/libmime-encwords-perl/current/EncWords.pm
URL: http://svn.debian.org/wsvn/branches/upstream/libmime-encwords-perl/current/EncWords.pm?rev=17755&op=diff
==============================================================================
--- branches/upstream/libmime-encwords-perl/current/EncWords.pm (original)
+++ branches/upstream/libmime-encwords-perl/current/EncWords.pm Mon Mar 17 19:38:28 2008
@@ -120,7 +120,7 @@
 #------------------------------
 
 ### The package version, both in 1.23 style *and* usable by MakeMaker:
-$VERSION = '1.004';
+$VERSION = '1.005';
 
 ### Public Configuration Attributes
 our $Config = {
@@ -302,6 +302,9 @@
                             }xgi) {
 	    ($word, $charset, $encoding, $enc) = ($&, $1, lc($2), $3);
 	    my $tspc = $4;
+	    # unfold
+	    $tspc =~ s/(?:\r?\n|\r)([\t ])/$1/g;
+	    $tspc =~ s/\r?\n|\r/ /g;
 
 	    # RFC 2231 section 5 extension
 	    if ($charset =~ s/^([^\*]*)\*(.*)/$1/) {
@@ -375,7 +378,6 @@
 	    unless ($t->[1]) {
 		my $charset = &MIME::Charset::_detect_7bit_charset($t->[0]);
 		if ($charset and $charset ne &MIME::Charset::default()) {
-		    $t->[0] =~ s/[\r\n\t ]+/ /g;
 		    $t->[1] = $charset;
 		}
 	    }
@@ -414,17 +416,14 @@
     $charset->{OutputCharset} = $cset->as_string;
     $charset->{Encoder} = $cset->decoder;
 
-    my $converted;
+    my $converted = $s;
     if (is_utf8($s) or $s =~ $WIDECHAR) {
-	if ($charset->output_charset eq "_UNICODE_") {
-	    $converted = $s;
-	} else {
+	if ($charset->output_charset ne "_UNICODE_") {
 	    $converted = $charset->encode($s);
 	}
     } elsif ($charset->output_charset eq "_UNICODE_") {
 	if (!$charset->decoder) {
 	    if ($s =~ $UNSAFE) {
-		$converted = $s;
 		$@ = '';
 		eval {
 		    $converted = decode("UTF-8", $converted, FB_CROAK());
@@ -439,8 +438,6 @@
 	}
     } elsif ($charset->decoder) {
 	$converted = $charset->encode($s);
-    } else {
-	$converted = $s;
     }
 
     $@ = $preserveerr;
@@ -523,7 +520,7 @@
 care of character boundaries of multibyte sequences when Unicode/multibyte
 support is enabled.
 Portions for unencoded data should include surrounding whitespace(s), or
-they will be merged into adjoining encoded word(s).
+they will be merged into adjoining encoded-word(s).
 
 Any arguments past the RAW string are taken to define a hash of options:
 
@@ -553,7 +550,7 @@
 charset is recommended: see L<MIME::Charset>);
 C<"s"> will choose shorter one of either C<"q"> or C<"b">.
 B<NOTE>
-As of release 1.004, The default was changed from C<"q">
+As of release 1.005, The default was changed from C<"q">
 (the default on MIME::Words) to C<"a">.
 
 =item Field

Modified: branches/upstream/libmime-encwords-perl/current/META.yml
URL: http://svn.debian.org/wsvn/branches/upstream/libmime-encwords-perl/current/META.yml?rev=17755&op=diff
==============================================================================
--- branches/upstream/libmime-encwords-perl/current/META.yml (original)
+++ branches/upstream/libmime-encwords-perl/current/META.yml Mon Mar 17 19:38:28 2008
@@ -1,7 +1,7 @@
 # http://module-build.sourceforge.net/META-spec.html
 #XXXXXXX This is a prototype!!!  It will change in the future!!! XXXXX#
 name:         MIME-EncWords
-version:      1.004
+version:      1.005
 version_from: EncWords.pm
 installdirs:  site
 requires:

Modified: branches/upstream/libmime-encwords-perl/current/README
URL: http://svn.debian.org/wsvn/branches/upstream/libmime-encwords-perl/current/README?rev=17755&op=diff
==============================================================================
--- branches/upstream/libmime-encwords-perl/current/README (original)
+++ branches/upstream/libmime-encwords-perl/current/README Mon Mar 17 19:38:28 2008
@@ -6,8 +6,8 @@
     that it will achive more exact conformance with RFC 2047 (former RFC
     1522) specifications. Additionally, it contains some improvements.
     Following synopsis and descriptions are inherited from its inspirer,
-    then added descriptions on improvements (â€¡) or changes and
-    clarifications (â€ ).*
+    then added descriptions on improvements (**) or changes and
+    clarifications (*).*
 
     Before reading further, you should see MIME::Tools to make sure that you
     understand where this module fits into the grand scheme of things. Go
@@ -83,10 +83,10 @@
                 print "", ($_[1] || 'US-ASCII'), ": ", $_[0], "\n";
             }
 
-        â€¡ However, adjacent encoded-words with same charset will be
+        ** However, adjacent encoded-words with same charset will be
         concatenated to handle multibyte sequences safely.
 
-        â€  Whitespaces surrounding unencoded data will not be stripped so
+        * Whitespaces surrounding unencoded data will not be stripped so
         that compatibility with MIME::Words will be ensured.
 
         In a scalar context, joins the "data" elements of the above list
@@ -94,22 +94,23 @@
         and probably *not* what you want, but if you know that all charsets
         in the ENCODED string are identical, it might be useful to you.
         (Before you use this, please see "unmime" in MIME::WordDecoder,
-        which is probably what you want.) â€¡ See also "Charset" option below.
+        which is probably what you want.) ** See also "Charset" option
+        below.
 
         In the event of a syntax error, $@ will be set to a description of
         the error, but parsing will continue as best as possible (so as to
         get *something* back when decoding headers). $@ will be false if no
         error was detected.
 
-        â€  Malformed base64 encoded-words will be kept encoded. In this case
+        * Malformed base64 encoded-words will be kept encoded. In this case
         $@ will be set.
 
         Any arguments past the ENCODED string are taken to define a hash of
-        options. â€¡ When Unicode/multibyte support is disabled (see
+        options. ** When Unicode/multibyte support is disabled (see
         "USE_ENCODE" in MIME::Charset), these options will not have any
         effects.
 
-        Charset â€¡
+        Charset **
             Name of character set by which data elements in scalar context
             will be converted. If this option is specified as special value
             "_UNICODE_", returned value will be Unicode string.
@@ -117,12 +118,14 @@
             Note: This feature is still information-lossy, *except* when
             "_UNICODE_" is specified.
 
-        Detect7bit â€¡
+        Detect7bit **
             Try to detect 7-bit charset on unencoded portions. Default is
-            "YES". Note: This feature was introduced at release 1.000.
-
-        Mapping â€¡
-            NOT YET IMPLEMENTED
+            "YES".
+
+        Mapping **
+            In scalar context, specify mappings actually used for charset
+            names. "EXTENDED" uses extended mappings. "STANDARD" uses
+            standardized strict mappings. Default is "EXTENDED".
 
     encode_mimeword RAW, [ENCODING], [CHARSET]
         *Function.* Encode a single RAW "word" that has unsafe characters.
@@ -131,13 +134,13 @@
             ### Encode "<<Franc,ois>>":
             $encoded = encode_mimeword("\xABFran\xE7ois\xBB");
 
-        You may specify the ENCODING ("Q" or "B"), which defaults to "Q". â€¡
+        You may specify the ENCODING ("Q" or "B"), which defaults to "Q". **
         You may also specify it as ``special'' value: "S" to choose shorter
         one of either "Q" or "B".
 
         You may specify the CHARSET, which defaults to "iso-8859-1".
 
-        â€  Spaces will be escaped with ``_'' by "Q" encoding.
+        * Spaces will be escaped with ``_'' by "Q" encoding.
 
     encode_mimewords RAW, [OPTS]
         *Function.* Given a RAW string, try to find and encode all "unsafe"
@@ -148,11 +151,18 @@
 
         Returns the encoded string.
 
-        â€¡ RAW may be a Unicode string when Unicode/multibyte support is
+        ** RAW may be a Unicode string when Unicode/multibyte support is
         enabled (see "USE_ENCODE" in MIME::Charset). Furthermore, RAW may be
         a reference to that returned by "decode_mimewords" on array context.
         In latter case "Charset" option (see below) will be overridden (see
-        also notes below).
+        also a note below).
+
+        Note: * When RAW is an arrayref, adjacent encoded-words (i.e.
+        elements having non-ASCII charset element) are concatenated. Then
+        they are splitted taking care of character boundaries of multibyte
+        sequences when Unicode/multibyte support is enabled. Portions for
+        unencoded data should include surrounding whitespace(s), or they
+        will be merged into adjoining encoded-word(s).
 
         Any arguments past the RAW string are taken to define a hash of
         options:
@@ -161,7 +171,7 @@
             Encode all unsafe stuff with this charset. Default is
             'ISO-8859-1', a.k.a. "Latin-1".
 
-        Detect7bit â€¡
+        Detect7bit **
             When "Encoding" option (see below) is specified as "a" and
             "Charset" option is unknown, try to detect 7-bit charset on
             given RAW string. Default is "YES". When Unicode/multibyte
@@ -169,22 +179,27 @@
             "USE_ENCODE" in MIME::Charset).
 
         Encoding
-            The encoding to use, "q" or "b". â€¡ You may also specify
+            The encoding to use, "q" or "b". ** You may also specify
             ``special'' values: "a" will automatically choose recommended
             encoding to use (with charset conversion if alternative charset
             is recommended: see MIME::Charset); "s" will choose shorter one
-            of either "q" or "b". NOTE As of release 1.004, The default was
-            changed from "q" (it is also the default on MIME::Words) to "a".
+            of either "q" or "b". NOTE As of release 1.005, The default was
+            changed from "q" (the default on MIME::Words) to "a".
 
         Field
-            Name of the mail field this string will be used in. â€¡ Length of
+            Name of the mail field this string will be used in. ** Length of
             mail field name will be considered in the first line of encoded
             header.
 
-        Mapping â€¡
-            NOT YET IMPLEMENTED
-
-        Minimal â€¡
+        Mapping **
+            Specify mappings actually used for charset names. "EXTENDED"
+            uses extended mappings. "STANDARD" uses standardized strict
+            mappings. Default is "EXTENDED".
+
+        MaxLineLen **
+            Maximum line length excluding newline. The default is 76.
+
+        Minimal **
             Takes care of natural word separators (i.e. whitespaces) in the
             text to be encoded. If "NO" is specified, this module will
             encode whole text (if encoding needed) not regarding
@@ -195,19 +210,11 @@
             ensure compatibility with MIME::Words. On earlier releases, this
             option was fixed to be "NO".
 
-        Replacement â€¡
-            See "ERROR HANDLING" in MIME::Charset. Note: This feature was
-            introduced at release 1.000.
-
-        Notes on improvement by this module: When RAW is an arrayref,
-        adjacent encoded-words are concatenated. Then they are splitted
-        taking care of character boundaries of multibyte sequences, when
-        Unicode/multibyte support is enabled. Portions for unencoded data
-        should include surrounding whitespace(s), or they will be merged
-        into adjoining encoded word(s).
+        Replacement **
+            See "Error Handling" in MIME::Charset.
 
   Configuration Files
-â€¡
+**
     Built-in defaults of option parameters for "decode_mimewords" and
     "encode_mimewords" can be overridden by configuration files:
     MIME/Charset/Defaults.pm and MIME/EncWords/Defaults.pm. For more details