1# Revision history for Perl extension Encode. 2# 3# $Id: Changes,v 2.12 2005/09/08 14:17:17 dankogai Exp dankogai $ 4# 5$Revision: 2.12 $ $Date: 2005/09/08 14:17:17 $ 6! Encode.xs Encode.pm t/fallback.t 7 Now accepts coderef for CHECK! 8! ucm/8859-7.ucm 9 Updated to newer version at unicode.org 10 http://rt.cpan.org/NoAuth/Bug.html?id=14222 11! lib/Encode/Supported.pod 12 More POD typo fixed. 13 <42F5E243.80500@gmail.com> 14! encoding.pm 15 More POD typo leftover fixed. 16 Message-Id: <b77c1dce05080615487f95314@mail.gmail.com> 17 182.11 2005/08/05 10:58:25 19! AUTHORS CHANGES 20 To reflect changes below 21! Encode.pm encoding.pm 22 lib/Encode/Alias.pm lib/Encode/PerlIO.pod lib/Encode/Supported.pod 23 Typo fixed by Piotr Fusik in Change 25261 & 25266 24 Message-ID: <001401c595bd$dccb5d80$0bd34dd5@piec> 25! Encode.xs 26 Addresses "BUG REPORT: panic in Encode.xs". 27 Message-Id: <42EDDA97.2010608@hyper.to> 28+ lib/Encode/MIME/Header/ISO_2022_JP.pm mime_header_iso2022jp.t 29! lib/Encode/MIME/Header.pm lib/Encode/Config.pm 30 Encoding 'MIME-Header-ISO_2022_JP' is introduced by Makamaka 31 Message-Id: <200507311557.j6VFvE2K034605@www231.sakura.ne.jp> 32! Encode/encode.h Encode.pm Encode.xs 33 PerlIO's "encoding(utf-8-strict)" got a problem w/ partial character. 34 Found and addressed by KONNO Hiroharu <hiroharu.konno@bowneglobal.co.jp> 35 See also ext/PerlIO/encoding/encoding.pm 36 Message-Id: <E1DineE-00068X-TB@yok-gs-workman.asia.bgsinternal.com> 37 382.10 2005/05/16 18:46:36 39! Encode.pm 40 fixed decode_utf8() accordingly to RT#8872 41 http://rt.cpan.org/NoAuth/Bug.html?id=8872 42! Encode.xs AUTHORS 43 s/SvIVX/SvIV_set/ by Steve Peters. 44 Message-Id: <2297.67.96.185.36.1114626315.squirrel@webmail3.pair.com> 45! AUTHORS 46 GAAS was missing! 47! Encode.pm 48 New Pod section: "UTF-8 vs utf8"; explains utf-8-strict 49+ t/utf8strict.t 50 Tests utf-8-strict, accordingly to 51 UTF-8 decoder capability and stress test" by Markus Kuhn 52 http://smontagu.damowmow.com/utf8test.html 53 Note that malformed and overlong sequences are not test here 54 because perl already does that for you, utf-8-strict or not. 55! Encode.pm Encode/encode.h t/fallback.t 56 Addressed "encode(..., Encode::LEAVE_SRC) does not work". 57 Now FB_(PERLQQ|HTMLCREF|XMLCREF) implies LEAVE_SRC so 58 you can (en|de)code constant strings with these fallbacks. 59 http://rt.cpan.org/NoAuth/Bug.html?id=8736 60! Encode.pm Encode.xs lib/Encode/Alias.pm t/Aliases.t 61 Make Encode.pm support the real UTF-8, by GAAS 62 Message-Id: <lrfz2mcngd.fsf@caliper.activestate.com> 63 Message-Id: <lr4qizbvvm.fsf@caliper.activestate.com> 64! Encode.pm Encode.xs 65 post-2.09 comment patches from GAAS applied. 66 Message-Id: <lroehacz6q.fsf@caliper.activestate.com> 67 Message-Id: <lrk6rycymu.fsf@caliper.activestate.com> 68 692.09 2004/12/03 19:16:53 70! Encode.pm Encode.xs 71 Addressed " :encoding(utf8) broken in perl-5.8.6". 72 Message-Id: <lrllcfeank.fsf_-_@caliper.activestate.com> 73! Encode.pm 74 Addressed "(de|en)code($valid_encoding, undef) does not warn". 75 http://rt.cpan.org/NoAuth/Bug.html?id=8723 76! Encode.pm t/Encode.t 77 Addressed "Can't encode URI". When a reference is fed to (en|de)code, 78 Encode now stringifies instead of returning undef. 79 http://rt.cpan.org/NoAuth/Bug.html?id=8725 80! Encode.xs t/fallback.t 81 Addressed "FB_HTMLCREF and FB_XMLCREF for the UTF-8 decoder". 82 http://rt.cpan.org/NoAuth/Bug.html?id=8694 83! Encode.pm 84 Addressed "s/digit/number/". 85 http://rt.cpan.org/NoAuth/Bug.html?id=8695 86! Encode.pm 87 Addressed "while (defined(read )) { ... } is an infinite loop". 88 http://rt.cpan.org/NoAuth/Bug.html?id=8696 89! Encode.pm 90 Addressed "What the heck is UCM?". 91 Document fixed so that it no longer contains "UCM-Based Encodings". 92 http://rt.cpan.org/NoAuth/Bug.html?id=8697 93 942.08 2004/10/24 13:00:29 95! Encode.xs lib/Encode/Encoding.pm Unicode/Unicode.{pm,xs} 96 Resolved the issue that was raised by 2.07 -- Encode::utf8 fallbacks 97 that was introduce messed up PerlIO::encoding. 98 * To do so, ->renew() is renewed and ->renewed() was introduced to 99 tell whether the caller is PerlIO or not. 100 Message-Id: <94B2EB12-25B7-11D9-9E6A-000A95DBB50A@dan.co.jp> 101 1022.07 2004/10/22 19:35:52 103! lib/Encode/Encoding.pm 104 "Remove Carp from warnings.pm" that influences Encode, by Tels. 105 Message-Id: <200410161618.29779@bloodgate.com> 106! Encode.xs AUTHORS t/fallback.t 107 Now Encode::utf8's fallbacks are compliant to Encode standard. 108 Thank Bjoern Hoehrmann for persistently convincing me. 109 Message-Id: <41a61aea.638409494@smtp.bjoern.hoehrmann.de> 110! Encode.pm 111 POD further revised. 112 1132.06 2004/10/22 06:23:11 114! ucm/mac* 115 RT #8083 reports that MacThai mapping was obsolete 116 Updated all mac* encodings accordingly to the URI below. 117 One remaining mystery is that MacRomanian vs. MacRumanian. 118 MacRumanian is not found in unicode.org... 119 http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/ 120! Encode.pm t/Encode.t 121 Fixed RT #8081: "decode(..., bless{},'x') segfault" 122 Two more tests added to test that. 123 http://rt.cpan.org/NoAuth/Bug.html?id=8081 124! Encode.pm 125 POD revised accordingly to RT #7966 126 http://rt.cpan.org/NoAuth/Bug.html?id=7966 127! Unicode/Unicode.pm 128 POD updated explaining why Encode::Unicode always croaks on error 129 rather than giving users choices. 130 http://rt.cpan.org/NoAuth/Bug.html?id=7892 131 1322.05 2004/10/19 04:55:01 133! encoding.pm 134 "unnuke" jhi's patch in bleedperl, with minor correction by dankogai. 135 Message-ID: <41210A84.6060506@iki.fi> 136 Message-ID: <20041018233442.7418113f@r2d2> 137 Message-Id: <2BA3DAC4-218A-11D9-906D-000A95DBB50A@dan.co.jp> 138 1392.04 2004/10/16 21:22:44 140! Makefle.PL 141 From: craigberry@mac.com 142 Subject: [PATCH ext/Encode/Makefile.PL] make Encode.c dependency explicit 143 Message-Id: <41716868.7000102@mac.com> 144 1452.03 2004/10/06 05:07:20 146! lib/Encode/Alias.pm 147 Resolved some alias case sensitivity glitches reported via RT. 148 http://rt.cpan.org/NoAuth/Bug.html?id=7835 149! bin/piconv 150 Resolved Win32 glitches reported via RT. 151 (Fixed by dankogai and tested by Steve Hay) 152 http://rt.cpan.org/Ticket/Display.html?id=7831 153! JP/JP.pm lib/Encode/Alias.pm lib/Encode/Supported.pod AUTHORS 154 /\bwindows-31j$/i is now an alias of CP932, by Steve Hay. 155 http://rt.cpan.org/NoAuth/Bug.html?id=6695 156 1572.02 2004/08/31 10:55:34 158! ucm/big5-hkscs.ucm AUTHORS t/big5-hkscs.enc t/big5-hkscs.utf 159 New map submitted by Deng Liu and Autrijus. Test data needed 160 to be upgrade as well, done by dankogai 161 Message-Id: <20040824204828.GB6999@aut.dyndns.org> 162! bin/ucmsort 163 Now works for characters U+10000 and above. This fix was needed 164 to "tidy" the original map that was submitted. 165! bin/enc2xs 166 "ucmsort" now mentioned in pod 167 1682.01 2004/05/25 16:27:14 169! bin/enc2xs AUTHORS 170 From: domo@computer.org 171 Subject: [PATCH] Correct statistics from enc2xs 172 <4AF60A4A-B8BB-11D8-BF99-000A27839BD6@computer.org> 173 <CDEBBD45-B91D-11D8-BF99-000A27839BD6@computer.org> 174! lib/Encode/Alias.pm 175 Addressed "False [] range "\s-" in regex;" in Encode::Alias.pm 176 <200405271148.i4RBm4KY026529@mail.mvnet.de> 177 1782.01 2004/05/25 16:27:14 179! lib/Encode/CN/HZ.pm lib/Encode/Unicode/UTF7.pm 180 "If someone thinks utf8::upgrade($1) should be croaked like 181 chom?p($1),please try the following patch for Encode.pm." 182 -- sadahiro-san 183 <20040522212704.C068.BQW10602@nifty.com> 184 1852.0 2004/05/16 20:55:15 186* version updated to 2.00 187 -- sorry, no big feature change. I just hate version 1.100 :) 188! lib/Encode/Guess.pm 189 Unicode/Unicode.pm 190 addressed UTF-(8|32LE) + BOM misguessing 191 https://rt.cpan.org/Ticket/Display.html?id=6279 192! Encode.pm 193 s/is_utif8/is_utf8/ in POD 194! Encode/lib/Encode/CN/HZ.pm 195 Fixes "make test" failure after the patch to pp_hot.c 196 by Sadahiro-san 197 Message-Id: <20040222182357.6B39.BQW10602@nifty.com> 198! bin/piconv 199 From: autrijus@autrijus.org 200 Subject: [PATCH] "piconv -C 512" badly broken 201 Message-Id: <1072870210.769.5.camel@localhost> 202 2031.99 2003/12/29 02:47:16 204! Unicode/Unicode.xs 205 find_encoding("UTF-16BE")->encode("abc") now null terminates 206 http://www.mail-archive.com/perl5-porters@perl.org/msg69766.html 207! Encode.pm 208 prototype bug in decode_utf8() fixed 209 Message-Id: <600A4CDA-F004-11D7-B570-000393AE4244@dan.co.jp> 210! Encode.pm /MANIFEST encoding.pm lib/Encode/Supported.pod 211 t/at-cn.t t/at-tw.t t/gsm0338.t ucm/gsm0338.ucm 212+ t/gsm0338.t 213 Merged from maintperl@21987 214 2151.98 2003/08/20 11:15:31 216! lib/Encode/MIME/Header.pm AUTHORS t/mime-header.t 217 Dave Evans has found and corrected a bug in Encode::MIME::Header. 218 Test suite added by Dan Kogai. 219 Message-Id: <3F43440B.7060606@rudolf.org.uk> 220! encoding.pm 221 Typo fixes rolled back in from bleedperl 222! t/at-cn.t t/at-tw.t 223 v-strings, now depreciated in perl 5.8.1, is replaced by sadahiro 224 Message-Id: <20030805002313.9880.BQW10602@nifty.com> 225! bin/enc2xs 226 argv case nit for VMS by Craig 227 Message-ID: <3F2B02DE.10207@mac.com> 228! t/enc_eucjp.t t/enc_utf8.t AUTHORS 229 Encode test fixes for VMS by Peter Prymmer 230 Message-ID: <OFBD4A7559.D7CF9517-ON85256D6B.00534853-85256D6B.00538131@factset.com> 231! lib/Encode/Alias.pm t/Aliases.t 232 koi-8 aliases bug detected and patched by sadahiro. 233 Further fix and test suite by dankogai 234 Message-Id: <20030713102228.C76A.BQW10602@nifty.com> 235 2361.97 2003/07/08 21:52:14 237! encoding.pm lib/Encode/Guess.pm lib/Encode/Alias.pm 238 lib/Encode/JP/JIS7.pm lib/Encode/Encoder.pm Encode.pm 239 $DEBUG replaced with DEBUG() so perl optimizes better, 240 by Rafael with further fixes by dankogai 241 Message-Id: <20030705222023.1f24e041.rgarciasuarez@free.fr> 242! lib/Encode/Aliases.pm 243 Was: define_alias( qr/\bGB[-_ ]?2312(?:\D.*$|$)/i => '"euc-cn"' ); 244 Now: define_alias( qr/\bGB[-_ ]?2312(?!-?raw)/i => '"euc-cn"' ); 245 So new hash seeding introduced in bleedperl works. 246 Message-Id: <20030629100937.GD20285@vipunen.hut.fi> 247! lib/Encode/Guess.pm 248 $Encode::Guess::NoUTFAutoGuess is added so you can turn off 249 automatic utf(8|16|32) guessing -- originally by Autrijus 250 Message-Id: <20030626162731.GA2077@not.autrijus.org> 251! Encode.pm 252 Addressed the following; 253 Subject: [perl #22835] FB_QUIET doesn't work with Encode::encode 254 Message-Id: <rt-22835-59975.6.8650775354304@rt.perl.org> 255 2561.96 2003/06/18 09:29:02 257! lib/Encode/JP/JP.pm t/guess.t 258 m/(...)/ in void context then $1 is considered a Bad Thing 259 Message-Id: <B5AB34D0-A019-11D7-AF03-000393AE4244@dan.co.jp> 260! Encode.pm 261 Mentions in POD that as of perl 5.8.1 utf8::is_utf8() is 262 also available. 263! encengine.c 264 More typecast from maintperl@19739 265 Message-Id: <200306110645.h5B6j5D2009640@smtp3.ActiveState.com> 266! t/perlio.t 267 Tests 37 & 38 failed on Win32 -- yet another CRLF issue 268 Message-Id: <200306090733.h597XQPA031646@smtp3.ActiveState.com> 269! t/Encode.t 270 Now skips for EBCDIC platform. 271 Message-Id: <OF44B38062.A3998148-ON80256D27.004CF379@portsmouth.uk.ibm.com> 272! t/perlio.t 273 Craig's patch applied that addresses "Many systems (DOS, VMS) cannot 274 have more than one C<.> in their filenames." -- perlport. 275 Message-Id: <3ED79E01.8050401@mac.com> 276! bin/piconv 277 Found and fixed the back that -p,--perlqq does not work. 278 Induced by the change from Getopt::Std to Getopt::Long. 279! encoding.pm 280 Addressed [cpan #2629] Wrong assumption in numeric comparison 281 Message-Id: <rt-2629-7326.19.5700583232515@cpan.org> 282! Encode.pm Encode.xs Unicode/Unicode.pm Unicode/Unicode.xs 283 lib/Encode/Encoding.pm t/perlio.t 284 ! API Change: ->new_sequence() => ->renew() 285 + Encode::Unicode makes use of it so it can handle BOM on PerlIO 286 + Encode::XS and Encode::utf8 now supports ->renew() 287 + Encode::Encoding now documents this with examples 288 - Non-XS (en|de)code stripped out of Encode::Unicode 289 Message-Id: <146957DB-8C39-11D7-9C91-000393AE4244@dan.co.jp> 290 2911.95 2003/05/21 08:41:11 292! ucm/8859-*.ucm 293 Since bogus entries were found in iso-8859-6, all entries are 294 re-generated once again out of 295 http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-*.TXT 296 Thank David Graff <graff@unagi.cis.upenn.edu> for the discovery 297 Message-Id: <200305201819.h4KIJRRU013746@unagi.cis.upenn.edu> 298+ lib/Encode/Unicode/UTF7.pm 299! lib/Encode/Config.pm lib/Encode/Alias.pm Unicode/Unicode.pm t/Unicode.t 300 lib/Encode/Supported.pod 301 UTF-7 support is now added. With this Encode now has all transcoding 302 methods in Unicode::String. 303 3041.94 2003/05/10 18:13:59 305! lib/Encode/MIME/Header.pm 306 A more sophisticated solution for double-encoding by dankogai 307! lib/Encode/MIME/Header.pm AUTHORS 308 Two bugs fixed by Bjoern Jacke 309 * "Double Encoding" was not possible 310 i.e. encode("MIME-B" => "=?UTF-8?B?w4RwZmVs?=") 311 * encode("MIME-Q") had UTF-8 flag on 312 Message-Id: <rt-22166-57077.2.12980078979811@bugs6.perl.org> 313! lib/Encode/MIME/Header.pm AUTHORS 314 Two occurances of "croak ()" fixed as "croak qq()". 315 Simon Cozens is added to AUTHORS as a result. 316 Message-Id: <20030509103708.GA30664@deep-dark-truthful-mirror.pad> 317! bin/piconv 318 POD fixes that reflect enhancements by jhi 319! bin/piconv 320 Two enhancements by jhi. 321 + Now uses Getopt::Long so it accepts long name options 322 (--from for -f, for example) 323 + New option: -r,--resolve 324 Message-Id: <20030505114149.GA227075@kosh.hut.fi> 325! MANIFEST META.yml 326 META.yml added upon request of Schwern 327 Message-Id: <F3B0BD2C-7BCB-11D7-A488-000393AE4244@dan.co.jp> 328! AUTHORS 329 Enache Adrian removed upon request -- to live longer than Encode 330 and/or FreeBSD (toy-)?thread :) 331 Message-Id: <20030425015701.GA2069@ratsnest.hole> 332! t/enc_module.t 333 "close STDOUT unless $^O eq 'freebsd';" once again relocated 334 to keep VMS happy in which case "$^O eq 'freebsd'" is required 335 to keep FreeBSD+thread happy. Sigh. 336 Message-Id: <3EA88ADC.3000300@mac.com> 337 3381.93 2003/04/24 17:43:16 339! t/enc_eucjp.t 340 added "no warnings 'pack'" in for loop to keep bleedperl from 341 complaining "Character in 'C' format wrapped in pack". 342! Makefile.PL 343 More elegant perl core detection inspired by Ilya Zakharevich 344 (but further elaborated for general cases). 345! lib/Encode/Encoding.pm lib/Encode/PerlIO.pod 346 POD fixes. 347! t/euc-jp.ucm 348 like cp9??, \x80-\x9F (control + 0x80) are zapped so they 349 are less likely to be confused w/ ISO-8859-* 350! t/CJKT.t 351 RT tests added (vendor encodings are exemplified) 352 -- that successfully found a flaw on iso-2022-kr before the patch. 353! lib/Encode/CJKConstants.pm lib/Encode/KR/2022_KR.pm 354 decode("ISO-2022-KR") has been buggy but no one ever sited 355 that since no one seems to be using it. Bugs discovered by 356 SADAHIRO-san 357 Message-Id: <20030416231757.A545.BQW10602@nifty.com> 358! lib/Encode/CN/HZ.pm t/perlio.t 359 HZ is now perlio_ok, thanks to SADAHIRO-san. perlio.t modified 360 so it adds test for HZ. 361 Message-Id: <20030416231757.A545.BQW10602@nifty.com> 362! lib/Encode/Guess.pm 363 Now guesses UTF-(16|32)(BE|LE) when the string contains \x00. 364 So long as the string contains \x{00}-\x{ff} it does not fail. 365 See perldoc for details. 366 Message-Id: <D2F9BB3C-6DC8-11D7-8F19-000393AE4244@dan.co.jp> 367 3681.92 2003/03/31 03:27:27 369! ucm/big5-eten.ucm ucm/big5-hkscs.ucm 370 Extraneous single-byte chars in range \x80-\xA0 and \xFA-\xFF 371 removed. FYI, IBM's ICU has none of these for java-Big5-1.3_P.ucm 372 but glibc-BIG5-2.1.2.ucm does. 373 Message-Id: <20030325215213.4CA1.BQW10602@nifty.com> 374! ucm/cp932.ucm ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm 375 Maps regenerated again but this time based upon 376 http://oss.software.ibm.com/cvs/icu/charset/data/ucm/ 377 (But where is THE DOCUMENT by MICROSOFT?) 378! t/enc_module.t AUTHORS 379 failure with threaded Perl on FreeBSD addressed. 380 Enache Adrian <enache@rdslink.ro> is added to AUTHORS for this. 381 Message-Id: <20030322230131.GA813@ratsnest.hole> 382! lib/Encode/Guess.pm 383 Some POD fixes. 384! t/CJKT.t 385 Change 18989: Make the :bytes conditional on PerlIO. 386 further Modified by Dan Kogai 387 <200303161730.h2GHU5B16265@smtp3.ActiveState.com> 388! t/enc_module.t 389 Chnage 18966: another fix for failing test on windows ("use encoding" 390 puts STDIN in :raw mode, so chomp() wasn't stripping the CR), by gsar 391 Message-Id: <200303140545.h2E5j5B08856@smtp3.ActiveState.com> 392! t/CJKT.t 393 Change 18970: Hopefully this works also in Win32, by jhi 394 Message-Id: <200303140745.h2E7j6B22729@smtp3.ActiveState.com> 395 Change 18965: fix CJKT.t failures on windows due to incorrect 396 binmode(), by gsar 397 Message-Id: <200303140530.h2E5U5B07046@smtp3.ActiveState.com> 398 3991.91 2003/03/09 20:07:37 400! encoding.pm 401 even more proofread by jhi. 402 Message-Id: <20030309194323.GT20843@kosh.hut.fi> 403! t/enc_module.t 404 -use lib 't'; 405 +use lib qw(t ext/Encode/t ../ext/Encode/t); 406 Message-Id: <20030309182057.GR20843@kosh.hut.fi> 407! AUTHORS 408 s/Hirohito/Hiroto/ig; Sorry, Hiroto-san. 409 Message-Id: <20030309181748.GP20843@kosh.hut.fi> 410! encoding.pm 411 s/logner/longer/ 412 Message-Id: <20030309181907.GQ20843@kosh.hut.fi> 413 4141.90 2003/03/09 17:32:43 415! encoding.pm 416+ t/enc_data.t 417 Inaba-san has added a patch for perl 5.8.1 or later that makes 418 encoding.pm work for <DATA> filehandle. t/enc_data.t is to test 419 that. POD is further revised. 420 Message-Id: <200303091515.h29FF6B03903@smtp3.ActiveState.com> 421! encoding.pm t/enc_module.t 422 encoding vs. ${^UNICODE} resolved. POD revised accordingly. 423 Message-Id: <20030306112940.GN20652@kosh.hut.fi> 424 4251.89 2003/02/28 426! Encode.xs 427 signed vs. unsigned issue discovered by Craig on OpenVM 428 Message-Id: <a05200f12ba81fe9d6298@[172.16.52.1]> 429! encoding.pm AUTHORS 430+ t/Mod_EUCJP.pm t/enc_module.enc t/enc_module.t 431 Because binmode() stacks layers instead of overwrite, you have to 432 ":raw :encoding()" in encoding.pm or your are in trouble when you 433 call encoding.pm multiple times. There are several workarounds 434 but Inaba-san's idea is in. SUGAWARA Hajime <sugawara@hdt.co.jp>, 435 who was the first to address this problem was added to AUTHORS. 436 The test suites was added for this, which is a modified version 437 of SUGAWARA-san's scripts 438 Message-Id: <3E5CF695.6AE07852@st.rim.or.jp> 439 4401.88 2003/02/20 14:42:34 441! Encode.xs 442 one signedness nit for Encode by jhi 443 <200302161933.h1GJX876018710@kosh.hut.fi> 444! ucm/viscii.ucm 445 VISCII map was incorrect; fixed by Sadahiro-san 446 Message-Id: <20030216120828.47D3.BQW10602@nifty.com> 447! t/enc_eucjp.t t/enc_utf8.t AUTHORS 448 You can't unlink files that are opened in cygwin but the last 449 file handle opened in t/enc_*.t left open. Patch submitted 450 by Yitzchak and he was added to AUTHORS. 451 Message-Id: <iN0Q+gzkgmZN092yn@efn.org> 452! t/CJKT.t 453 now works with 'LC_ALL=en_US.UTF-8 PERL_UTF8_LOCALE=1' 454 Message-Id: <20030206104513.GA11081@kosh.hut.fi> 455! Unicode/Unicode.xs 456 For 1.88: Unicode.xs =~ s/regog/recog/ -- jhi 457 Message-Id: <20030206045153.GA6826@kosh.hut.fi> 458 4591.87 2003/02/06 01:52:11 460! AUTHORS 461 * Inaba "Sensei" Hirohito added (I thought I have done so a long 462 ago but apparently I did not). 463 * SUZUKI Norio added for verious and useful bug reports. 464! Byte/Byte.pm KR/KR.pm Unicode/Unicode.pm 465 lib/Encode/Encoder.pm lib/Encode/CJKConstants.pm 466 podchecked so all warnings are gone except for L<http://>. 467! encoding.pm t/enc_eucjp.t 468 * t/uni/tr_utf8.t now t ok on maintperl (sorry, jhi) 469 * Filter option overhaul 470 * POD revision 471! Encode.pm Encode.xs encengine.c Encode/encode.h 472 lib/Encode/Encoding.pm lib/Encode/JP/JIS7.pm 473 Merged inaba-san's patch that fixes "use encoding 'shiftjis'" 474 without filter. podchecked by Dan Kogai. 475 Message-Id: <3E3BC46B.6C687CFD@st.rim.or.jp> 476! lib/Encode/Alias.pm 477 decode('alias', $1) went wild because of local $_ in find_alias() 478 the evil local $_ is eradicated but that changes find_alias() 479 format for coderef aliasing. See Encode::Alias for details 480 Message-Id: <200302051704.AA00042@kipp0.nifty.com> 481 4821.86 2003/01/22 03:29:07 483! encoding.pm 484 * Don't forget to canonize when you attempt an exact match! 485 Message-Id: <73E7F801-2DAA-11D7-BF9A-000393AE4244@dan.co.jp> 486 * ${^ENCODING} exception is off for $] > 5.008 487 Message-Id: <20030122110617T.inaba.hiroto@toshiba-it.co.jp> 488! t/enc_utf8.t 489 $] check commented out so it runs on 5.8.0 490 4911.85 2003/01/21 22:19:14 492! encoding.pm 493 ${^ENCODING} exception is now explicit rather than handled by regex. 494+ t/enc_eucjp.t t/enc_utf8.t 495 Test suite for the better "encoding" pragma support for bleedperl. 496 On 5.8.0, they will just be skipped. 497 4981.84 2003/01/10 12:00:16 499! encoding.pm 500 ${^ENCODING} is no longer set for utf so encoding is no longer fun :) 501 (That is to prevent duplicate encoding first by IO then ${^ENCODING}) 502 Message-Id: <20030108213737.GK331043@lyta.hut.fi> 503! Unicode/Unicode.xs 504 %_ fixes saves the resulting .so .05% smaller, by NC 505 Message-Id: <20021226225709.GF284@Bagpuss.unfortu.net> 506! Encode.pm 507 Silence Encode on undef, by Andreas 508 Message-Id: <m3smwrohd1.fsf@k242.linux.bogus> 509 Message-Id: <m3of7fo7np.fsf@k242.linux.bogus> 510! Unicode/Unicode.xs 511 s/regognised/recognised/ . British spelling left intact to pay 512 respect to two British Nicks :) 513 Message-Id: <20021203020454.GK2274@kosh.hut.fi> 514 5151.83 2002/11/18 17:28:49 516! Encode.xs lib/Encode/JIS7.pm 517 Even more patches from Inaba-san has been applied. With this 518 patch t/uni/tr_7jis.t and t/uni/t_utf8.t of bleedperl will work. 519 Message-Id: <20021115105514D.inaba.hiroto@toshiba-it.co.jp> 520 5211.82 2002/11/14 23:06:12 522! Encode.xs 523 Encode::utf8 (XS Version) assertion botch first found in Cygwin, 524 later found in perls w/ -Dusemymalloc was fixed by NC. 525 Message-Id: <20021114210349.GA288@Bagpuss.unfortu.net> 526 5271.81 2002/11/08 18:29:27 528! Encode.pm Encode.xs 529 Non-XS version of Encode::utf8 is back (with XS being default). 530 Encode::predefine_encodings(0) to turn off XS. 531 This is primarily to cope w/ Cygwin smoke but Sadahiro-san has 532 found that it was Test::More causing the problem, not Encode. 533 But I have already made it configurable so it may be useful in 534 some rare cases.... 535 Message-Id: <20021107210110.2EE4.BQW10602@nifty.com>, et al. 536! bin/enc2xs 537 The ingenious patch by Nicholas Clark that reduces shlib sizes by 538 50% with no penalty and backward compatibility preserved, is in. 539 Message-Id: <20021103231324.GE288@Bagpuss.unfortu.net> 540 5411.80 2002/10/21 20:39:09 542! Encode.xs t/mime-header.t 543 Even more patches from NI-XS regarding Encode::utf8->decode(). 544 And one more test to t/mime-header.t to prove it 545 Message-Id: <E183i0Y-0003mo-00@mserv1c.vianw.co.uk> 546 5471.79 2002/10/21 06:05:37 548! Encode.xs 549 Further patches from NI-XS. Encode::utf8->decode() now checks the 550 value of utf8 flag of the argument. As a result, the fix to 551 lib/Encode/MIME/Header.pm is no longer neccessary but since it did 552 no harm (even speedwise) I'll leave it unreverted. 553! ucm/cp949.ucm ucm/cp950.ucm 554 U+20AC EURO SIGN 555 U+00AE REGISTERED SIGN 556 were missing as a result of 1.78. Discovered by Moriyama-san. 557 Moriyama-san has also developed a test script that compares 558 (en|de)coded results to the corresponding Win32 API result and 559 all cp9?? maps are now verified. 560 Message-Id: <20021021025220.3AED.MSYK@mtg.biglobe.ne.jp> 561 5621.78 2002/10/20 15:44:00 563! lib/Encode/MIME/Header.pm 564 fixed so that it works with new Encode::utf8 565! Encode.pm Encode.xs 566 Encode::utf8 is now in Encode.xs by Nick In-XS. This allows 567 :encoding(UTF-8) to handle partial chars at end of buffers 568 correctly. 569 Message-Id: <20021020134935.2079.3@bactrian.ni-s.u-net.com> 570! lib/Encode/Supported.pod 571 More nitpickings applied. 572+ t/rt.pl MANIFEST 573! t/CJKT.t 574 Moriyama-san has discovered a serious bug in t/CJKT.t; its roundtrip 575 tests were completely useless. To redeem that and get the peace of 576 mind again, I wrote t/rt.pl to test ALL '|0' ENTRIES in all 577 ucm/*.ucm Since this script takes too long to finish (30 seconds on 578 PIII-800MHz, FreeBSD), it is deliberately excluded from 'make test' 579 but you can easily run that by either renaming it or: 580 perl -Mblib t/rt.pl 581 Message-Id: <20021019065420.0C48.MSYK@mtg.biglobe.ne.jp> 582! ucm/cp936.ucm ucm/cp949.ucm ucm/cp950.ucm 583 Other CJKT cp9?? also updated according to the URI below; 584 http://www.microsoft.com/typography/unicode/cscp.htm 585+ bin/ucmsort MANIFEST 586 ucmsort is a crude utility that sorts CHARMAP entries in UCM files 587 to proper order. intended for hardcore develpers only. 588! ucm/cp932.ucm JP/JP.pm AUTHORS 589 CP932 mapping which was based upon the mapping file at unicode.org 590 was found obsolete by MORIYAMA Masayuki msyk@mtg.biglobe.ne.jp>. He 591 has also supplied the patch so he was added to AUTHORS. 592! lib/Encode/Supported.pod 593 ISO-8859-11 != TIS 620 594 == TIS 620 + \xA0 ( ) 595 Message-Id: 596 <DC504E9C3384054C8506D3E6BB012460810D23@bsebe001.americas.nokia.com> 597 5981.77 2002/10/06 03:27:02 599! t/jperl.t 600 * Modified to accomodate up and comming patch by Inaba-san that 601 will fix tr/// needing eval qq{} 602 Message-Id: <9F78A19C-D6C3-11D6-BAC6-0003939A104C@dan.co.jp> 603! encoding.pm 604 * pod fixes/enhancements to reflect the changes above 605! lib/Encode/Alias.pm 606 "Encode::TW is correct, Encode::Alias not." - /Autrijus/ 607 Message-Id: <20021001015648.GB18710@not.autrijus.org> 608 6091.76 2002/08/25 15:09:51 610! t/big5-eten.utf 611 To reflect ucm change by Autrijus. t/big5-eten.enc was regenerated 612 but naturally identical to previous version -- dankogai 613! ucm/big5-eten.ucm 614 Codepoint fixes -- autrijus 615 Message-Id: <20020805040236.GC5220@not.autrijus.org> 616= * 617 copied everything under perl-5.8.0/ext/Encode to make sure Encode 618 is in sync w/ perl core 619! t/CJKT.t t/guess.t 620 Change 17175 by jhi@alpha on 2002/06/10 23:24:42 621 Now that binmode(FH) does implicit ":bytes" revisit 622 the failing tests. The worrisome one is the Digest::MD5 623 test-- how will it fare in CRLF lands now? 624! t/CJKT.t t/guess.t 625 From: Radu Greab <radu@netsoft.ro> 626 Date: Mon, 10 Jun 2002 00:40:34 +0300 627 Message-Id: <200206092140.g59LeYn15745@ix.netsoft.ro> 628 Fixes for en_US.UTF-8 failures, all but ext/PerlIO/t/fallback.t 629 ones which I cannot figure out. 630! lib/Encode/Alias.pm 631 Subject: [Encode PATCH] spurious warning 632 From: Nicholas Clark <nick@unfortu.net> 633 Date: Sun, 2 Jun 2002 20:26:22 +0100 634 Message-ID: <20020602192619.GA320@Bagpuss.unfortu.net> 635 6361.75 2002/06/01 18:07:49 637! lib/Encode/Alias.pm t/Alias.t lib/Encode/Supported.pod TW/TW.pm 638 glibc compliance cited by Autrijus. 639 http://www.li18nux.org/docs/html/CodesetAliasTable-V10.html 640! bin/enc2xs bin/piconv 641 Subject: Re: forewarning: usedevel and versiononly 642 Message-Id: <20020529081515.D570.H.M.BRAND@hccnet.nl> 643 6441.74 2002/05/28 18:33:15 645+ ucm/null.ucm ucm/ctrl.ucm 646! Makefile.PL bin/enc2xs lib/Encode/Supported.pod 647 "null" and "ascii-ctrl" encodings added upon the request of Autrijus 648 Subject: Re: unicode -> &# notation 649 Message-ID: <20020518193704.GB40272@not.autrijus.org> 650 6511.73 2002/05/28 17:26:18 652! */Makefile.PL Makefile.PL bin/enc2xs Encode/Makefile_PL.e2x AUTHORS 653 Chris Nandor has fixed Encode so that it works w/ MacPerl -- 654 at least w/ PPC (68k need static linking which does not work due to 655 64k limit). pudge is added to AUTHORS (I'm surprised he was not 656 there in the list). Encode/Makefile_PL.e2x was additionally fixed 657 by dankogai to reflect changes in other Makefile.PL 658 Message-Id: <p0510030ab9195ed230ff@[10.0.1.107]> 659! t/mime-header.t 660 Subject: Change 16746: -Mutf8 cleanup. 661 Message-Id: <200205222345.g4MNj7e10597@smtp3.ActiveState.com> 662 6631.72 2002/05/20 15:49:56 664! Makefile.PL 665 Subject: [PATCH] Encode should be in perl-core library path 666 Message-Id: <86r8k7h738.wl@mail.edge.co.jp> 667 Message-Id: <20020520161201.A11019@alpha.hut.fi> 668! lib/Encode/MIME/Header.pm 669 Subject: [PATCH] Encode::MIME::Header 670 Message-Id: <86sn4nh7a8.wl@mail.edge.co.jp> 671! Encode/Makefile_PL.e2x 672 Subject: [PATCH] Make Makefile_PL.e2x happy on MSWin32 673 Message-Id: <20020519201031.GA1603@not.autrijus.org> 674! CN/Makefile.PL Byte/Makefile.PL JP/Makefile.PL TW/Makefile.PL 675 Symbol/Makefile.PL KR/Makefile.PL EBCDIC/Makefile.PL Makefile.PL 676 AUTHORS 677 @16628 and @16652 from Vadim. Vadim was added to AUTHORS. 678 Subject: [PATCH] good day for WinCE port of perl. 679 Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad> 680! Encode.xs 681! Unicode/Unicode.xs 682 Even more linting by Robin via @16532 683! Encode.xs 684 Even more typecast by Sarathy in @16460 685 6861.71 2002/05/07 16:22:42 687! Encode.xs 688 even more typecasts by Robin 689 Message-Id: <200205071513.QAA05846@tempest.npl.co.uk> 690! bin/enc2xs 691 A very strange bug that was causing a bugus ucm -> C table 692 generation that was revealed by a UCM file that Andreas was 693 working. This is the king of wierdest bug I've encountered 694 in the course of Encode maintenance. 695 Message-Id: <6C04F0FA-61D4-11D6-B164-00039301D480@dan.co.jp> 696 6971.70 2002/05/06 10:26:48 698! encoding.pm 699 Made more 'module-safe' with conjunction w/ 'no encoding'. 700 Message-Id: <EAB48C16-60DA-11D6-9982-00039301D480@dan.co.jp> 701! lib/Encode/Encoding.pm 702 'require Encode' because ->Define uses Encode::define_encoding(); 703 problem and solution addressed by Miyagawa-kun 704 Message-Id: <86znzdfvuh.wl@mail.edge.co.jp> 705! t/Unicode.t 706 Cuts the frill to make djgpp happier, as suggested by Laszlo 707 Message-Id: <20020506105819.H17012@libra.eth.ericsson.se> 708! bin/enc2xs 709 enc2xs no longer overwrites files w/ -M option, as suggested by Andreas 710 Message-Id: <m3bsbug48n.fsf@anima.de> 711 7121.69 2002/05/04 16:41:18 713! lib/Encode/MIME/Header 714 Floating-point coerced for UNICOS (in integer arithmetics it folds 715 line one character too early). Verification by Mark is pending. 716 Message-Id: <C670F60D-5F4F-11D6-A5CA-00039301D480@dan.co.jp> 717! Unicode/Unicode.pm 718 more doc patch from Elizabeth 719 Message-Id: <4.2.0.58.20020503210946.02f4ed30@mickey.dijkmat.nl> 720! Encode/Makefile_PL.e2x 721 More platform-independent patch from Benjamin 722 Message-Id: <3CD31BE0.69F79B06@earthlink.net> 723! lib/Encode/Guess AUTHORS 724 split regex fix by Graham Barr. Adds him to AUTHORS. 725 Message-Id: <20020504085419.E95940@valueclick.com> 726! Encode/Makefile_PL.e2x 727 enc2xs script discovery made smarter and more sensible, first cited 728 by Miyagawa-kun and further suggestions by Rafael and Andreas 729! Encode.pm lib/Encode/Guess.pm t/fallback.t t/guess.t t/mime-header.t 730 "The EBCDIC remapping of the low 256 bites again" #16372 by jhi 731 7321.68 2002/05/03 12:20:13 733! lib/Encode/Alias.pm lib/Encode/Supported.pod t/Alias.t AUTHORS 734 UCS-4 added to aliases of UTF-32 by Elizabeth Mattijsen. Alias.t 735 and Supported.pod modified to reflect the change. Elizabeth added 736 to Authors. And H.M. is also added for forwarding her patch among 737 other contributions (I was rather surprised to find his name was not 738 there yet!) 739 Message-Id: <20020503114901.D639.H.M.BRAND@hccnet.nl> 740 7411.67 2002/05/02 07:33:09 742! Encode.xs 743 Error message now consistent w/ perlqq (\N{U+} -> \x{}) 744 done in perl@16308 but Philip linted me further. Now the error 745 messages are macronized as ERR_ENCODE_NOMAP and ERR_DECODE_NOMAP 746! lib/Encode/Guess.pm 747 Sanity check for happier -w by Autrijus 748 7491.66 2002/05/01 05:41:06 750! Encode.xs t/fallback.t 751 WARN_ON_ERR no longer assumes RETURN_ON_ERR so you can issue a warning 752 while fallback is in effect. This even came with a welcome side-effect 753 of cleaner code with less nests! Thank you, NI-XS. t/fallback.t is 754 also modified to test this. 755 And of course, the corresponding varialbles to UV[Xx]f are appropriately 756 cast. This should've concluded NI-XS homework. 757! Encode.pm 758 encode(undef) does warn again! Repented upon suggestion by NI-XS. 759 Document for unless vs. '' added 760 Message-Id: <20020430171547.3322.13@bactrian.elixent.com> 761 7621.65 2002/04/30 16:13:37 763! Encode.pm 764 encode(undef) no longer warns for C<Use of uninitialized value in 765 subroutine entry>. Suggested by Paul. 766 Message-Id: <AIEAJICLCBDNAAOLLOKLMEEEEJAA.Paul.Marquess@ntlworld.com> 767! lib/Encode/Supported.pod 768 Encode::MIME::Header and Encode::Guess mentioned 769 Updated for Encode::HanExtra 0.05 and Encode::JIS2K 770! lib/Encode/Guess.pm 771 POD fix by Miyagawa-kun 772 Message-Id: <86k7qqx8p7.wl@mail.edge.co.jp> 773 7741.64 2002/04/29 06:54:06 775! ucm/euc-jp.ucm 776 Now decodes euc-jisx0213 also. CAVEAT: encode("euc-jp"...) and 777 encocde("euc-jisx0213") are still DIFFERENT. 778 Message-Id: <A5DFA5CA-5B3C-11D6-A54F-00039301D480@dan.co.jp> 779! Encode.xs 780 A few white spaces corrected by NI-XS via PerlIO integration to 781 Mainline 782 Subject: Change 16247: Integrate perlio; 783! Encode.pm 784 Document fixes by Andreas 785 Message-Id: <m3k7qsf1we.fsf@anima.de> 786 7871.63 2002/04/27 18:59:50 788! lib/Encode/Encoding.pm 789! Encoding.pm Unicode/Unicode.pm lib/Encode/Guess.pm lib/Encode/CN/HZ.pm 790! lib/Encode/JP/JIS7.pm lib/Encode/MIME/Header.pm lib/Encode/KR/2022_KR.pm 791 Make use of the Encode::Encoding base class! 792 And other cleanups in Encode.xs upon NI-XS suggestions 793 Message-Id: <20020427160718.1290.15@bactrian.ni-s.u-net.com> 794 7951.62 2002/04/27 11:17:39 796! Encode.pm 797 encodings() now just check %ExtModule instead of eval{require} 798 all of them for ":all" to conserve more memory. 799! Encode.xs 800 more "%x" -> "%" UVxf stuff. 801! Encode.pm 802 s/=over2/=over 2/g # oops. 803 8041.61 2002/04/26 03:02:04 805! t/mime-header.t 806 Now does decent tests besides use_ok() 807! lib/Encode/Guess.pm t/guess.t 808 UI streamlined, document added 809! Unicode/Unicode.xs 810 various signed/unsigned mismatch nits (#16173) 811 http://public.activestate.com/cgi-bin/perlbrowse?patch=16173 812! Encode.pm 813 POD: utf8-flag-related caveats added. A few sections completely 814 rewritten. 815! Encode.xs 816! AUTHORS 817 Thou shalt not assume %d works, either! 818 Robin Baker added to AUTHORS for this 819 Message-Id: <200204251132.MAA28237@tempest.npl.co.uk> 820! t/CJKT.t 821 "Change 16144 by gsar@onru on 2002/04/24 18:59:05" 822 8231.60 2002/04/24 20:06:52 824! Encode.xs 825 "Thou shalt not assume %x works." -- jhi 826 Message-Id: <20020424210618.E24347@alpha.hut.fi> 827! CN/Makefile.PL JP/Makefile.PL KR/Makefile.PL TW/Makefile.PL To make 828 low-memory build machines happy, now *.c is created for each *.ucm 829 (no table aggregation). You can still override this by setting 830 $ENV{AGGREGATE_TABLES}. 831 Message-Id: <00B1B3E4-579F-11D6-A441-00039301D480@dan.co.jp> 832+ lib/Encode/Guess.pm 833+ lib/Encode/JP/JIS7.pm 834 Encoding-autodetect (mainly for Japanese encoding) added. In a 835 course of development, JIS7.pm was improved. 836+ lib/Encode/HTML/Header.pm 837+ lib/Encode/Config.pm 838 MIME B/Q Header Encoding Added! 839! Encode.pm Encode.xs t/fallback.t 840 new fallbacks; XMLCREF and HTMLCREF upon Bart's request. 841 Message-Id: <20020424130709.GA14211@tanglefoot> 842 8431.59 $ 2002/04/22 23:54:22 844! Encode.pm Encode.xs 845 needs_lines() and perlio_ok() are added to Internal encodings such 846 as utf8 so XML::SAX is happy. FB_* stub xsubs are now prototyped. 847 8481.58 2002/04/22 23:54:22 849! TW/TW.pm 850 s/MacChineseSimp/MacChineseTrad/ # ... oops. 851! bin/ucm2text 852! t/*.t 853- t/*.euc t/*.ref 854+ t/*.enc t/*.utf 855 Now all CJKT encodings go thru round-trip test via t/CJKT.t. 856 t/(CN|TW).t by Autrijus are renamed at-(cn|tw).t 857 t/(JP|KR).t are aggregated to t/CJKT.t 858 test data are all remade via bin/ucm2text. 859 And .... They are no longer skipped for -Uuseperlio ! 860 8611.57 2002/04/22 20:27:30 862! t/JP.t t/KR.t t/perlio.t 863 unless (find PerlIO::Layer 'perlio') ... line is back again. 864 t/JP.t and t/KR.t were supposed to work but maybe '>:utf8' lines 865 need PerlIO. Sigh.... 866! Encode.xs Unicode/Unicode.pm lib/Encode/JP/JIS7.pm t/perlio.t 867 ->perlio_ok now does eval{ require PerlIO::encoding } there so 868 it correctly returns 1 when PerlIO::encoding is yet loaded. 869! Encode.xs 870 perl-current patch #16072 reflected 871 8721.56 2002/04/22 09:48:07 873! Encode.pm encoding.pm t/perlio.t t/jperl.t 874 New PerlIO::encoding 0.04 compliance met 875 8761.55 2002/04/22 03:43:05 877! Encode.pm Encode.xs Unicode/Unicode.pm 878 needs_lines() defined so Encode::Encoding is no longer needed 879 for perlio 880 8811.54 2002/04/22 02:50:01 882! Encode.pm! Encode.xs! Unicode/Unicode.pm t/perlio.t 883! lib/Encode/Encoding.pm lib/Encode/CN/HZ.pm 884 now perlio_ok is true by default if PerlIO::encoding->VERSION is 885 0.03 or larger. POD in Encode::Encoding revised to reflect this. 886 Encode::XS and Encode::Unicode now has perlio_ok() method. 887! lib/Encode/Supported.pod 888 s/UP-UX/HP-UX/ by jhi 889! AUTHORS Byte/Byte.pm CN/CN.pm Encode.pm JP/JP.pm KR/KR.pm README 890! Symbol/Symbol.pm TW/TW.pm Unicode/Unicode.pm bin/enc2xs bin/piconv 891! bin/ucmlint encoding.pm lib/Encode/Alias.pm lib/Encode/CN/HZ.pm 892! lib/Encode/Config.pm lib/Encode/Encoder.pm lib/Encode/Encoding.pm 893! lib/Encode/KR/2022_KR.pm lib/Encode/PerlIO.pod 894! lib/Encode/Supported.pod 895 Huge document fixes by Philip. 896! AUTHORS 897! t/JP.t 898 s/compare\(/compare_text\(/o by Sarathy. Adds him to AUTHORS 899 http://public.activestate.com/cgi-bin/perlbrowse?patch=16049 900! t/perlio.t 901 binmode() after "<:encoding" to make Win32 happy, by Mattia. 902 Mattia added to AUTHORS file 903 Message-Id: <3CC3150F.5798.22A05AE@localhost> 904 9051.52 2002/04/20 23:43:47 906! t/perlio.t 907 TODO: is now SKIP:, as NI-XS requested. Also adds more 908 eraborate failure analysis added. 909! bin/enc2xs 910 A note on how to make sure of round-trip safety added to POD 911 section (so Autrijus is happier) 912! ucm/big5-hkscs.ucm ucm/big5-eten.ucm t/TW.pm 913 big5-(eten|hkscs) is round-trip safe again! 914 Message-Id: <A2C949CC-54AC-11D6-A5FB-00039301D480@dan.co.jp> 915! encoding.pm 916 Typo fixes by Andreas 917! Encode.pm Encode.xs Unicode/Unicode.xs Encode/Encoding.pm 918! lib/Encode/JP/JIS7.pm lib/Encode/KR/2022_KR.pm t/perlio.t 919 PerIO coodination patches from NI-XS. 920 Message-Id: <2769E572-54A1-11D6-B7E2-00039301D480@dan.co.jp> 921 9221.51 2002/04/20 09:58:23 923! t/TW.t 924 Updated test suite by Autrijis so "make test" is happy again 925 Message-Id: <20020420082104.GA25037@not.autrijus.org> 926+ ucm/big5-eten.ucm 927! ucm/big5-hkscs.ucm lib/Encode/Alias.pm 928- ucm/big5.ucm 929 TW/TW.pm TW/Makefile.PL 930 Updates by Autrijus. 'big5' is no longer a canonical but an 931 alias to 'big5-eten'. big5-hkscs is now in 2001 edition. 932 Message-Id: <20020419195346.GA19597@not.autrijus.org> 933! Encode.xs 934 Fix by NI-XS that fallback may cause SEGV w/ Perl/TK 935 Message-Id: <20020419184509.1924.1@bactrian.ni-s.u-net.com> 936! Encode.pm 937 PerlIO detection a little bit smarter; no longer uses eval qq{} 938 but eval {}. 939 9401.50 2002/04/19 06:13:02 941! ! Encode.pm Encode.xs Encode/encoding.h 942+ t/fallback.pm 943 New Fallback API imlemented and documented. See "perldoc Encode" 944 for details 945! lib/Encode/JP/JIS7.pm Encode.pm 946+ lib/Encode/PerlIO.pod t/perlio.t 947 API compliance met. However, it still does not work unless perlio 948 implements line buffer. See BUGS section in perldoc Encode::PerlIO 949 As a sensible workaround, perlio_ok() added to Encode. 950! encoding.pm 951! lib/Encode/Supported.pod 952 Doc fixes from jhi 953 Message-Id: <20020418174647.J8466@alpha.hut.fi> 954! CN/CN.pm 955 Doc fixes from Autrijus 956 Message-Id: <20020418144131.GA10987@not.autrijus.org> 957! Encode.pm 958 perlqq mode documented 959! t/JP.t 960+ t/jisx0201.euc t/jisx0201.ref 961! t/jisx0208.euc t/jisx0208.ref 962 t/JP.t tests more rigorously and with other encodings 963 t/jisx0201.* added to test JIS7 encodings. jisx0208 is now PURELY 964 in jis0208 (used to contain jisx0201 part). 965! Encode/Makefile_PL.e2x 966 The resulting Makefile.PL that "enc2xs -M" creates now auto-discovers 967 enc2xs and encode.h rather than hard-coded. This allows the resulting 968 module fully CPANizable. 969! encoding.pm t/JP.t t/KR.t 970 PerlIO detection simplified (checks %INC instead of eval{}) 971! Encode.xs Encode/encode.h 972+ Unicode/Makefile.PL Unicode/Unicode.pm Unicode/Unicode.xs 973- lib/Encode/Unicode.pm 974 (en|de)code_xs relocated to where it belongs. Source reindented 975 to my taste 976! bin/enc2xs 977 Additional (U8 *) cast added as suggested by jhi 978 Message-Id: <20020417165916.A28599@alpha.hut.fi> 979 9801.42 Date: 2002/04/17 981- lib/Encode/XS.pm 982 no-op module; Thought of adding a pod there but enc2xs has 983 one so gone. 984! encoding.pm 985! t/JP.pm 986! t/KR.pm 987 correct mechanism to detect Perlio::encoding layar installed. 988! Encode.xs 989 PerlIO Layer detached. 990 9911.41 2002/04/16 23:35:00 992! encoding.pm 993 binmode(STDIN|STDOUT ...) done iff PerlIO is available 994! t/*.t 995 Cleaned up PerlIO skip conditions to prepare for the upcoming 996 Encode - PerlIO forking. 997! Encode.pm 998 exported functions are now prototyped. 999! lib/Encode/CN/HZ.pm 1000! bin/enc2xs 1001! Encode.xs 1002 fallback implemented # was /* FIXME */ 1003 affected programs revised to fit (only HZ was using the try-catch 1004 approach which needed to be fixed for API-compliance). 1005! Encode/Config.pm 1006! Encode/KR/2022_KR.pm 1007! Encode/KR/KR.pm 1008 can find =head1 NAME now, jhi 1009 Message-Id: <20020416083059.V30639@alpha.hut.fi> 1010! encoding.pm 1011 s/\{h\}/{$h}/g ;) 1012! Encode.xs 1013 now complies with less warnings with the pickest compilers. 1014 Suggested by Craig, fixed by Dan. 1015 ! Encode/Makefile_PL.e2x 1016! bin/enc2xs 1017 A bug that fails to find *.e2x in certain conditions fixed 1018 10191.40 2002/04/14 22:27:14 1020+ Encode/ConfigLocal_PM.e2x 1021! lib/Encode/Config.pm 1022! bin/enc2xs 1023 "enc2xs -C" now generates/updates Encode::ConfigLocal. 1024 ConfigLocal_PM.e2x is a skelton thereof. 1025! lib/Encode/Config.pm 1026! CN/CN.pm 1027 "use Encode::CN::HZ;" was missing. 1028! t/Unicode.t 1029! t/unibench.t 1030 More rigorous tests added to test XS, especially on memory allocation. 1031! Encode.xs 1032! lib/Encode/Unicode.pm 1033 NI-S implemented an XS version -- merged 1034 Message-Id: <20020414154857.2066.4@bactrian.ni-s.u-net.com> 1035! encoding.pm 1036! t/jperl.t 1037 Source filter option added. With this option on, you can write 1038 perl 5.8-savvy scripts (such as UTF-8 identifiers) in legacy 1039 encodings. t/jperl.t enhanced to test this feature. 1040! t/Unicode.t 1041 ok() gotcha addressed by Benjamin fixed. Though I didn't exactly 1042 apply his suggestion, this degree of nitting is enough to add him 1043 to AUTHORS list. 1044 Message-Id: <3CB93223.291E5E2E@earthlink.net> 1045! JP/JP.pm 1046+ lib/Encode/JP/JIS7.pm 1047- lib/Encode/JP/JIS.pm 1048- lib/Encode/JP/2022_JP.pm 1049- lib/Encode/JP/2022_JP1.pm 1050 7bit-jis, iso-2022-jp and iso-2022-jp1 are all aggregated to 1051 JIS7.pm for better maintainability and performance 1052! encoding.pm 1053 Added caveat for non-ascii identifiers. 1054! encoding.pm 1055 fixes by jhi, the original author of this pragramtic module. 1056 Message-Id: <20020413231527.V1826@alpha.hut.fi> 1057 10581.34 2002/04/12 20:23:05 (Unreleased) 1059! Encode.pm 1060! t/Unicode.t 1061 EBCDIC fixes addressed by jhi. 1062 Message-Id: <20020412161844.D9383@alpha.hut.fi> 1063! lib/Encode/Encoder.pm 1064 POD fix by Miyagawa-kun 1065 Message-Id: <86bscqq4hu.wl@mail.edge.co.jp> 1066 10671.33 2002/04/10 22:28:40 1068! AUTHORS 1069 Philip's mail address corrected. 1070! AUTHORS 1071! t/Encoder.t 1072! lib/Encode/Encoder.pm 1073 s/ = shift;/ = @_;/ # trivial but a common idiomatic typo :) 1074 This adds Miyagawa-kun to AUTHORS. 1075 * encoding() no longer exported by default but on demand 1076 * t/Encoder.t updated to test all these 1077 Message-Id: <86hemjpdn4.wl@mail.edge.co.jp> 1078! lib/Encode/Unicode.pm 1079! lib/Encode/Supported.pm 1080 Further doc fixes by Anton 1081 10821.32 2002/04/09 20:06:15 1083+ bin/ucmlint 1084+ t/bogus.ucm 1085- ucm/macDevanaga.ucm Unicode Character Map 1086- ucm/macGujarati.ucm Unicode Character Map 1087- ucm/macGurmukhi.ucm Unicode Character Map 1088 A utility to check integrity of .ucm files. t/bogus.ucm is a 1089 ucm that is deliberately bogus. unused Indic mappings are removed 1090 for the time being. 1091! Encode.pm 1092 resolve_alias() added as suggested by jhi. Same as 1093 find_encoding("alias")->name. For convenience. This one is 1094 defined in Encode.pm instead of Alias.pm. 1095 Message-Id: <20020409215846.H17022@alpha.hut.fi> 1096! Encode.xs 1097 Memory Allocate but detected during the devel of ucmlint -- fixed. 1098 Message-Id: <C0DDCE16-4BE7-11D6-9204-00039301D480@dan.co.jp> 1099! lib/Encode/Unicode.pm 1100 valid_ucs2(0) is false but must be true. 1101 3 patches from NI-S as follows. This also has fixed the incident 1102 Andy has reported. 1103! lib/Encode/Alias.pm 1104 find_alias() recursion prevention 1105! t/Aliases.t 1106 Checks for the patch above 1107! t/Encode/Unicode.pm 1108 An extra "F" that causes valid_ucs2() return a bogus value fixed 1109 Message-Id: <20020409133927.17803.1@bactrian.elixent.com> 1110 Message-Id: <Pine.SOL.4.10.10204091338220.10390-100000@maxwell.phys.lafayette.edu> 1111 2 Small Patches from jhi as follows: 1112! Encode.pm 1113 Encode->encodings() lists in case-insensitve order (as it was) 1114! bin/piconv 1115 -l option prints avaiable encodings to STDOUT instead of STDERR 1116! lib/Encode/Aliases.pm 1117 s/defintion/definition/ 1118 Message-Id: <200204082306.CAA21033@alpha.hut.fi> 1119! AUTHORS 1120! lib/Encode/Supported.pod 1121! lib/Encode/Unicode.pm 1122 POD revise by Philip Newton. This adds Philip to AUTHORS list. 1123 Thank you for the exact quote of Douglas Adams :) 1124 Message-Id: <22s3bu4gpvhhsses64nj3afuu0lo927rv3@4ax.com> 1125 11261.31 2002/04/08 18:08:07 1127! lib/Encode/Encoder.pm 1128+ t/Encoder.t 1129 Encode::Encoder, once just a placeholder of an idea, is now much more 1130 practical. See t/Encode.t to find how practical it can be. 1131+ lib/Encode/Config.pm 1132! Encode.pm 1133 my false laziness at Encode.pm is fixed. Now %ExtModules are set 1134 in Encode::Config and they are all literally, not programatically 1135 set. My false laziness was resulting many encodings missing from 1136 %ExtModules. 1137! lib/Encode/Unicode.pm 1138! t/Unicode.t 1139 BOM for 32LE was bogus as noted by Anton. t/Unicode.t is fixed 1140 so that it does not rely Encode::Unicode for BOM values 1141 Message-Id: <FFEC33E9-4AFB-11D6-B415-00039301D480@dan.co.jp> 1142 11431.30 2002/04/08 02:34:51 1144+ lib/Encode/Encoder.pm 1145 Object Oriented Encoder. I reckon something like this is in need. 1146! Encode.pm 1147! t/Unicode.pm 1148! lib/Encode/Supported.pod 1149 * autoloading bug that prevented upper-case canonicals such as UTF-16 1150 is fixed. Now even UTF/UCS are autoloaded! 1151 * encodings() is now more intuitive. 1152 * t/Unicode.t fixed to explicitly use Unicode.pm -- BOM values are 1153 stored therein. 1154 * Obligatory fixes to the POD. 1155! lib/Encode/Supported.pod 1156 Patch from Anton applied. 1157 Message-Id: <66641479.20020408033300@motor.ru> 1158! Encode.pm 1159! lib/Encode/Unicode.pm 1160 Cosmetic changes: "bless $obj, $class" => "bless $obj => class" 1161 11621.28 2002/04/07 18:58:42 1163! MANIFEST 1164+ t/Unicode.t 1165+ t/grow.t 1166 Just a MANIFEST for those missing files. 1167 11681.26 Date: 2002/04/07 15:22:04 1169! JP/Makefile.PL 1170! t/Aliases.PL 1171 Schwarn's patches against Makefile.PL has zapped jis*.ucm. Restored. 1172 And t/Aliases.t fixed to make sure they all exist. 1173 11741.25 2002/04/07 15:01:25 (Unreleased) 1175! Encode.pm 1176! lib/Encode/Unicode.pm 1177 More POD fixes.... 1178! Encode.pm 1179- lib/Encode/UTF_EBCDIC.pm 1180- lib/Encode/Internal.pm 1181- lib/Encode/utf8.pm 1182 Integrated into Encode.pm as closures. That way "one package, one file" 1183 rule is preserved yet less files to require. 1184! encoding.pm 1185 commented out binmode(STDERR ... 1186! Makefile.PL 1187! Byte/Makefile.PL 1188! CN/Makefile.PL 1189! EBCDIC/Makefile.PL 1190! JP/Makefile.PL 1191! KR/Makefile.PL 1192! Symbol/Makefile.PL 1193! TW/Makefile.PL 1194! Encode/Makefile_PL.e2x 1195 Schwarn's MM-compliance patch merged 1196 Message-Id: <20020406082609.GA28758@blackrider> 1197! Encode.pm 1198! lib/Encode/Unicode.pm 1199+ lib/Encode/UTF_EBCDIC.pm 1200+ t/Unicode.t 1201- lib/Encode/10646_1.pm 1202- lib/Encode/ucs2_le.pm 1203 (UCS-2|UTF-(16|32))(LE|BE)? implementation and cleanups. Instead of 1204 per-module based (en|de)code, I saved a number of .pm by 1205 reorganizing it as per-object base (Well, this is what Encode::XS 1206 does under the hood). See Encode::Unicode for details. 1207 The original Unicode.pm is now correctly renamed to UTF_EBCDIC.pm. 1208 This module is used only on EBCDIC environments. 1209 12101.21 2002/04/05 14:46:34 (Not Released) 1211! JP/JP.pm 1212! Encode.pm 1213+ ucm/jis0201.ucm 1214+ ucm/jis0208.ucm 1215+ ucm/jis0212.ucm 1216 Are back to make Perl/Tk happy Smile, NI-S. 1217! t/Alias.pm 1218! lib/Encode/Alias.pm 1219! lib/Encode/Supported.pm 1220! lib/Encode/10646_1.pm 1221! lib/Encode/ucs2_le.pm 1222 UCS-16BE is now canonical for UCS-2/ISO-10646-1. 1223 Leftover implicit aliases in ucs2_le.pm removed. Tests and documents 1224 updated to reflect changes. 1225 essage-Id: <20020405114024.1290.17@bactrian.ni-s.u-net.com> 1226! lib/Encode/Alias.pm 1227! lib/Encode/Supported.pm 1228 Anton's revision commited. Added Dan's own fixes as well. 1229 Message-Id: <159103166906.20020405161134@motor.ru> 1230! lib/Encode/Alias.pm 1231 134c134 1232 < qr/^UCS2-le$/i => '"UCS-2"', ); 1233 --- 1234 > qr/^UCS2-LE$/i => '"UTF-16LE"'); 1235 Sigh. Thank you, Anton. 1236 Message-Id: <14567692196.20020405062020@motor.ru> 1237 Message-Id: <69FEC0B4-483E-11D6-A045-00039301D480@dan.co.jp> 1238 12391.20 2002/04/04 19:50:52 1240+ bin/unidump 1241 the last minute addtion. Just give it a try. Docs remains to be done. 1242 Not installed by default. 1243! lib/Encode/Supported.pod 1244 Enhanced Greatly. 1245! t/Alias.t 1246! lib/Encode/Alias.pm 1247! lib/Encode/utf8.pm 1248! lib/Encode/10464_1.pm 1249! lib/Encode/ucs2_le.pm 1250 Canonical name for 'UCS-2le" is now "UTF-16LE". UCS-2 left 1251 unchanged but UTF-16BE is added as an alias. Implicit aliases 1252 move to Encode::Alias so init_alias() works more as expected. 1253 Also, 'utf8' is now canonical with 'UTF-8' being an alias. 1254 Though pedantically wrong, This should make perl mongers happier. 1255 t/Alias.t is enhanced to test all these. 1256 Message-Id: <9C39BD58-47AF-11D6-9D82-00039301D480@dan.co.jp> 1257! Byte/Makefile.PL 1258 Now all .ucm are stacked in byte_t; They all share ascii part so 50% 1259 of the codepoints are common. CJKT left as is because the saving is 1260 not significant. 1261! Byte/Makefile.PL 1262! CN/Makefile.PL 1263! EBCDIC/Makefile.PL 1264! Encode.xs 1265! Encode/Makefile_PL.e2x 1266! JP/Makefile.PL 1267! KR/Makefile.PL 1268! Makefile.PL 1269! Symbol/Makefile.PL 1270! TW/Makefile.PL 1271! bin/enc2xs 1272! AUTHORS 1273 All occurance of _def.h replaced with .exh so djgpp works happily 1274 ever after! To credit this amazing discovery, Laszlo is now in 1275 AUTHORS list 1276 Message-Id: <20020403181424.GA8778@freemail.hu> 1277 Message-Id: <B5BF0C6F-4732-11D6-B13D-00039301D480@dan.co.jp> 1278! Makefile.PL 1279! */Makefile.PL 1280! Encode/Makefile_PL.skel 1281 bin/enc2xs 1282 No more @INC fiddling! Uses $ENV{PERL_CORE} instead 1283 Message-Id: <20020401222744.GX2000@blackrider>, et al. 1284! t/encoding.t 1285 Two more tests by added jhi 1286 Message-Id: <200204020000.DAA25121@alpha.hut.fi> 1287+ t/grow.t 1288! Encode.xs 1289 The showstopper fixed -- Memory reallocation bug was causing 1290 Encode::XS to fall into infinite loop on certain conditions. 1291 t/grow.t tests that. 1292 Message-Id: <9572CAC4-463C-11D6-ABA5-00039301D480@dan.co.jp>, et al 1293+ bin/txt2ucm 1294! */Makefile.PL 1295! */*.ucm 1296! */XX.pm 1297! lib/Encode/Supported.pod 1298 Vendor encodings rebuilt out of original map files at unicode.org. 1299 Indic languages such as MacDevanagali remain unspported do to the 1300 shortcoming of encengine capabilities (they need algorithmical 1301 conversion and I have no knowledge on that!). Pods fixed for added 1302 encodings. 1303 Oh, macJapan.ucm renamed to macJapanese.ucm. 1304 macROMnn is macRomanian and macRUMnn is macRumanian. 1305 txt2ucm is a crude script that is used to convert them. 1306! bin/enc2xs 1307 Unicode Compound Characters (used extensively on Mac) supported 1308! bin/piconv 1309 Typo fixes and improvements by jhi 1310 Message-Id: <200204010201.FAA03564@alpha.hut.fi>, et al. 1311 13121.11 2002/03/31 22:12:13 1313+ t/encoding.t 1314+ t/jperl.t 1315! MANIFEST 1316 Missing files from the MANIFEST fixed. 1317 Message-Id: <20020401010156.H10509@alpha.hut.fi> 1318 Version incremented just to make CPAN happy. 1319 13201.10 2002/03/31 21:32:42 1321! Makefile.PL 1322! README 1323 INSTALL_UCM option added to Makefile.PL so you can install *.ucm 1324 if you want. This should make Autrijus happy. Also, piconv 1325 is added to default install. 1326+ Encode/*.e2x 1327! bin/enc2xs 1328 Here-documented files that enc2xs generates are now exported 1329 to *.e2x. Much cleaner and easier to debug. 1330! encoding.pm 1331 encoding enhances so you can make it act more like such 1332 (now prehistoric ) "localized" variations of perl like Jperl. 1333+ t/jperl.t 1334 Further test for encoding.pm. Written in euc-jp 1335+ encoding.pm 1336+ t/encoding.t 1337 Taken over form jhi. 1338 Message-Id: <20020330174618.B10154@alpha.hut.fi> 1339- Encode/*.ucm 1340+ ucm/*.ucm 1341! Makefile.PL 1342! */Makefile.PL 1343 *.ucm relocated to ucm/ so MakeMaker will not install'em by default. 1344- ucm2table 1345+ bin/ucm2table 1346 *** 1347! AUTHORS 1348! Byte/Byte.pm 1349! Encode.pm 1350! Encode/macIceland.ucm 1351! lib/Encode/Alias.pm 1352! lib/Encode/Supported.pod 1353 MacIceland fixes and Pod Typo fixes. This adds Andreas to AUTHORS. 1354 Message-Id: <m3lmcavhjt.fsf@anima.de> 1355 13561.01 2002/03/29 20:59:39 1357! Makefile.PL 1358! README 1359 s/USE_SCRIPTS/MORE_SCRIPTS/ 1360! Makefile.PL 1361 installs enc2xs by default for external Encode:: modules in CPAN, 1362 such as Encode::HanExtra 1363! t/*.t 1364 More sensible perl core detection via $ENV{PERL_CORE} 1365 suggested by Spider 1366 Message-Id: <200203291007.FAA07329@Orb.Nashua.NH.US> 1367! bin/enc2xs 1368 Perl core ditection via $^X =~ m/\bminiperl$/o 1369 Message-Id: <A5C7B0CA-42F1-11D6-B5AD-00039301D480@dan.co.jp> 1370 13711.00 Wed Mar 29 2002 1372! * 1373 The version of all files is updated to 1.00 via "ci -f -l1.00", 1374 commemorating version 1.00. All files, including *.ucm are now 1375 under version control. 1376- encode.h 1377+ Encode/encode.h 1378 encode.h moved to Encode/ so it will be installed for the later 1379 use by enc2xs 1380! enc2xs 1381 h2xs-like feature added via "h2xs -M Name *.(enc|ucm)" 1382! Makefile.PL 1383! */Makefile.PL 1384- compile 1385+ bin/enc2xs 1386 compile renamed to enc2xs. 1387 Affected Makefle.PL updated 1388- lib/CN/2022_CN.pm 1389 "Punt it. HanExtra can take care of that later." -- Autrijus 1390 Message-Id: <20020328154338.GA7351@not.autrijus.org> 1391! Encode/johab.ucm 1392! Encode/euc-kr.ucm 1393! Encode/ksc5601.ucm 1394! lib/Encode/CJKConstants.pm 1395! lib/Encode/KR/2022_KR.pm 1396 Table patches for Euro Signs, 2022-KR fixups by Jungshik 1397 Message-Id: <Pine.LNX.4.44.0203280616190.2259-200000@www.ykga.org> 1398! README 1399! Makefile.PL 1400+ bin/piconv 1401 bin/ added for example scripts. They are not installed by default. 1402 to install them, "perl Makefile.PL USE_SCRIPTS". 1403 piconv is iconv reinvented in perl. in addition to all features 1404 of iconv, it also adds perlish features. See L<piconv/1> for more 1405 details. 1406! lib/Encode/Alias.pm 1407 qr/^ replaced with qr/\b so it directly matches locale names 1408 such as en_US.US-ASCII 1409! AUTHORS 1410! t/Aliases.t 1411 Patch by MJD to fix the following problem applied. 1412 Subject: [PATCH 5.7.3 Encode] 1413 Aliases.t not properly skipped when Encode extension not built 1414 Message-Id: <20020328091850.18677.qmail@plover.com> 1415! lib/Encode/KR/2022_KR.pm 1416! lib/Encode/CJKConstants.pm 1417 Another patch from Jungshik to make iso-2022-kr actually work 1418 Message-Id: <Pine.LNX.4.44.0203271745210.30462-200000@www.ykga.org> 1419! Encode/Encode/euc-kr.ucm 1420+ Encode/Encode/johab.ucm 1421! Encode/Encode/ksc5601.ucm 1422! Encode/KR/KR.pm 1423! Encode/KR/Makefile.PL 1424! Encode/lib/Encode/Alias.pm 1425! t/Alias.t 1426 Johab support and complete revision of Korean Encoding by Jungshik 1427 Message-Id: <Pine.LNX.4.44.0203271105060.30462-200000@www.ykga.org> 1428+ Encode.pm 1429 Revised to make up with now-dropped Encode::Details. 1430- lib/Encode/Details.pod 1431 Dropped. Besides being obsolete, the topics are now covered in 1432 respective pods now. 1433! AUTHORS 1434! t/Alias.t 1435 KR/KR.pm 1436 lib/Encode/Alias.pm 1437 Korean aliases fixed thanks to Jungshik Shin 1438 /ks[-_ ]?c[-_ ]?5601-1987$/i => cp936 1439 Message-Id: <Pine.LNX.4.44.0203262102250.1237-100000@www.ykga.org> 1440! *.pm 1441 =head1 NAME added to all modules to make buildtoc happy 1442 Message-Id: <20020327041151.A10618@alpha.hut.fi> 1443- lib/Encode/CJKguide.pod 1444 Too controversial and dropped from the dist. Will be available 1445 separately on the web. 1446! Encode/*.ucm 1447 RCS tags added so table debugging gets easier (should that be 1448 needed! I hope they all stay 1.00!) 1449+ lib/Encode/CJKguide.pod 1450 A detailed guide to mainly, but not limited to, CJK multibyte 1451 encodings. 1452- Encode/roman8.ucm 1453+ Encode/hp-roman8.ucm 1454! Byte/Makefile.PL 1455! Encode/Supported.pod 1456 All occurance of "roman8" replaced with "hp-roman8" to avoid 1457 confusion 1458! Encode/Supported.pod 1459! Encode/mac*.ucm 1460! t/Alias.t 1461 Mac Encodings now comply the Inside Macintosh 1462! t/Alias.t 1463 Test for '-raw' conventions added. 1464! Encode/Alias.pm 1465 aliased gb2312 -> euc-cn, ksc5601 -> euc-kr 1466! Encode/gb12345.ucm 1467! Encode/gb2312.ucm 1468! Encode/ksc5601.ucm 1469 "-raw" appended to canonical names. 1470 File mames stay unchanged thanks to UCM format. 1471! lib/Encode/CN/HZ.pm 1472 Patch from Autrijus to fix gb2312 -> gb2312-raw + code linting 1473 Message-Id: <20020326035210.GA2091@not.autrijus.org> 1474 14750.99 Tue Mar 26 2002 1476- lib/Encode/JP/Const.pm 1477+ lib/Encode/CJKConstants.pm 1478+ lib/Encode/CN/2022_CN.pm 1479+ lib/Encode/KR/2022_KR.pm 1480+ t/KR.t 1481+ t/gb2312.euc 1482+ t/gb2312.ref 1483+ t/ksc5601.euc 1484+ t/ksc5601.ref 1485+ t/table.euc 1486+ t/table.ref 1487+ ucm2table 1488 * Support for ISO-2022-KR and ISO-2022-CN added. 1489 * t/KR.t added! 1490 * more t/*.{euc,ref} added, which was autogenerated from ucm2table 1491 * ucm2table autogenerates character table out of UCM files. 1492- engine.c 1493+ encengine.c 1494- lib/Encode/Supports.pod 1495+ lib/Encode/Supported.pod 1496 Names reverted due to popular demand. 1497 8.3 rule applies only when there is a conflict. 1498 Message-Id: <20020325095924.GD44120@not.autrijus.org> 1499! */Makefile.PL 1500- Encode/*.enc 1501+ Encode/*.ucm 1502- lib/Tcl* 1503- lib/Encode/Format/Enc.pod 1504- t/Tcl.t 1505 * Character tables is now 100% ucm. 1506 * All files under Encode/ is now 8.3-compliant 1507 * some of missing encodings added (i.e. gsm0338 and nextstep) 1508 * Vendor mappings aggregated with appropriate national std in 1509 Makefile.PL, resulting smaller *.so especially for CJK. 1510 Following is result on Dan's FreeBSD box. 1511 Now Then 1512 --------------------------------------------------------------- 1513 blib/arch/auto/Encode/Byte/Byte.so 157,279 171,042 1514 blib/arch/auto/Encode/CN/CN.so 1,634,476 1,626,685 1515 blib/arch/auto/Encode/EBCDIC/EBCDIC.so 18,476 18,476 1516 blib/arch/auto/Encode/Encode.so 27,791 27,791 1517 blib/arch/auto/Encode/JP/JP.so 1,408,056 1,832,811 1518 blib/arch/auto/Encode/KR/KR.so 1,156,518 1,329,587 1519 blib/arch/auto/Encode/Symbol/Symbol.so 23,940 20,990 1520 blib/arch/auto/Encode/TW/TW.so* 948,761 1,316,437 1521 --------------------------------------------------------------- 1522 Total 5,375,297 6,343,819 1523 Saving 968,522 1524 * As a result of ucm-transition, Encode::Tcl dropped because 1525 Encode::Tcl demands *.enc. 1526 Encode::Tcl will be supplied in a separate tarball with *.enc. 1527 Message-Id: <C024E294-3FC3-11D6-8347-00039301D480@dan.co.jp> 1528!compile 1529-encengine.c 1530+encode.c 1531!Encode.pm 1532-lib/Encode/Supported.pod 1533+lib/Encode/Supports.pod 1534-lib/Encode/iso10646_1.pm 1535+lib/Encode/10646_1.pm 1536-lib/Encode/EncFormat.pod 1537+lib/Encode/Format/Enc.pod 1538 Files renamed 8.3 filename compliance. Affected modules/scripts revised. 1539- lib/Encode/JP/Constants.pm 1540+ lib/Encode/JP/Consts.pm 1541! lib/Encode/JP/JIS.pm 1542! lib/Encode/JP/H2Z.pm 1543 Version nit problem and 8.3 rule fix. 1544 > Package namespace installed latest in CPAN file 1545 > Encode::JP::Constants 0.92 1.02 J/JH/JHI/perl-5.7.3.tar.gz 1546 was noted by jhi then Dan discovers "Constants.pm" does not comply 8.3 1547 rule. Contants.pm renamed to Consts.pm and affected modules are fixed 1548 accordingly. In addition, legacy "use vars qw()..." are replaced with 1549 "our"; 1550 Message-Id: <20020325011248.D1561@alpha.hut.fi> 1551 Message-Id: <41023D51-3FB5-11D6-8347-00039301D480@dan.co.jp> 1552! JP/JP.pm 1553- lib/Encode/JP/ISO_2022_JP.pm 1554- lib/Encode/JP/ISO_2022_JP_1.pm 1555+ lib/Encode/JP/2022_JP.pm 1556+ lib/Encode/JP/2022_JP1.pm 1557 01234567.012 1558 8.3 naming conflict for vanilla fat addressed by jhi 1559 Message-Id: <20020324201931.V22596@alpha.hut.fi> 1560 1561! Encode.xs 1562 Typecast fix addressed by jhi 1563 Message-Id: <20020324185540.T22596@alpha.hut.fi> 1564 15650.98 Mon Mar 25 2002 1566! lib/Encode/Supported.pod 1567 Further pod fixes 1568+ lib/Encode/JP/ISO_2022_JP_1.pm 1569! lib/Encode/JP/ISO_2022_JP.pm 1570! lib/Encode/JP/JIS.pm 1571! JP/JP.pm 1572 Now Encode::JP is more strict on the difference between ISO-2022-JP 1573 and ISO-2022-JP-1. See JP/JP.pm for details. I hope this move 1574 makes Anton happier :) FYI the previous version implements 1575 ISO-2022-JP as ISO-2022-JP-1 since it had X0212 support. 1576! lib/Encode/Supported.pod 1577 Further pod fixes 1578! Encode.xs 1579 Avoid core-dump in Encode with PERLIO=mmap by NI-S 1580 Message-Id: <20020324104139.1326.7@bactrian.ni-s.u-net.com> 1581! CN/CN.pm 1582! JP/JP.pm 1583! KR/KR.pm 1584! TW/TW.pm 1585! lib/Encode/Suppoted.pod 1586 pod fixes to replace F<http://...> to L<http://...>, 1587 as suggested by Autrijius in: 1588 Message-Id: <20020324083943.GA14901@not.autrijus.org> 1589! lib/Encode/Suppoted.pod 1590 fixes and enhancements by Anton 1591 Message-Id: <10632060120.20020324103753@motor.ru> 1592! lib/Encode/Alias.pm 1593 > define_alias( qr/^GB[- ]?(\d+)$/i => '"gb$1"' ); 1594 added. Suggested by Anton then deobfuscated by Autrijius 1595 Message-Id: <20020324064455.GA3667@not.autrijus.org> 1596! compile 1597 Further fix by Nicholas Clark 1598 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net> 1599- lib/EncodeFormat.pod 1600+ lib/Encode/EncFormat.pod 1601! MANIFEST 1602 File renamed as suggested by Autrijius 1603! Encode.pm 1604! lib/Encode/Details.pod 1605! lib/Encode/Supported.pod Sun Mar 24 13:29:35 2002 1606! Encode.pm Sun Mar 24 13:43:47 2002 1607 pod fixes by Autrijius. 1608 Message-Id: <20020324062804.GA3595@not.autrijus.org> 1609 Message-Id: <20020324075627.GB11986@not.autrijus.org> 1610! t/Alias.t 1611! lib/Encode/Alias.pm 1612! Encode.pm 1613 now more EBCDIC conscious; 1614 %ExtModules on EBCDIC system excludes CJK so that you don't 1615 have to worry about the matched alias resulting cloaking. 1616 t/Alias.t also revised to reflect changes. Verified by jhi 1617 Message-Id: <20020324022929.D22596@alpha.hut.fi> 1618 16190.97 Sun Mar 24 2002 1620! CN/CN.pm 1621! KR/KR.pm 1622! TW/TW.pm 1623 EBCDIC detection mechanism installed as in JP/JP.pm 1624 Message-Id: <20020323211847.G19148@alpha.hut.fi> 1625! Byte/Makefile.PL 1626! CN/Makefile.PL 1627! EBCDIC/Makefile.PL 1628! JP/Makefile.PL 1629! KR/Makefile.PL 1630! Symbol/Makefile.PL 1631! TW/Makefile.PL 1632 Now all table files used by compile are postfixed '_t' to avoid 1633 namespace collisions in case insensitive file systems once for all! 1634 inspired by: 1635 Message-ID: <58290227735.20020323195659@familiehaase.de> 1636! t/Aliases.t 1637 Since the Encode::JP is unsupported under EBCDIC we 1638 cannot run this test (aliases as such should work fine) -- jhi 1639 Message-Id: <20020323202119.D19148@alpha.hut.fi> 1640! Byte/Makefile.PL 1641 duplicate occurance of ascii.ucm and 8859-1.ucm 1642 causes MacOS X dlyd to cloak 1643! t/CN.t 1644! t/Encode.t 1645! t/JP.t 1646! t/TW.t 1647! t/Tcl.t 1648 < chdir 't' if -d 't'; 1649 --- 1650 > if (! -d 'blib' and -d 't'){ chdir 't' }; 1651 When you are "make test"-ing on Encode/ directory, you must not 1652 change $ENV{PWD}. t/JP.t has been fixed before but others somehow 1653 remain unchanced. Also the situation detection was made simpler 1654 in t/JP.t, which was originally; 1655 > chdir 't' if -d 't' and $ENV{PWD} !~ m,/Encode[^/]*$,o; 1656! Encode.pm 1657 "Use of uninitialized value in string eq at Encode.pm line 96." 1658! Symbol/Makefile.PL 1659! EBCDIC/Makefile.PL 1660! AUTHOR 1661 -- Problem on case insensitive file systems 1662 "coexist of ebcdic.c <> EBCDIC.c on Cygwin not possible" 1663 Message-ID: <88254111953.20020323095503@familiehaase.de> 1664! compile 1665! AUTHOR 1666 "So I think it's a bug in gcc, not perl. But it still needs to be 1667 worked around." 1668 Message-Id: <20020323145840.GD304@Bagpuss.unfortu.net> 1669 Message-Id: <20020323170509.C96475@plum.flirble.org> 1670 16710.96 Sat Mar 23 2002 1672! TW/TW.pm 1673! lib/Encode/Encoding.pm 1674! lib/Encode/Alias.pm 1675! lib/Encode/Supported.pod 1676! KR/KR.pm 1677 Pod Fixes by Michael G Schwern <schwern@pobox.com> via jhi 1678 Message-ID: <20020322073908.GB10539@blackrider> 1679! Makefile.PL 1680! Encode.pm 1681 "...I think we should include ISO 8859-1 as well." -- NI-S 1682 Message-Id: <20020322120230.1332.8@bactrian.elixent.com> 1683! JP/JP.pm 1684! CN/CN.pm 1685! KR/KR.pm 1686! TW/TW.pm 1687! lib/Encode/Alias.pm 1688 alias definitions relocated to Encode::Alias so module autoloading 1689 works for aliases also. 1690! Encode.pm 1691 encodings() now accepts args to check ExtModules. 1692+ Byte/Byte.pm 1693+ Byte/Makefile.PL 1694+ EBCDIC/EBCDIC.pm 1695+ EBCDIC/Makefile.PL 1696+ Symbol/Makefile.PL 1697+ Symbol/Symbol.pm 1698! Encode.pm 1699! Encode.xs 1700 Latin and single byte encodings are reorganized so they are 1701 demand-loaded like Encode::XX. Now only ascii is compiled into 1702 Encode itself. 1703! lib/Encode/Alias.pm 1704 for my $k (keys %hash){ delete $hash{$k}; } 1705 is depreciated; fixed. 1706 17070.95 Fri Mar 22 2002 1708 In this update, pod rewrites and alias fixes are the main issues 1709+ lib/Encode/Supported.pod 1710 Describes supported encodings 1711! Makefile.PL 1712 streamlined compiled-in encodings. 1713! lib/Encode/Description.pod -> lib/Encode/Details.pod 1714 Renamed. 1715+ Encode/ibm-125?.ucm 1716 Added from icu distibution with any occurance of 1717 "IBM-125?" to "cp125?". Filenames remain unchanged to pay 1718 some respect to icu staff, however. 1719+ lib/Encode/Alias.pm 1720! Encode.pm 1721 Alias difinitions in Encode.pm relocated. 1722! AUTHORS 1723! Encode.xs 1724 packWARN patch from Paul Marquess via jhi 1725 Message-Id: <20020321010101.O28978@alpha.hut.fi> 1726 Paul added to AUTHORS as a result. 1727! t/CJKalias.t -> t/Aliases.t 1728 Renamed. Checks even more aliases and alias overloading 1729! Encode.pm 1730! CN/CN.pm 1731 duplicate alias for ujis => euc-jp removed (Encode::JP has one) 1732 gbk => cp936 relocated to CN.pm 1733! t/CJKalias.t 1734 Test::More with plans (by jhi) 1735 17360.94 Thu Mar 21 2002 1737+ lib/Encode/Description.pod 1738! lib/Encode/Encoding.pm 1739 Now the pod in Encode.pm is abridged as programming references. 1740 lib/Encode/Description.pod contains the original, detailed description 1741 and Encode::Encoding explains how to write your own module to 1742 add new encodings. So far, lib/Encode/Description.pod contains 1743 the whole pod once in Encode.pm. This is intentional. 1744! Encode.pm 1745 Pod revisions by Anton Tagunov 1746 Message-Id: <517178431.20020320174824@motor.ru> 1747! lib/Encode/Tcl.pm 1748 all occrance of Encode::Tcl::Extended removed including pod 1749! t/CJKalias.t 1750 test now checks $encoding->name only; $encoding->{name} are 1751 no longer check to find the canonical name. 1752! lib/Encode/JP/JIS.pm 1753! lib/Encode/JP/ISO_2022_JP.pm 1754 ->name() added to be more compliant with API 1755! CN/CN.pm 1756! JP/JP.pm 1757! KR/KR.pm 1758! TW/TW.pm 1759! t/CJKalias.t 1760 Patch by Autrijus to add aliases to TW and fixes to POD 1761 Message-Id: <20020320090619.GA24774@not.autrijus.org> 1762! AUTHORS 1763 SADAHIRO Tomoyuki added as should. My apologies. 1764 17650.93 Wed Mar 20 2002 1766* First release to be uploaded to CPAN. For prehistoric changes, 1767 please see Changes file of perl distibution as well as 1768 perl-unicode@perl.org archive, available at: 1769 http://archive.develooper.com/perl-unicode@perl.org/ 1770 1771 Changes Since 0.92 includes; 1772+ Changes 1773+ AUTHORS 1774! Encode.pm 1775! README 1776 + Mention to perl-unicode@perl.org added 1777! JP/JP.pm 1778 + Encoding aliases added so you can feed locale names 1779 and MIME Charset="" directly. 1780 - Mention to JISX0212 removed because it's fixed 1781! CN/CN.pm 1782! KR/KR.pm 1783 + Encoding aliases added. Note TW is left untouched because 1784 euc-tw is not implemented in TW but in Encode::HanExtra. 1785 Autrijus, you may fix Encode::HanExtra. 1786+ t/CJKalias.t 1787 + to test encode aliases added 1788