1/****************************************************************
2Copyright (C) Lucent Technologies 1997
3All Rights Reserved
4
5Permission to use, copy, modify, and distribute this software and
6its documentation for any purpose and without fee is hereby
7granted, provided that the above copyright notice appear in all
8copies and that both that the copyright notice and this
9permission notice and warranty disclaimer appear in supporting
10documentation, and that the name Lucent Technologies or any of
11its entities not be used in advertising or publicity pertaining
12to distribution of the software without specific, written prior
13permission.
14
15LUCENT DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE,
16INCLUDING ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS.
17IN NO EVENT SHALL LUCENT OR ANY OF ITS ENTITIES BE LIABLE FOR ANY
18SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES
19WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER
20IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION,
21ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF
22THIS SOFTWARE.
23****************************************************************/
24
25This file lists all bug fixes, changes, etc., made since the
26second edition of the AWK book was published in September 2023.
27
28Jul 28, 2024
29          Fixed readcsvrec resize segfault when reading csv records longer
30          than 8k. Thanks to Ozan Yigit.
31          mktime() added to bsd-features branch. Thanks to Todd Miller.
32
33Jun 23, 2024
34          Fix signal for system-status test. Thanks to Tim van der Molen.
35          Rewrite if-else chain as switch. Thanks to Andrew Sukach.
36
37May 27, 2024
38          Spelling fixes and removal of unneeded prototypes and extern.
39          Thanks to Jonathan Gray.
40
41May 4, 2024
42          Fixed a use-after-free bug with ARGV for "delete ARGV".
43          Also ENVtab is no longer global. Thanks to Benjamin Sturz
44          for spotting the ARGV issue and         Todd Miller for the fix.
45
46May 3, 2024:
47          Remove warnings when compiling with g++. Thanks to Arnold Robbins.
48
49Apr 22, 2024:
50          Fixed regex engine gototab reallocation issue that was
51          Introduced during the Nov 24 rewrite. Thanks to Arnold Robbins.
52          Fixed a scan bug in split in the case the separator is a single
53          character. Thanks to Oguz Ismail for spotting the issue.
54
55Mar 10, 2024:
56          Fixed use-after-free bug in fnematch due to adjbuf invalidating
57          the pointers to buf. Thanks to github user caffe3 for spotting
58          the issue and providing a fix, and to Miguel Pineiro Jr.
59          for the alternative fix.
60          MAX_UTF_BYTES in fnematch has been replaced with awk_mb_cur_max.
61          thanks to Miguel Pineiro Jr.
62
63Jan 22, 2024:
64          Restore the ability to compile with g++. Thanks to
65          Arnold Robbins.
66
67Dec 24, 2023:
68          Matchop dereference after free problem fix when the first
69          argument is a function call. Thanks to Oguz Ismail Uysal.
70          Fix inconsistent handling of --csv and FS set in the
71          command line. Thanks to Wilbert van der Poel.
72          Casting changes to int for is* functions.
73
74Nov 27, 2023:
75          Fix exit status of system on MacOS. Update to REGRESS.
76          Thanks to Arnold Robbins.
77          Fix inconsistent handling of -F and --csv, and loss of csv
78          mode when FS is set.
79
80Nov 24, 2023:
81        Fix issue #199: gototab improvements to dynamically resize the
82        table, qsort and bsearch to improve the lookup speed as the
83        table gets larger for multibyte input. Thanks to Arnold Robbins.
84
85Nov 23, 2023:
86          Fix Issue #169, related to escape sequences in strings.
87          Thanks to Github user rajeevvp.
88          Fix Issue #147, reported by Github user drawkula, and fixed
89          by Miguel Pineiro Jr.
90
91Nov 20, 2023:
92          Rewrite of fnematch to fix a number of issues, including
93          extraneous output, out-of-bounds access, number of bytes
94          to push back after a failed match etc.
95          Thanks to Miguel Pineiro Jr.
96
97Nov 15, 2023:
98          Man page edit, regression test fixes. Thanks to Arnold Robbins
99          Consolidation of sub and gsub into dosub, removing duplicate
100          code. Thanks to Miguel Pineiro Jr.
101          gcc replaced with cc everywhere.
102
103Oct 30, 2023:
104          Multiple fixes and a minor code cleanup.
105          Disabled utf-8 for non-multibyte locales, such as C or POSIX.
106          Fixed a bad char * cast that causes incorrect results on big-endian
107          systems. Also fixed an out-of-bounds read for empty CCL.
108          Fixed a buffer overflow in substr with utf-8 strings.
109          Many thanks to Todd C Miller.
110
111Sep 24, 2023:
112          fnematch and getrune have been overhauled to solve issues around
113          unicode FS and RS. Also fixed gsub null match issue with unicode.
114          Big thanks to Arnold Robbins.
115
116Sep 12, 2023:
117          Fixed a length error in u8_byte2char that set RSTART to
118          incorrect (cannot happen) value for EOL match(str, /$/).
119
120
121-----------------------------------------------------------------
122
123[This entry is a summary, not a precise list of changes.]
124
125          Added --csv option to enable processing of comma-separated
126          values inputs.  When --csv is enabled, fields are separated
127          by commas, fields may be quoted with " double quotes, fields
128          may contain embedded newlines.
129
130          If no explicit separator argument is provided, split() uses
131          the setting of --csv to determine how fields are split.
132
133          Strings may now contain UTF-8 code points (not necessarily
134          characters).  Functions that operate on characters, like
135          length, substr, index, match, etc., use UTF-8, so the length
136          of a string of 3 emojis is 3, not 12 as it would be if bytes
137          were counted.
138
139          Regular expressions are processed as UTF-8.
140
141          Unicode literals can be written as \u followed by one
142          to eight hexadecimal digits.  These may appear in strings and
143          regular expressions.
144