1=head1 NAME 2X<function> 3 4perlfunc - Perl builtin functions 5 6=head1 DESCRIPTION 7 8The functions in this section can serve as terms in an expression. 9They fall into two major categories: list operators and named unary 10operators. These differ in their precedence relationship with a 11following comma. (See the precedence table in L<perlop>.) List 12operators take more than one argument, while unary operators can never 13take more than one argument. Thus, a comma terminates the argument of 14a unary operator, but merely separates the arguments of a list 15operator. A unary operator generally provides a scalar context to its 16argument, while a list operator may provide either scalar or list 17contexts for its arguments. If it does both, the scalar arguments will 18be first, and the list argument will follow. (Note that there can ever 19be only one such list argument.) For instance, splice() has three scalar 20arguments followed by a list, whereas gethostbyname() has four scalar 21arguments. 22 23In the syntax descriptions that follow, list operators that expect a 24list (and provide list context for the elements of the list) are shown 25with LIST as an argument. Such a list may consist of any combination 26of scalar arguments or list values; the list values will be included 27in the list as if each individual element were interpolated at that 28point in the list, forming a longer single-dimensional list value. 29Commas should separate elements of the LIST. 30 31Any function in the list below may be used either with or without 32parentheses around its arguments. (The syntax descriptions omit the 33parentheses.) If you use the parentheses, the simple (but occasionally 34surprising) rule is this: It I<looks> like a function, therefore it I<is> a 35function, and precedence doesn't matter. Otherwise it's a list 36operator or unary operator, and precedence does matter. And whitespace 37between the function and left parenthesis doesn't count--so you need to 38be careful sometimes: 39 40 print 1+2+4; # Prints 7. 41 print(1+2) + 4; # Prints 3. 42 print (1+2)+4; # Also prints 3! 43 print +(1+2)+4; # Prints 7. 44 print ((1+2)+4); # Prints 7. 45 46If you run Perl with the B<-w> switch it can warn you about this. For 47example, the third line above produces: 48 49 print (...) interpreted as function at - line 1. 50 Useless use of integer addition in void context at - line 1. 51 52A few functions take no arguments at all, and therefore work as neither 53unary nor list operators. These include such functions as C<time> 54and C<endpwent>. For example, C<time+86_400> always means 55C<time() + 86_400>. 56 57For functions that can be used in either a scalar or list context, 58nonabortive failure is generally indicated in a scalar context by 59returning the undefined value, and in a list context by returning the 60null list. 61 62Remember the following important rule: There is B<no rule> that relates 63the behavior of an expression in list context to its behavior in scalar 64context, or vice versa. It might do two totally different things. 65Each operator and function decides which sort of value it would be most 66appropriate to return in scalar context. Some operators return the 67length of the list that would have been returned in list context. Some 68operators return the first value in the list. Some operators return the 69last value in the list. Some operators return a count of successful 70operations. In general, they do what you want, unless you want 71consistency. 72X<context> 73 74A named array in scalar context is quite different from what would at 75first glance appear to be a list in scalar context. You can't get a list 76like C<(1,2,3)> into being in scalar context, because the compiler knows 77the context at compile time. It would generate the scalar comma operator 78there, not the list construction version of the comma. That means it 79was never a list to start with. 80 81In general, functions in Perl that serve as wrappers for system calls 82of the same name (like chown(2), fork(2), closedir(2), etc.) all return 83true when they succeed and C<undef> otherwise, as is usually mentioned 84in the descriptions below. This is different from the C interfaces, 85which return C<-1> on failure. Exceptions to this rule are C<wait>, 86C<waitpid>, and C<syscall>. System calls also set the special C<$!> 87variable on failure. Other functions do not, except accidentally. 88 89=head2 Perl Functions by Category 90X<function> 91 92Here are Perl's functions (including things that look like 93functions, like some keywords and named operators) 94arranged by category. Some functions appear in more 95than one place. 96 97=over 4 98 99=item Functions for SCALARs or strings 100X<scalar> X<string> X<character> 101 102C<chomp>, C<chop>, C<chr>, C<crypt>, C<hex>, C<index>, C<lc>, C<lcfirst>, 103C<length>, C<oct>, C<ord>, C<pack>, C<q/STRING/>, C<qq/STRING/>, C<reverse>, 104C<rindex>, C<sprintf>, C<substr>, C<tr///>, C<uc>, C<ucfirst>, C<y///> 105 106=item Regular expressions and pattern matching 107X<regular expression> X<regex> X<regexp> 108 109C<m//>, C<pos>, C<quotemeta>, C<s///>, C<split>, C<study>, C<qr//> 110 111=item Numeric functions 112X<numeric> X<number> X<trigonometric> X<trigonometry> 113 114C<abs>, C<atan2>, C<cos>, C<exp>, C<hex>, C<int>, C<log>, C<oct>, C<rand>, 115C<sin>, C<sqrt>, C<srand> 116 117=item Functions for real @ARRAYs 118X<array> 119 120C<pop>, C<push>, C<shift>, C<splice>, C<unshift> 121 122=item Functions for list data 123X<list> 124 125C<grep>, C<join>, C<map>, C<qw/STRING/>, C<reverse>, C<sort>, C<unpack> 126 127=item Functions for real %HASHes 128X<hash> 129 130C<delete>, C<each>, C<exists>, C<keys>, C<values> 131 132=item Input and output functions 133X<I/O> X<input> X<output> X<dbm> 134 135C<binmode>, C<close>, C<closedir>, C<dbmclose>, C<dbmopen>, C<die>, C<eof>, 136C<fileno>, C<flock>, C<format>, C<getc>, C<print>, C<printf>, C<read>, 137C<readdir>, C<rewinddir>, C<seek>, C<seekdir>, C<select>, C<syscall>, 138C<sysread>, C<sysseek>, C<syswrite>, C<tell>, C<telldir>, C<truncate>, 139C<warn>, C<write> 140 141=item Functions for fixed length data or records 142 143C<pack>, C<read>, C<syscall>, C<sysread>, C<syswrite>, C<unpack>, C<vec> 144 145=item Functions for filehandles, files, or directories 146X<file> X<filehandle> X<directory> X<pipe> X<link> X<symlink> 147 148C<-I<X>>, C<chdir>, C<chmod>, C<chown>, C<chroot>, C<fcntl>, C<glob>, 149C<ioctl>, C<link>, C<lstat>, C<mkdir>, C<open>, C<opendir>, 150C<readlink>, C<rename>, C<rmdir>, C<stat>, C<symlink>, C<sysopen>, 151C<umask>, C<unlink>, C<utime> 152 153=item Keywords related to the control flow of your Perl program 154X<control flow> 155 156C<caller>, C<continue>, C<die>, C<do>, C<dump>, C<eval>, C<exit>, 157C<goto>, C<last>, C<next>, C<redo>, C<return>, C<sub>, C<wantarray> 158 159=item Keywords related to scoping 160 161C<caller>, C<import>, C<local>, C<my>, C<our>, C<package>, C<use> 162 163=item Miscellaneous functions 164 165C<defined>, C<dump>, C<eval>, C<formline>, C<local>, C<my>, C<our>, C<reset>, 166C<scalar>, C<undef>, C<wantarray> 167 168=item Functions for processes and process groups 169X<process> X<pid> X<process id> 170 171C<alarm>, C<exec>, C<fork>, C<getpgrp>, C<getppid>, C<getpriority>, C<kill>, 172C<pipe>, C<qx/STRING/>, C<setpgrp>, C<setpriority>, C<sleep>, C<system>, 173C<times>, C<wait>, C<waitpid> 174 175=item Keywords related to perl modules 176X<module> 177 178C<do>, C<import>, C<no>, C<package>, C<require>, C<use> 179 180=item Keywords related to classes and object-orientedness 181X<object> X<class> X<package> 182 183C<bless>, C<dbmclose>, C<dbmopen>, C<package>, C<ref>, C<tie>, C<tied>, 184C<untie>, C<use> 185 186=item Low-level socket functions 187X<socket> X<sock> 188 189C<accept>, C<bind>, C<connect>, C<getpeername>, C<getsockname>, 190C<getsockopt>, C<listen>, C<recv>, C<send>, C<setsockopt>, C<shutdown>, 191C<socket>, C<socketpair> 192 193=item System V interprocess communication functions 194X<IPC> X<System V> X<semaphore> X<shared memory> X<memory> X<message> 195 196C<msgctl>, C<msgget>, C<msgrcv>, C<msgsnd>, C<semctl>, C<semget>, C<semop>, 197C<shmctl>, C<shmget>, C<shmread>, C<shmwrite> 198 199=item Fetching user and group info 200X<user> X<group> X<password> X<uid> X<gid> X<passwd> X</etc/passwd> 201 202C<endgrent>, C<endhostent>, C<endnetent>, C<endpwent>, C<getgrent>, 203C<getgrgid>, C<getgrnam>, C<getlogin>, C<getpwent>, C<getpwnam>, 204C<getpwuid>, C<setgrent>, C<setpwent> 205 206=item Fetching network info 207X<network> X<protocol> X<host> X<hostname> X<IP> X<address> X<service> 208 209C<endprotoent>, C<endservent>, C<gethostbyaddr>, C<gethostbyname>, 210C<gethostent>, C<getnetbyaddr>, C<getnetbyname>, C<getnetent>, 211C<getprotobyname>, C<getprotobynumber>, C<getprotoent>, 212C<getservbyname>, C<getservbyport>, C<getservent>, C<sethostent>, 213C<setnetent>, C<setprotoent>, C<setservent> 214 215=item Time-related functions 216X<time> X<date> 217 218C<gmtime>, C<localtime>, C<time>, C<times> 219 220=item Functions new in perl5 221X<perl5> 222 223C<abs>, C<bless>, C<chomp>, C<chr>, C<exists>, C<formline>, C<glob>, 224C<import>, C<lc>, C<lcfirst>, C<map>, C<my>, C<no>, C<our>, C<prototype>, 225C<qx>, C<qw>, C<readline>, C<readpipe>, C<ref>, C<sub*>, C<sysopen>, C<tie>, 226C<tied>, C<uc>, C<ucfirst>, C<untie>, C<use> 227 228* - C<sub> was a keyword in perl4, but in perl5 it is an 229operator, which can be used in expressions. 230 231=item Functions obsoleted in perl5 232 233C<dbmclose>, C<dbmopen> 234 235=back 236 237=head2 Portability 238X<portability> X<Unix> X<portable> 239 240Perl was born in Unix and can therefore access all common Unix 241system calls. In non-Unix environments, the functionality of some 242Unix system calls may not be available, or details of the available 243functionality may differ slightly. The Perl functions affected 244by this are: 245 246C<-X>, C<binmode>, C<chmod>, C<chown>, C<chroot>, C<crypt>, 247C<dbmclose>, C<dbmopen>, C<dump>, C<endgrent>, C<endhostent>, 248C<endnetent>, C<endprotoent>, C<endpwent>, C<endservent>, C<exec>, 249C<fcntl>, C<flock>, C<fork>, C<getgrent>, C<getgrgid>, C<gethostbyname>, 250C<gethostent>, C<getlogin>, C<getnetbyaddr>, C<getnetbyname>, C<getnetent>, 251C<getppid>, C<getpgrp>, C<getpriority>, C<getprotobynumber>, 252C<getprotoent>, C<getpwent>, C<getpwnam>, C<getpwuid>, 253C<getservbyport>, C<getservent>, C<getsockopt>, C<glob>, C<ioctl>, 254C<kill>, C<link>, C<lstat>, C<msgctl>, C<msgget>, C<msgrcv>, 255C<msgsnd>, C<open>, C<pipe>, C<readlink>, C<rename>, C<select>, C<semctl>, 256C<semget>, C<semop>, C<setgrent>, C<sethostent>, C<setnetent>, 257C<setpgrp>, C<setpriority>, C<setprotoent>, C<setpwent>, 258C<setservent>, C<setsockopt>, C<shmctl>, C<shmget>, C<shmread>, 259C<shmwrite>, C<socket>, C<socketpair>, 260C<stat>, C<symlink>, C<syscall>, C<sysopen>, C<system>, 261C<times>, C<truncate>, C<umask>, C<unlink>, 262C<utime>, C<wait>, C<waitpid> 263 264For more information about the portability of these functions, see 265L<perlport> and other available platform-specific documentation. 266 267=head2 Alphabetical Listing of Perl Functions 268 269=over 8 270 271=item -X FILEHANDLE 272X<-r>X<-w>X<-x>X<-o>X<-R>X<-W>X<-X>X<-O>X<-e>X<-z>X<-s>X<-f>X<-d>X<-l>X<-p> 273X<-S>X<-b>X<-c>X<-t>X<-u>X<-g>X<-k>X<-T>X<-B>X<-M>X<-A>X<-C> 274 275=item -X EXPR 276 277=item -X 278 279A file test, where X is one of the letters listed below. This unary 280operator takes one argument, either a filename or a filehandle, and 281tests the associated file to see if something is true about it. If the 282argument is omitted, tests C<$_>, except for C<-t>, which tests STDIN. 283Unless otherwise documented, it returns C<1> for true and C<''> for false, or 284the undefined value if the file doesn't exist. Despite the funny 285names, precedence is the same as any other named unary operator, and 286the argument may be parenthesized like any other unary operator. The 287operator may be any of: 288 289 -r File is readable by effective uid/gid. 290 -w File is writable by effective uid/gid. 291 -x File is executable by effective uid/gid. 292 -o File is owned by effective uid. 293 294 -R File is readable by real uid/gid. 295 -W File is writable by real uid/gid. 296 -X File is executable by real uid/gid. 297 -O File is owned by real uid. 298 299 -e File exists. 300 -z File has zero size (is empty). 301 -s File has nonzero size (returns size in bytes). 302 303 -f File is a plain file. 304 -d File is a directory. 305 -l File is a symbolic link. 306 -p File is a named pipe (FIFO), or Filehandle is a pipe. 307 -S File is a socket. 308 -b File is a block special file. 309 -c File is a character special file. 310 -t Filehandle is opened to a tty. 311 312 -u File has setuid bit set. 313 -g File has setgid bit set. 314 -k File has sticky bit set. 315 316 -T File is an ASCII text file (heuristic guess). 317 -B File is a "binary" file (opposite of -T). 318 319 -M Script start time minus file modification time, in days. 320 -A Same for access time. 321 -C Same for inode change time (Unix, may differ for other platforms) 322 323Example: 324 325 while (<>) { 326 chomp; 327 next unless -f $_; # ignore specials 328 #... 329 } 330 331The interpretation of the file permission operators C<-r>, C<-R>, 332C<-w>, C<-W>, C<-x>, and C<-X> is by default based solely on the mode 333of the file and the uids and gids of the user. There may be other 334reasons you can't actually read, write, or execute the file. Such 335reasons may be for example network filesystem access controls, ACLs 336(access control lists), read-only filesystems, and unrecognized 337executable formats. 338 339Also note that, for the superuser on the local filesystems, the C<-r>, 340C<-R>, C<-w>, and C<-W> tests always return 1, and C<-x> and C<-X> return 1 341if any execute bit is set in the mode. Scripts run by the superuser 342may thus need to do a stat() to determine the actual mode of the file, 343or temporarily set their effective uid to something else. 344 345If you are using ACLs, there is a pragma called C<filetest> that may 346produce more accurate results than the bare stat() mode bits. 347When under the C<use filetest 'access'> the above-mentioned filetests 348will test whether the permission can (not) be granted using the 349access() family of system calls. Also note that the C<-x> and C<-X> may 350under this pragma return true even if there are no execute permission 351bits set (nor any extra execute permission ACLs). This strangeness is 352due to the underlying system calls' definitions. Read the 353documentation for the C<filetest> pragma for more information. 354 355Note that C<-s/a/b/> does not do a negated substitution. Saying 356C<-exp($foo)> still works as expected, however--only single letters 357following a minus are interpreted as file tests. 358 359The C<-T> and C<-B> switches work as follows. The first block or so of the 360file is examined for odd characters such as strange control codes or 361characters with the high bit set. If too many strange characters (>30%) 362are found, it's a C<-B> file; otherwise it's a C<-T> file. Also, any file 363containing null in the first block is considered a binary file. If C<-T> 364or C<-B> is used on a filehandle, the current IO buffer is examined 365rather than the first block. Both C<-T> and C<-B> return true on a null 366file, or a file at EOF when testing a filehandle. Because you have to 367read a file to do the C<-T> test, on most occasions you want to use a C<-f> 368against the file first, as in C<next unless -f $file && -T $file>. 369 370If any of the file tests (or either the C<stat> or C<lstat> operators) are given 371the special filehandle consisting of a solitary underline, then the stat 372structure of the previous file test (or stat operator) is used, saving 373a system call. (This doesn't work with C<-t>, and you need to remember 374that lstat() and C<-l> will leave values in the stat structure for the 375symbolic link, not the real file.) (Also, if the stat buffer was filled by 376an C<lstat> call, C<-T> and C<-B> will reset it with the results of C<stat _>). 377Example: 378 379 print "Can do.\n" if -r $a || -w _ || -x _; 380 381 stat($filename); 382 print "Readable\n" if -r _; 383 print "Writable\n" if -w _; 384 print "Executable\n" if -x _; 385 print "Setuid\n" if -u _; 386 print "Setgid\n" if -g _; 387 print "Sticky\n" if -k _; 388 print "Text\n" if -T _; 389 print "Binary\n" if -B _; 390 391=item abs VALUE 392X<abs> X<absolute> 393 394=item abs 395 396Returns the absolute value of its argument. 397If VALUE is omitted, uses C<$_>. 398 399=item accept NEWSOCKET,GENERICSOCKET 400X<accept> 401 402Accepts an incoming socket connect, just as the accept(2) system call 403does. Returns the packed address if it succeeded, false otherwise. 404See the example in L<perlipc/"Sockets: Client/Server Communication">. 405 406On systems that support a close-on-exec flag on files, the flag will 407be set for the newly opened file descriptor, as determined by the 408value of $^F. See L<perlvar/$^F>. 409 410=item alarm SECONDS 411X<alarm> 412X<SIGALRM> 413X<timer> 414 415=item alarm 416 417Arranges to have a SIGALRM delivered to this process after the 418specified number of wallclock seconds has elapsed. If SECONDS is not 419specified, the value stored in C<$_> is used. (On some machines, 420unfortunately, the elapsed time may be up to one second less or more 421than you specified because of how seconds are counted, and process 422scheduling may delay the delivery of the signal even further.) 423 424Only one timer may be counting at once. Each call disables the 425previous timer, and an argument of C<0> may be supplied to cancel the 426previous timer without starting a new one. The returned value is the 427amount of time remaining on the previous timer. 428 429For delays of finer granularity than one second, you may use Perl's 430four-argument version of select() leaving the first three arguments 431undefined, or you might be able to use the C<syscall> interface to 432access setitimer(2) if your system supports it. The Time::HiRes 433module (from CPAN, and starting from Perl 5.8 part of the standard 434distribution) may also prove useful. 435 436It is usually a mistake to intermix C<alarm> and C<sleep> calls. 437(C<sleep> may be internally implemented in your system with C<alarm>) 438 439If you want to use C<alarm> to time out a system call you need to use an 440C<eval>/C<die> pair. You can't rely on the alarm causing the system call to 441fail with C<$!> set to C<EINTR> because Perl sets up signal handlers to 442restart system calls on some systems. Using C<eval>/C<die> always works, 443modulo the caveats given in L<perlipc/"Signals">. 444 445 eval { 446 local $SIG{ALRM} = sub { die "alarm\n" }; # NB: \n required 447 alarm $timeout; 448 $nread = sysread SOCKET, $buffer, $size; 449 alarm 0; 450 }; 451 if ($@) { 452 die unless $@ eq "alarm\n"; # propagate unexpected errors 453 # timed out 454 } 455 else { 456 # didn't 457 } 458 459For more information see L<perlipc>. 460 461=item atan2 Y,X 462X<atan2> X<arctangent> X<tan> X<tangent> 463 464Returns the arctangent of Y/X in the range -PI to PI. 465 466For the tangent operation, you may use the C<Math::Trig::tan> 467function, or use the familiar relation: 468 469 sub tan { sin($_[0]) / cos($_[0]) } 470 471Note that atan2(0, 0) is not well-defined. 472 473=item bind SOCKET,NAME 474X<bind> 475 476Binds a network address to a socket, just as the bind system call 477does. Returns true if it succeeded, false otherwise. NAME should be a 478packed address of the appropriate type for the socket. See the examples in 479L<perlipc/"Sockets: Client/Server Communication">. 480 481=item binmode FILEHANDLE, LAYER 482X<binmode> X<binary> X<text> X<DOS> X<Windows> 483 484=item binmode FILEHANDLE 485 486Arranges for FILEHANDLE to be read or written in "binary" or "text" 487mode on systems where the run-time libraries distinguish between 488binary and text files. If FILEHANDLE is an expression, the value is 489taken as the name of the filehandle. Returns true on success, 490otherwise it returns C<undef> and sets C<$!> (errno). 491 492On some systems (in general, DOS and Windows-based systems) binmode() 493is necessary when you're not working with a text file. For the sake 494of portability it is a good idea to always use it when appropriate, 495and to never use it when it isn't appropriate. Also, people can 496set their I/O to be by default UTF-8 encoded Unicode, not bytes. 497 498In other words: regardless of platform, use binmode() on binary data, 499like for example images. 500 501If LAYER is present it is a single string, but may contain multiple 502directives. The directives alter the behaviour of the file handle. 503When LAYER is present using binmode on text file makes sense. 504 505If LAYER is omitted or specified as C<:raw> the filehandle is made 506suitable for passing binary data. This includes turning off possible CRLF 507translation and marking it as bytes (as opposed to Unicode characters). 508Note that, despite what may be implied in I<"Programming Perl"> (the 509Camel) or elsewhere, C<:raw> is I<not> the simply inverse of C<:crlf> 510-- other layers which would affect binary nature of the stream are 511I<also> disabled. See L<PerlIO>, L<perlrun> and the discussion about the 512PERLIO environment variable. 513 514The C<:bytes>, C<:crlf>, and C<:utf8>, and any other directives of the 515form C<:...>, are called I/O I<layers>. The C<open> pragma can be used to 516establish default I/O layers. See L<open>. 517 518I<The LAYER parameter of the binmode() function is described as "DISCIPLINE" 519in "Programming Perl, 3rd Edition". However, since the publishing of this 520book, by many known as "Camel III", the consensus of the naming of this 521functionality has moved from "discipline" to "layer". All documentation 522of this version of Perl therefore refers to "layers" rather than to 523"disciplines". Now back to the regularly scheduled documentation...> 524 525To mark FILEHANDLE as UTF-8, use C<:utf8>. 526 527In general, binmode() should be called after open() but before any I/O 528is done on the filehandle. Calling binmode() will normally flush any 529pending buffered output data (and perhaps pending input data) on the 530handle. An exception to this is the C<:encoding> layer that 531changes the default character encoding of the handle, see L<open>. 532The C<:encoding> layer sometimes needs to be called in 533mid-stream, and it doesn't flush the stream. The C<:encoding> 534also implicitly pushes on top of itself the C<:utf8> layer because 535internally Perl will operate on UTF-8 encoded Unicode characters. 536 537The operating system, device drivers, C libraries, and Perl run-time 538system all work together to let the programmer treat a single 539character (C<\n>) as the line terminator, irrespective of the external 540representation. On many operating systems, the native text file 541representation matches the internal representation, but on some 542platforms the external representation of C<\n> is made up of more than 543one character. 544 545Mac OS, all variants of Unix, and Stream_LF files on VMS use a single 546character to end each line in the external representation of text (even 547though that single character is CARRIAGE RETURN on Mac OS and LINE FEED 548on Unix and most VMS files). In other systems like OS/2, DOS and the 549various flavors of MS-Windows your program sees a C<\n> as a simple C<\cJ>, 550but what's stored in text files are the two characters C<\cM\cJ>. That 551means that, if you don't use binmode() on these systems, C<\cM\cJ> 552sequences on disk will be converted to C<\n> on input, and any C<\n> in 553your program will be converted back to C<\cM\cJ> on output. This is what 554you want for text files, but it can be disastrous for binary files. 555 556Another consequence of using binmode() (on some systems) is that 557special end-of-file markers will be seen as part of the data stream. 558For systems from the Microsoft family this means that if your binary 559data contains C<\cZ>, the I/O subsystem will regard it as the end of 560the file, unless you use binmode(). 561 562binmode() is not only important for readline() and print() operations, 563but also when using read(), seek(), sysread(), syswrite() and tell() 564(see L<perlport> for more details). See the C<$/> and C<$\> variables 565in L<perlvar> for how to manually set your input and output 566line-termination sequences. 567 568=item bless REF,CLASSNAME 569X<bless> 570 571=item bless REF 572 573This function tells the thingy referenced by REF that it is now an object 574in the CLASSNAME package. If CLASSNAME is omitted, the current package 575is used. Because a C<bless> is often the last thing in a constructor, 576it returns the reference for convenience. Always use the two-argument 577version if a derived class might inherit the function doing the blessing. 578See L<perltoot> and L<perlobj> for more about the blessing (and blessings) 579of objects. 580 581Consider always blessing objects in CLASSNAMEs that are mixed case. 582Namespaces with all lowercase names are considered reserved for 583Perl pragmata. Builtin types have all uppercase names. To prevent 584confusion, you may wish to avoid such package names as well. Make sure 585that CLASSNAME is a true value. 586 587See L<perlmod/"Perl Modules">. 588 589=item caller EXPR 590X<caller> X<call stack> X<stack> X<stack trace> 591 592=item caller 593 594Returns the context of the current subroutine call. In scalar context, 595returns the caller's package name if there is a caller, that is, if 596we're in a subroutine or C<eval> or C<require>, and the undefined value 597otherwise. In list context, returns 598 599 ($package, $filename, $line) = caller; 600 601With EXPR, it returns some extra information that the debugger uses to 602print a stack trace. The value of EXPR indicates how many call frames 603to go back before the current one. 604 605 ($package, $filename, $line, $subroutine, $hasargs, 606 $wantarray, $evaltext, $is_require, $hints, $bitmask) = caller($i); 607 608Here $subroutine may be C<(eval)> if the frame is not a subroutine 609call, but an C<eval>. In such a case additional elements $evaltext and 610C<$is_require> are set: C<$is_require> is true if the frame is created by a 611C<require> or C<use> statement, $evaltext contains the text of the 612C<eval EXPR> statement. In particular, for an C<eval BLOCK> statement, 613$filename is C<(eval)>, but $evaltext is undefined. (Note also that 614each C<use> statement creates a C<require> frame inside an C<eval EXPR> 615frame.) $subroutine may also be C<(unknown)> if this particular 616subroutine happens to have been deleted from the symbol table. 617C<$hasargs> is true if a new instance of C<@_> was set up for the frame. 618C<$hints> and C<$bitmask> contain pragmatic hints that the caller was 619compiled with. The C<$hints> and C<$bitmask> values are subject to change 620between versions of Perl, and are not meant for external use. 621 622Furthermore, when called from within the DB package, caller returns more 623detailed information: it sets the list variable C<@DB::args> to be the 624arguments with which the subroutine was invoked. 625 626Be aware that the optimizer might have optimized call frames away before 627C<caller> had a chance to get the information. That means that C<caller(N)> 628might not return information about the call frame you expect it do, for 629C<< N > 1 >>. In particular, C<@DB::args> might have information from the 630previous time C<caller> was called. 631 632=item chdir EXPR 633X<chdir> 634X<cd> 635 636=item chdir FILEHANDLE 637 638=item chdir DIRHANDLE 639 640=item chdir 641 642Changes the working directory to EXPR, if possible. If EXPR is omitted, 643changes to the directory specified by C<$ENV{HOME}>, if set; if not, 644changes to the directory specified by C<$ENV{LOGDIR}>. (Under VMS, the 645variable C<$ENV{SYS$LOGIN}> is also checked, and used if it is set.) If 646neither is set, C<chdir> does nothing. It returns true upon success, 647false otherwise. See the example under C<die>. 648 649On systems that support fchdir, you might pass a file handle or 650directory handle as argument. On systems that don't support fchdir, 651passing handles produces a fatal error at run time. 652 653=item chmod LIST 654X<chmod> X<permission> X<mode> 655 656Changes the permissions of a list of files. The first element of the 657list must be the numerical mode, which should probably be an octal 658number, and which definitely should I<not> be a string of octal digits: 659C<0644> is okay, C<'0644'> is not. Returns the number of files 660successfully changed. See also L</oct>, if all you have is a string. 661 662 $cnt = chmod 0755, 'foo', 'bar'; 663 chmod 0755, @executables; 664 $mode = '0644'; chmod $mode, 'foo'; # !!! sets mode to 665 # --w----r-T 666 $mode = '0644'; chmod oct($mode), 'foo'; # this is better 667 $mode = 0644; chmod $mode, 'foo'; # this is best 668 669On systems that support fchmod, you might pass file handles among the 670files. On systems that don't support fchmod, passing file handles 671produces a fatal error at run time. 672 673 open(my $fh, "<", "foo"); 674 my $perm = (stat $fh)[2] & 07777; 675 chmod($perm | 0600, $fh); 676 677You can also import the symbolic C<S_I*> constants from the Fcntl 678module: 679 680 use Fcntl ':mode'; 681 682 chmod S_IRWXU|S_IRGRP|S_IXGRP|S_IROTH|S_IXOTH, @executables; 683 # This is identical to the chmod 0755 of the above example. 684 685=item chomp VARIABLE 686X<chomp> X<INPUT_RECORD_SEPARATOR> X<$/> X<newline> X<eol> 687 688=item chomp( LIST ) 689 690=item chomp 691 692This safer version of L</chop> removes any trailing string 693that corresponds to the current value of C<$/> (also known as 694$INPUT_RECORD_SEPARATOR in the C<English> module). It returns the total 695number of characters removed from all its arguments. It's often used to 696remove the newline from the end of an input record when you're worried 697that the final record may be missing its newline. When in paragraph 698mode (C<$/ = "">), it removes all trailing newlines from the string. 699When in slurp mode (C<$/ = undef>) or fixed-length record mode (C<$/> is 700a reference to an integer or the like, see L<perlvar>) chomp() won't 701remove anything. 702If VARIABLE is omitted, it chomps C<$_>. Example: 703 704 while (<>) { 705 chomp; # avoid \n on last field 706 @array = split(/:/); 707 # ... 708 } 709 710If VARIABLE is a hash, it chomps the hash's values, but not its keys. 711 712You can actually chomp anything that's an lvalue, including an assignment: 713 714 chomp($cwd = `pwd`); 715 chomp($answer = <STDIN>); 716 717If you chomp a list, each element is chomped, and the total number of 718characters removed is returned. 719 720If the C<encoding> pragma is in scope then the lengths returned are 721calculated from the length of C<$/> in Unicode characters, which is not 722always the same as the length of C<$/> in the native encoding. 723 724Note that parentheses are necessary when you're chomping anything 725that is not a simple variable. This is because C<chomp $cwd = `pwd`;> 726is interpreted as C<(chomp $cwd) = `pwd`;>, rather than as 727C<chomp( $cwd = `pwd` )> which you might expect. Similarly, 728C<chomp $a, $b> is interpreted as C<chomp($a), $b> rather than 729as C<chomp($a, $b)>. 730 731=item chop VARIABLE 732X<chop> 733 734=item chop( LIST ) 735 736=item chop 737 738Chops off the last character of a string and returns the character 739chopped. It is much more efficient than C<s/.$//s> because it neither 740scans nor copies the string. If VARIABLE is omitted, chops C<$_>. 741If VARIABLE is a hash, it chops the hash's values, but not its keys. 742 743You can actually chop anything that's an lvalue, including an assignment. 744 745If you chop a list, each element is chopped. Only the value of the 746last C<chop> is returned. 747 748Note that C<chop> returns the last character. To return all but the last 749character, use C<substr($string, 0, -1)>. 750 751See also L</chomp>. 752 753=item chown LIST 754X<chown> X<owner> X<user> X<group> 755 756Changes the owner (and group) of a list of files. The first two 757elements of the list must be the I<numeric> uid and gid, in that 758order. A value of -1 in either position is interpreted by most 759systems to leave that value unchanged. Returns the number of files 760successfully changed. 761 762 $cnt = chown $uid, $gid, 'foo', 'bar'; 763 chown $uid, $gid, @filenames; 764 765On systems that support fchown, you might pass file handles among the 766files. On systems that don't support fchown, passing file handles 767produces a fatal error at run time. 768 769Here's an example that looks up nonnumeric uids in the passwd file: 770 771 print "User: "; 772 chomp($user = <STDIN>); 773 print "Files: "; 774 chomp($pattern = <STDIN>); 775 776 ($login,$pass,$uid,$gid) = getpwnam($user) 777 or die "$user not in passwd file"; 778 779 @ary = glob($pattern); # expand filenames 780 chown $uid, $gid, @ary; 781 782On most systems, you are not allowed to change the ownership of the 783file unless you're the superuser, although you should be able to change 784the group to any of your secondary groups. On insecure systems, these 785restrictions may be relaxed, but this is not a portable assumption. 786On POSIX systems, you can detect this condition this way: 787 788 use POSIX qw(sysconf _PC_CHOWN_RESTRICTED); 789 $can_chown_giveaway = not sysconf(_PC_CHOWN_RESTRICTED); 790 791=item chr NUMBER 792X<chr> X<character> X<ASCII> X<Unicode> 793 794=item chr 795 796Returns the character represented by that NUMBER in the character set. 797For example, C<chr(65)> is C<"A"> in either ASCII or Unicode, and 798chr(0x263a) is a Unicode smiley face. Note that characters from 128 799to 255 (inclusive) are by default not encoded in UTF-8 Unicode for 800backward compatibility reasons (but see L<encoding>). 801 802If NUMBER is omitted, uses C<$_>. 803 804For the reverse, use L</ord>. 805 806Note that under the C<bytes> pragma the NUMBER is masked to 807the low eight bits. 808 809See L<perlunicode> and L<encoding> for more about Unicode. 810 811=item chroot FILENAME 812X<chroot> X<root> 813 814=item chroot 815 816This function works like the system call by the same name: it makes the 817named directory the new root directory for all further pathnames that 818begin with a C</> by your process and all its children. (It doesn't 819change your current working directory, which is unaffected.) For security 820reasons, this call is restricted to the superuser. If FILENAME is 821omitted, does a C<chroot> to C<$_>. 822 823=item close FILEHANDLE 824X<close> 825 826=item close 827 828Closes the file or pipe associated with the file handle, returning 829true only if IO buffers are successfully flushed and closes the system 830file descriptor. Closes the currently selected filehandle if the 831argument is omitted. 832 833You don't have to close FILEHANDLE if you are immediately going to do 834another C<open> on it, because C<open> will close it for you. (See 835C<open>.) However, an explicit C<close> on an input file resets the line 836counter (C<$.>), while the implicit close done by C<open> does not. 837 838If the file handle came from a piped open, C<close> will additionally 839return false if one of the other system calls involved fails, or if the 840program exits with non-zero status. (If the only problem was that the 841program exited non-zero, C<$!> will be set to C<0>.) Closing a pipe 842also waits for the process executing on the pipe to complete, in case you 843want to look at the output of the pipe afterwards, and 844implicitly puts the exit status value of that command into C<$?>. 845 846Prematurely closing the read end of a pipe (i.e. before the process 847writing to it at the other end has closed it) will result in a 848SIGPIPE being delivered to the writer. If the other end can't 849handle that, be sure to read all the data before closing the pipe. 850 851Example: 852 853 open(OUTPUT, '|sort >foo') # pipe to sort 854 or die "Can't start sort: $!"; 855 #... # print stuff to output 856 close OUTPUT # wait for sort to finish 857 or warn $! ? "Error closing sort pipe: $!" 858 : "Exit status $? from sort"; 859 open(INPUT, 'foo') # get sort's results 860 or die "Can't open 'foo' for input: $!"; 861 862FILEHANDLE may be an expression whose value can be used as an indirect 863filehandle, usually the real filehandle name. 864 865=item closedir DIRHANDLE 866X<closedir> 867 868Closes a directory opened by C<opendir> and returns the success of that 869system call. 870 871=item connect SOCKET,NAME 872X<connect> 873 874Attempts to connect to a remote socket, just as the connect system call 875does. Returns true if it succeeded, false otherwise. NAME should be a 876packed address of the appropriate type for the socket. See the examples in 877L<perlipc/"Sockets: Client/Server Communication">. 878 879=item continue BLOCK 880X<continue> 881 882C<continue> is actually a flow control statement rather than a function. If 883there is a C<continue> BLOCK attached to a BLOCK (typically in a C<while> or 884C<foreach>), it is always executed just before the conditional is about to 885be evaluated again, just like the third part of a C<for> loop in C. Thus 886it can be used to increment a loop variable, even when the loop has been 887continued via the C<next> statement (which is similar to the C C<continue> 888statement). 889 890C<last>, C<next>, or C<redo> may appear within a C<continue> 891block. C<last> and C<redo> will behave as if they had been executed within 892the main block. So will C<next>, but since it will execute a C<continue> 893block, it may be more entertaining. 894 895 while (EXPR) { 896 ### redo always comes here 897 do_something; 898 } continue { 899 ### next always comes here 900 do_something_else; 901 # then back the top to re-check EXPR 902 } 903 ### last always comes here 904 905Omitting the C<continue> section is semantically equivalent to using an 906empty one, logically enough. In that case, C<next> goes directly back 907to check the condition at the top of the loop. 908 909=item cos EXPR 910X<cos> X<cosine> X<acos> X<arccosine> 911 912=item cos 913 914Returns the cosine of EXPR (expressed in radians). If EXPR is omitted, 915takes cosine of C<$_>. 916 917For the inverse cosine operation, you may use the C<Math::Trig::acos()> 918function, or use this relation: 919 920 sub acos { atan2( sqrt(1 - $_[0] * $_[0]), $_[0] ) } 921 922=item crypt PLAINTEXT,SALT 923X<crypt> X<digest> X<hash> X<salt> X<plaintext> X<password> 924X<decrypt> X<cryptography> X<passwd> 925 926Creates a digest string exactly like the crypt(3) function in the C 927library (assuming that you actually have a version there that has not 928been extirpated as a potential munitions). 929 930crypt() is a one-way hash function. The PLAINTEXT and SALT is turned 931into a short string, called a digest, which is returned. The same 932PLAINTEXT and SALT will always return the same string, but there is no 933(known) way to get the original PLAINTEXT from the hash. Small 934changes in the PLAINTEXT or SALT will result in large changes in the 935digest. 936 937There is no decrypt function. This function isn't all that useful for 938cryptography (for that, look for F<Crypt> modules on your nearby CPAN 939mirror) and the name "crypt" is a bit of a misnomer. Instead it is 940primarily used to check if two pieces of text are the same without 941having to transmit or store the text itself. An example is checking 942if a correct password is given. The digest of the password is stored, 943not the password itself. The user types in a password that is 944crypt()'d with the same salt as the stored digest. If the two digests 945match the password is correct. 946 947When verifying an existing digest string you should use the digest as 948the salt (like C<crypt($plain, $digest) eq $digest>). The SALT used 949to create the digest is visible as part of the digest. This ensures 950crypt() will hash the new string with the same salt as the digest. 951This allows your code to work with the standard L<crypt|/crypt> and 952with more exotic implementations. In other words, do not assume 953anything about the returned string itself, or how many bytes in the 954digest matter. 955 956Traditionally the result is a string of 13 bytes: two first bytes of 957the salt, followed by 11 bytes from the set C<[./0-9A-Za-z]>, and only 958the first eight bytes of the digest string mattered, but alternative 959hashing schemes (like MD5), higher level security schemes (like C2), 960and implementations on non-UNIX platforms may produce different 961strings. 962 963When choosing a new salt create a random two character string whose 964characters come from the set C<[./0-9A-Za-z]> (like C<join '', ('.', 965'/', 0..9, 'A'..'Z', 'a'..'z')[rand 64, rand 64]>). This set of 966characters is just a recommendation; the characters allowed in 967the salt depend solely on your system's crypt library, and Perl can't 968restrict what salts C<crypt()> accepts. 969 970Here's an example that makes sure that whoever runs this program knows 971their password: 972 973 $pwd = (getpwuid($<))[1]; 974 975 system "stty -echo"; 976 print "Password: "; 977 chomp($word = <STDIN>); 978 print "\n"; 979 system "stty echo"; 980 981 if (crypt($word, $pwd) ne $pwd) { 982 die "Sorry...\n"; 983 } else { 984 print "ok\n"; 985 } 986 987Of course, typing in your own password to whoever asks you 988for it is unwise. 989 990The L<crypt|/crypt> function is unsuitable for hashing large quantities 991of data, not least of all because you can't get the information 992back. Look at the L<Digest> module for more robust algorithms. 993 994If using crypt() on a Unicode string (which I<potentially> has 995characters with codepoints above 255), Perl tries to make sense 996of the situation by trying to downgrade (a copy of the string) 997the string back to an eight-bit byte string before calling crypt() 998(on that copy). If that works, good. If not, crypt() dies with 999C<Wide character in crypt>. 1000 1001=item dbmclose HASH 1002X<dbmclose> 1003 1004[This function has been largely superseded by the C<untie> function.] 1005 1006Breaks the binding between a DBM file and a hash. 1007 1008=item dbmopen HASH,DBNAME,MASK 1009X<dbmopen> X<dbm> X<ndbm> X<sdbm> X<gdbm> 1010 1011[This function has been largely superseded by the C<tie> function.] 1012 1013This binds a dbm(3), ndbm(3), sdbm(3), gdbm(3), or Berkeley DB file to a 1014hash. HASH is the name of the hash. (Unlike normal C<open>, the first 1015argument is I<not> a filehandle, even though it looks like one). DBNAME 1016is the name of the database (without the F<.dir> or F<.pag> extension if 1017any). If the database does not exist, it is created with protection 1018specified by MASK (as modified by the C<umask>). If your system supports 1019only the older DBM functions, you may perform only one C<dbmopen> in your 1020program. In older versions of Perl, if your system had neither DBM nor 1021ndbm, calling C<dbmopen> produced a fatal error; it now falls back to 1022sdbm(3). 1023 1024If you don't have write access to the DBM file, you can only read hash 1025variables, not set them. If you want to test whether you can write, 1026either use file tests or try setting a dummy hash entry inside an C<eval>, 1027which will trap the error. 1028 1029Note that functions such as C<keys> and C<values> may return huge lists 1030when used on large DBM files. You may prefer to use the C<each> 1031function to iterate over large DBM files. Example: 1032 1033 # print out history file offsets 1034 dbmopen(%HIST,'/usr/lib/news/history',0666); 1035 while (($key,$val) = each %HIST) { 1036 print $key, ' = ', unpack('L',$val), "\n"; 1037 } 1038 dbmclose(%HIST); 1039 1040See also L<AnyDBM_File> for a more general description of the pros and 1041cons of the various dbm approaches, as well as L<DB_File> for a particularly 1042rich implementation. 1043 1044You can control which DBM library you use by loading that library 1045before you call dbmopen(): 1046 1047 use DB_File; 1048 dbmopen(%NS_Hist, "$ENV{HOME}/.netscape/history.db") 1049 or die "Can't open netscape history file: $!"; 1050 1051=item defined EXPR 1052X<defined> X<undef> X<undefined> 1053 1054=item defined 1055 1056Returns a Boolean value telling whether EXPR has a value other than 1057the undefined value C<undef>. If EXPR is not present, C<$_> will be 1058checked. 1059 1060Many operations return C<undef> to indicate failure, end of file, 1061system error, uninitialized variable, and other exceptional 1062conditions. This function allows you to distinguish C<undef> from 1063other values. (A simple Boolean test will not distinguish among 1064C<undef>, zero, the empty string, and C<"0">, which are all equally 1065false.) Note that since C<undef> is a valid scalar, its presence 1066doesn't I<necessarily> indicate an exceptional condition: C<pop> 1067returns C<undef> when its argument is an empty array, I<or> when the 1068element to return happens to be C<undef>. 1069 1070You may also use C<defined(&func)> to check whether subroutine C<&func> 1071has ever been defined. The return value is unaffected by any forward 1072declarations of C<&func>. Note that a subroutine which is not defined 1073may still be callable: its package may have an C<AUTOLOAD> method that 1074makes it spring into existence the first time that it is called -- see 1075L<perlsub>. 1076 1077Use of C<defined> on aggregates (hashes and arrays) is deprecated. It 1078used to report whether memory for that aggregate has ever been 1079allocated. This behavior may disappear in future versions of Perl. 1080You should instead use a simple test for size: 1081 1082 if (@an_array) { print "has array elements\n" } 1083 if (%a_hash) { print "has hash members\n" } 1084 1085When used on a hash element, it tells you whether the value is defined, 1086not whether the key exists in the hash. Use L</exists> for the latter 1087purpose. 1088 1089Examples: 1090 1091 print if defined $switch{'D'}; 1092 print "$val\n" while defined($val = pop(@ary)); 1093 die "Can't readlink $sym: $!" 1094 unless defined($value = readlink $sym); 1095 sub foo { defined &$bar ? &$bar(@_) : die "No bar"; } 1096 $debugging = 0 unless defined $debugging; 1097 1098Note: Many folks tend to overuse C<defined>, and then are surprised to 1099discover that the number C<0> and C<""> (the zero-length string) are, in fact, 1100defined values. For example, if you say 1101 1102 "ab" =~ /a(.*)b/; 1103 1104The pattern match succeeds, and C<$1> is defined, despite the fact that it 1105matched "nothing". It didn't really fail to match anything. Rather, it 1106matched something that happened to be zero characters long. This is all 1107very above-board and honest. When a function returns an undefined value, 1108it's an admission that it couldn't give you an honest answer. So you 1109should use C<defined> only when you're questioning the integrity of what 1110you're trying to do. At other times, a simple comparison to C<0> or C<""> is 1111what you want. 1112 1113See also L</undef>, L</exists>, L</ref>. 1114 1115=item delete EXPR 1116X<delete> 1117 1118Given an expression that specifies a hash element, array element, hash slice, 1119or array slice, deletes the specified element(s) from the hash or array. 1120In the case of an array, if the array elements happen to be at the end, 1121the size of the array will shrink to the highest element that tests 1122true for exists() (or 0 if no such element exists). 1123 1124Returns a list with the same number of elements as the number of elements 1125for which deletion was attempted. Each element of that list consists of 1126either the value of the element deleted, or the undefined value. In scalar 1127context, this means that you get the value of the last element deleted (or 1128the undefined value if that element did not exist). 1129 1130 %hash = (foo => 11, bar => 22, baz => 33); 1131 $scalar = delete $hash{foo}; # $scalar is 11 1132 $scalar = delete @hash{qw(foo bar)}; # $scalar is 22 1133 @array = delete @hash{qw(foo bar baz)}; # @array is (undef,undef,33) 1134 1135Deleting from C<%ENV> modifies the environment. Deleting from 1136a hash tied to a DBM file deletes the entry from the DBM file. Deleting 1137from a C<tie>d hash or array may not necessarily return anything. 1138 1139Deleting an array element effectively returns that position of the array 1140to its initial, uninitialized state. Subsequently testing for the same 1141element with exists() will return false. Also, deleting array elements 1142in the middle of an array will not shift the index of the elements 1143after them down. Use splice() for that. See L</exists>. 1144 1145The following (inefficiently) deletes all the values of %HASH and @ARRAY: 1146 1147 foreach $key (keys %HASH) { 1148 delete $HASH{$key}; 1149 } 1150 1151 foreach $index (0 .. $#ARRAY) { 1152 delete $ARRAY[$index]; 1153 } 1154 1155And so do these: 1156 1157 delete @HASH{keys %HASH}; 1158 1159 delete @ARRAY[0 .. $#ARRAY]; 1160 1161But both of these are slower than just assigning the empty list 1162or undefining %HASH or @ARRAY: 1163 1164 %HASH = (); # completely empty %HASH 1165 undef %HASH; # forget %HASH ever existed 1166 1167 @ARRAY = (); # completely empty @ARRAY 1168 undef @ARRAY; # forget @ARRAY ever existed 1169 1170Note that the EXPR can be arbitrarily complicated as long as the final 1171operation is a hash element, array element, hash slice, or array slice 1172lookup: 1173 1174 delete $ref->[$x][$y]{$key}; 1175 delete @{$ref->[$x][$y]}{$key1, $key2, @morekeys}; 1176 1177 delete $ref->[$x][$y][$index]; 1178 delete @{$ref->[$x][$y]}[$index1, $index2, @moreindices]; 1179 1180=item die LIST 1181X<die> X<throw> X<exception> X<raise> X<$@> X<abort> 1182 1183Outside an C<eval>, prints the value of LIST to C<STDERR> and 1184exits with the current value of C<$!> (errno). If C<$!> is C<0>, 1185exits with the value of C<<< ($? >> 8) >>> (backtick `command` 1186status). If C<<< ($? >> 8) >>> is C<0>, exits with C<255>. Inside 1187an C<eval(),> the error message is stuffed into C<$@> and the 1188C<eval> is terminated with the undefined value. This makes 1189C<die> the way to raise an exception. 1190 1191Equivalent examples: 1192 1193 die "Can't cd to spool: $!\n" unless chdir '/usr/spool/news'; 1194 chdir '/usr/spool/news' or die "Can't cd to spool: $!\n" 1195 1196If the last element of LIST does not end in a newline, the current 1197script line number and input line number (if any) are also printed, 1198and a newline is supplied. Note that the "input line number" (also 1199known as "chunk") is subject to whatever notion of "line" happens to 1200be currently in effect, and is also available as the special variable 1201C<$.>. See L<perlvar/"$/"> and L<perlvar/"$.">. 1202 1203Hint: sometimes appending C<", stopped"> to your message will cause it 1204to make better sense when the string C<"at foo line 123"> is appended. 1205Suppose you are running script "canasta". 1206 1207 die "/etc/games is no good"; 1208 die "/etc/games is no good, stopped"; 1209 1210produce, respectively 1211 1212 /etc/games is no good at canasta line 123. 1213 /etc/games is no good, stopped at canasta line 123. 1214 1215See also exit(), warn(), and the Carp module. 1216 1217If LIST is empty and C<$@> already contains a value (typically from a 1218previous eval) that value is reused after appending C<"\t...propagated">. 1219This is useful for propagating exceptions: 1220 1221 eval { ... }; 1222 die unless $@ =~ /Expected exception/; 1223 1224If LIST is empty and C<$@> contains an object reference that has a 1225C<PROPAGATE> method, that method will be called with additional file 1226and line number parameters. The return value replaces the value in 1227C<$@>. i.e. as if C<< $@ = eval { $@->PROPAGATE(__FILE__, __LINE__) }; >> 1228were called. 1229 1230If C<$@> is empty then the string C<"Died"> is used. 1231 1232die() can also be called with a reference argument. If this happens to be 1233trapped within an eval(), $@ contains the reference. This behavior permits 1234a more elaborate exception handling implementation using objects that 1235maintain arbitrary state about the nature of the exception. Such a scheme 1236is sometimes preferable to matching particular string values of $@ using 1237regular expressions. Here's an example: 1238 1239 use Scalar::Util 'blessed'; 1240 1241 eval { ... ; die Some::Module::Exception->new( FOO => "bar" ) }; 1242 if ($@) { 1243 if (blessed($@) && $@->isa("Some::Module::Exception")) { 1244 # handle Some::Module::Exception 1245 } 1246 else { 1247 # handle all other possible exceptions 1248 } 1249 } 1250 1251Because perl will stringify uncaught exception messages before displaying 1252them, you may want to overload stringification operations on such custom 1253exception objects. See L<overload> for details about that. 1254 1255You can arrange for a callback to be run just before the C<die> 1256does its deed, by setting the C<$SIG{__DIE__}> hook. The associated 1257handler will be called with the error text and can change the error 1258message, if it sees fit, by calling C<die> again. See 1259L<perlvar/$SIG{expr}> for details on setting C<%SIG> entries, and 1260L<"eval BLOCK"> for some examples. Although this feature was 1261to be run only right before your program was to exit, this is not 1262currently the case--the C<$SIG{__DIE__}> hook is currently called 1263even inside eval()ed blocks/strings! If one wants the hook to do 1264nothing in such situations, put 1265 1266 die @_ if $^S; 1267 1268as the first line of the handler (see L<perlvar/$^S>). Because 1269this promotes strange action at a distance, this counterintuitive 1270behavior may be fixed in a future release. 1271 1272=item do BLOCK 1273X<do> X<block> 1274 1275Not really a function. Returns the value of the last command in the 1276sequence of commands indicated by BLOCK. When modified by the C<while> or 1277C<until> loop modifier, executes the BLOCK once before testing the loop 1278condition. (On other statements the loop modifiers test the conditional 1279first.) 1280 1281C<do BLOCK> does I<not> count as a loop, so the loop control statements 1282C<next>, C<last>, or C<redo> cannot be used to leave or restart the block. 1283See L<perlsyn> for alternative strategies. 1284 1285=item do SUBROUTINE(LIST) 1286X<do> 1287 1288This form of subroutine call is deprecated. See L<perlsub>. 1289 1290=item do EXPR 1291X<do> 1292 1293Uses the value of EXPR as a filename and executes the contents of the 1294file as a Perl script. 1295 1296 do 'stat.pl'; 1297 1298is just like 1299 1300 eval `cat stat.pl`; 1301 1302except that it's more efficient and concise, keeps track of the current 1303filename for error messages, searches the @INC directories, and updates 1304C<%INC> if the file is found. See L<perlvar/Predefined Names> for these 1305variables. It also differs in that code evaluated with C<do FILENAME> 1306cannot see lexicals in the enclosing scope; C<eval STRING> does. It's the 1307same, however, in that it does reparse the file every time you call it, 1308so you probably don't want to do this inside a loop. 1309 1310If C<do> cannot read the file, it returns undef and sets C<$!> to the 1311error. If C<do> can read the file but cannot compile it, it 1312returns undef and sets an error message in C<$@>. If the file is 1313successfully compiled, C<do> returns the value of the last expression 1314evaluated. 1315 1316Note that inclusion of library modules is better done with the 1317C<use> and C<require> operators, which also do automatic error checking 1318and raise an exception if there's a problem. 1319 1320You might like to use C<do> to read in a program configuration 1321file. Manual error checking can be done this way: 1322 1323 # read in config files: system first, then user 1324 for $file ("/share/prog/defaults.rc", 1325 "$ENV{HOME}/.someprogrc") 1326 { 1327 unless ($return = do $file) { 1328 warn "couldn't parse $file: $@" if $@; 1329 warn "couldn't do $file: $!" unless defined $return; 1330 warn "couldn't run $file" unless $return; 1331 } 1332 } 1333 1334=item dump LABEL 1335X<dump> X<core> X<undump> 1336 1337=item dump 1338 1339This function causes an immediate core dump. See also the B<-u> 1340command-line switch in L<perlrun>, which does the same thing. 1341Primarily this is so that you can use the B<undump> program (not 1342supplied) to turn your core dump into an executable binary after 1343having initialized all your variables at the beginning of the 1344program. When the new binary is executed it will begin by executing 1345a C<goto LABEL> (with all the restrictions that C<goto> suffers). 1346Think of it as a goto with an intervening core dump and reincarnation. 1347If C<LABEL> is omitted, restarts the program from the top. 1348 1349B<WARNING>: Any files opened at the time of the dump will I<not> 1350be open any more when the program is reincarnated, with possible 1351resulting confusion on the part of Perl. 1352 1353This function is now largely obsolete, partly because it's very 1354hard to convert a core file into an executable, and because the 1355real compiler backends for generating portable bytecode and compilable 1356C code have superseded it. That's why you should now invoke it as 1357C<CORE::dump()>, if you don't want to be warned against a possible 1358typo. 1359 1360If you're looking to use L<dump> to speed up your program, consider 1361generating bytecode or native C code as described in L<perlcc>. If 1362you're just trying to accelerate a CGI script, consider using the 1363C<mod_perl> extension to B<Apache>, or the CPAN module, CGI::Fast. 1364You might also consider autoloading or selfloading, which at least 1365make your program I<appear> to run faster. 1366 1367=item each HASH 1368X<each> X<hash, iterator> 1369 1370When called in list context, returns a 2-element list consisting of the 1371key and value for the next element of a hash, so that you can iterate over 1372it. When called in scalar context, returns only the key for the next 1373element in the hash. 1374 1375Entries are returned in an apparently random order. The actual random 1376order is subject to change in future versions of perl, but it is 1377guaranteed to be in the same order as either the C<keys> or C<values> 1378function would produce on the same (unmodified) hash. Since Perl 13795.8.1 the ordering is different even between different runs of Perl 1380for security reasons (see L<perlsec/"Algorithmic Complexity Attacks">). 1381 1382When the hash is entirely read, a null array is returned in list context 1383(which when assigned produces a false (C<0>) value), and C<undef> in 1384scalar context. The next call to C<each> after that will start iterating 1385again. There is a single iterator for each hash, shared by all C<each>, 1386C<keys>, and C<values> function calls in the program; it can be reset by 1387reading all the elements from the hash, or by evaluating C<keys HASH> or 1388C<values HASH>. If you add or delete elements of a hash while you're 1389iterating over it, you may get entries skipped or duplicated, so 1390don't. Exception: It is always safe to delete the item most recently 1391returned by C<each()>, which means that the following code will work: 1392 1393 while (($key, $value) = each %hash) { 1394 print $key, "\n"; 1395 delete $hash{$key}; # This is safe 1396 } 1397 1398The following prints out your environment like the printenv(1) program, 1399only in a different order: 1400 1401 while (($key,$value) = each %ENV) { 1402 print "$key=$value\n"; 1403 } 1404 1405See also C<keys>, C<values> and C<sort>. 1406 1407=item eof FILEHANDLE 1408X<eof> 1409X<end of file> 1410X<end-of-file> 1411 1412=item eof () 1413 1414=item eof 1415 1416Returns 1 if the next read on FILEHANDLE will return end of file, or if 1417FILEHANDLE is not open. FILEHANDLE may be an expression whose value 1418gives the real filehandle. (Note that this function actually 1419reads a character and then C<ungetc>s it, so isn't very useful in an 1420interactive context.) Do not read from a terminal file (or call 1421C<eof(FILEHANDLE)> on it) after end-of-file is reached. File types such 1422as terminals may lose the end-of-file condition if you do. 1423 1424An C<eof> without an argument uses the last file read. Using C<eof()> 1425with empty parentheses is very different. It refers to the pseudo file 1426formed from the files listed on the command line and accessed via the 1427C<< <> >> operator. Since C<< <> >> isn't explicitly opened, 1428as a normal filehandle is, an C<eof()> before C<< <> >> has been 1429used will cause C<@ARGV> to be examined to determine if input is 1430available. Similarly, an C<eof()> after C<< <> >> has returned 1431end-of-file will assume you are processing another C<@ARGV> list, 1432and if you haven't set C<@ARGV>, will read input from C<STDIN>; 1433see L<perlop/"I/O Operators">. 1434 1435In a C<< while (<>) >> loop, C<eof> or C<eof(ARGV)> can be used to 1436detect the end of each file, C<eof()> will only detect the end of the 1437last file. Examples: 1438 1439 # reset line numbering on each input file 1440 while (<>) { 1441 next if /^\s*#/; # skip comments 1442 print "$.\t$_"; 1443 } continue { 1444 close ARGV if eof; # Not eof()! 1445 } 1446 1447 # insert dashes just before last line of last file 1448 while (<>) { 1449 if (eof()) { # check for end of last file 1450 print "--------------\n"; 1451 } 1452 print; 1453 last if eof(); # needed if we're reading from a terminal 1454 } 1455 1456Practical hint: you almost never need to use C<eof> in Perl, because the 1457input operators typically return C<undef> when they run out of data, or if 1458there was an error. 1459 1460=item eval EXPR 1461X<eval> X<try> X<catch> X<evaluate> X<parse> X<execute> 1462 1463=item eval BLOCK 1464 1465=item eval 1466 1467In the first form, the return value of EXPR is parsed and executed as if it 1468were a little Perl program. The value of the expression (which is itself 1469determined within scalar context) is first parsed, and if there weren't any 1470errors, executed in the lexical context of the current Perl program, so 1471that any variable settings or subroutine and format definitions remain 1472afterwards. Note that the value is parsed every time the C<eval> executes. 1473If EXPR is omitted, evaluates C<$_>. This form is typically used to 1474delay parsing and subsequent execution of the text of EXPR until run time. 1475 1476In the second form, the code within the BLOCK is parsed only once--at the 1477same time the code surrounding the C<eval> itself was parsed--and executed 1478within the context of the current Perl program. This form is typically 1479used to trap exceptions more efficiently than the first (see below), while 1480also providing the benefit of checking the code within BLOCK at compile 1481time. 1482 1483The final semicolon, if any, may be omitted from the value of EXPR or within 1484the BLOCK. 1485 1486In both forms, the value returned is the value of the last expression 1487evaluated inside the mini-program; a return statement may be also used, just 1488as with subroutines. The expression providing the return value is evaluated 1489in void, scalar, or list context, depending on the context of the C<eval> 1490itself. See L</wantarray> for more on how the evaluation context can be 1491determined. 1492 1493If there is a syntax error or runtime error, or a C<die> statement is 1494executed, an undefined value is returned by C<eval>, and C<$@> is set to the 1495error message. If there was no error, C<$@> is guaranteed to be a null 1496string. Beware that using C<eval> neither silences perl from printing 1497warnings to STDERR, nor does it stuff the text of warning messages into C<$@>. 1498To do either of those, you have to use the C<$SIG{__WARN__}> facility, or 1499turn off warnings inside the BLOCK or EXPR using S<C<no warnings 'all'>>. 1500See L</warn>, L<perlvar>, L<warnings> and L<perllexwarn>. 1501 1502Note that, because C<eval> traps otherwise-fatal errors, it is useful for 1503determining whether a particular feature (such as C<socket> or C<symlink>) 1504is implemented. It is also Perl's exception trapping mechanism, where 1505the die operator is used to raise exceptions. 1506 1507If the code to be executed doesn't vary, you may use the eval-BLOCK 1508form to trap run-time errors without incurring the penalty of 1509recompiling each time. The error, if any, is still returned in C<$@>. 1510Examples: 1511 1512 # make divide-by-zero nonfatal 1513 eval { $answer = $a / $b; }; warn $@ if $@; 1514 1515 # same thing, but less efficient 1516 eval '$answer = $a / $b'; warn $@ if $@; 1517 1518 # a compile-time error 1519 eval { $answer = }; # WRONG 1520 1521 # a run-time error 1522 eval '$answer ='; # sets $@ 1523 1524Using the C<eval{}> form as an exception trap in libraries does have some 1525issues. Due to the current arguably broken state of C<__DIE__> hooks, you 1526may wish not to trigger any C<__DIE__> hooks that user code may have installed. 1527You can use the C<local $SIG{__DIE__}> construct for this purpose, 1528as shown in this example: 1529 1530 # a very private exception trap for divide-by-zero 1531 eval { local $SIG{'__DIE__'}; $answer = $a / $b; }; 1532 warn $@ if $@; 1533 1534This is especially significant, given that C<__DIE__> hooks can call 1535C<die> again, which has the effect of changing their error messages: 1536 1537 # __DIE__ hooks may modify error messages 1538 { 1539 local $SIG{'__DIE__'} = 1540 sub { (my $x = $_[0]) =~ s/foo/bar/g; die $x }; 1541 eval { die "foo lives here" }; 1542 print $@ if $@; # prints "bar lives here" 1543 } 1544 1545Because this promotes action at a distance, this counterintuitive behavior 1546may be fixed in a future release. 1547 1548With an C<eval>, you should be especially careful to remember what's 1549being looked at when: 1550 1551 eval $x; # CASE 1 1552 eval "$x"; # CASE 2 1553 1554 eval '$x'; # CASE 3 1555 eval { $x }; # CASE 4 1556 1557 eval "\$$x++"; # CASE 5 1558 $$x++; # CASE 6 1559 1560Cases 1 and 2 above behave identically: they run the code contained in 1561the variable $x. (Although case 2 has misleading double quotes making 1562the reader wonder what else might be happening (nothing is).) Cases 3 1563and 4 likewise behave in the same way: they run the code C<'$x'>, which 1564does nothing but return the value of $x. (Case 4 is preferred for 1565purely visual reasons, but it also has the advantage of compiling at 1566compile-time instead of at run-time.) Case 5 is a place where 1567normally you I<would> like to use double quotes, except that in this 1568particular situation, you can just use symbolic references instead, as 1569in case 6. 1570 1571C<eval BLOCK> does I<not> count as a loop, so the loop control statements 1572C<next>, C<last>, or C<redo> cannot be used to leave or restart the block. 1573 1574Note that as a very special case, an C<eval ''> executed within the C<DB> 1575package doesn't see the usual surrounding lexical scope, but rather the 1576scope of the first non-DB piece of code that called it. You don't normally 1577need to worry about this unless you are writing a Perl debugger. 1578 1579=item exec LIST 1580X<exec> X<execute> 1581 1582=item exec PROGRAM LIST 1583 1584The C<exec> function executes a system command I<and never returns>-- 1585use C<system> instead of C<exec> if you want it to return. It fails and 1586returns false only if the command does not exist I<and> it is executed 1587directly instead of via your system's command shell (see below). 1588 1589Since it's a common mistake to use C<exec> instead of C<system>, Perl 1590warns you if there is a following statement which isn't C<die>, C<warn>, 1591or C<exit> (if C<-w> is set - but you always do that). If you 1592I<really> want to follow an C<exec> with some other statement, you 1593can use one of these styles to avoid the warning: 1594 1595 exec ('foo') or print STDERR "couldn't exec foo: $!"; 1596 { exec ('foo') }; print STDERR "couldn't exec foo: $!"; 1597 1598If there is more than one argument in LIST, or if LIST is an array 1599with more than one value, calls execvp(3) with the arguments in LIST. 1600If there is only one scalar argument or an array with one element in it, 1601the argument is checked for shell metacharacters, and if there are any, 1602the entire argument is passed to the system's command shell for parsing 1603(this is C</bin/sh -c> on Unix platforms, but varies on other platforms). 1604If there are no shell metacharacters in the argument, it is split into 1605words and passed directly to C<execvp>, which is more efficient. 1606Examples: 1607 1608 exec '/bin/echo', 'Your arguments are: ', @ARGV; 1609 exec "sort $outfile | uniq"; 1610 1611If you don't really want to execute the first argument, but want to lie 1612to the program you are executing about its own name, you can specify 1613the program you actually want to run as an "indirect object" (without a 1614comma) in front of the LIST. (This always forces interpretation of the 1615LIST as a multivalued list, even if there is only a single scalar in 1616the list.) Example: 1617 1618 $shell = '/bin/csh'; 1619 exec $shell '-sh'; # pretend it's a login shell 1620 1621or, more directly, 1622 1623 exec {'/bin/csh'} '-sh'; # pretend it's a login shell 1624 1625When the arguments get executed via the system shell, results will 1626be subject to its quirks and capabilities. See L<perlop/"`STRING`"> 1627for details. 1628 1629Using an indirect object with C<exec> or C<system> is also more 1630secure. This usage (which also works fine with system()) forces 1631interpretation of the arguments as a multivalued list, even if the 1632list had just one argument. That way you're safe from the shell 1633expanding wildcards or splitting up words with whitespace in them. 1634 1635 @args = ( "echo surprise" ); 1636 1637 exec @args; # subject to shell escapes 1638 # if @args == 1 1639 exec { $args[0] } @args; # safe even with one-arg list 1640 1641The first version, the one without the indirect object, ran the I<echo> 1642program, passing it C<"surprise"> an argument. The second version 1643didn't--it tried to run a program literally called I<"echo surprise">, 1644didn't find it, and set C<$?> to a non-zero value indicating failure. 1645 1646Beginning with v5.6.0, Perl will attempt to flush all files opened for 1647output before the exec, but this may not be supported on some platforms 1648(see L<perlport>). To be safe, you may need to set C<$|> ($AUTOFLUSH 1649in English) or call the C<autoflush()> method of C<IO::Handle> on any 1650open handles in order to avoid lost output. 1651 1652Note that C<exec> will not call your C<END> blocks, nor will it call 1653any C<DESTROY> methods in your objects. 1654 1655=item exists EXPR 1656X<exists> X<autovivification> 1657 1658Given an expression that specifies a hash element or array element, 1659returns true if the specified element in the hash or array has ever 1660been initialized, even if the corresponding value is undefined. The 1661element is not autovivified if it doesn't exist. 1662 1663 print "Exists\n" if exists $hash{$key}; 1664 print "Defined\n" if defined $hash{$key}; 1665 print "True\n" if $hash{$key}; 1666 1667 print "Exists\n" if exists $array[$index]; 1668 print "Defined\n" if defined $array[$index]; 1669 print "True\n" if $array[$index]; 1670 1671A hash or array element can be true only if it's defined, and defined if 1672it exists, but the reverse doesn't necessarily hold true. 1673 1674Given an expression that specifies the name of a subroutine, 1675returns true if the specified subroutine has ever been declared, even 1676if it is undefined. Mentioning a subroutine name for exists or defined 1677does not count as declaring it. Note that a subroutine which does not 1678exist may still be callable: its package may have an C<AUTOLOAD> 1679method that makes it spring into existence the first time that it is 1680called -- see L<perlsub>. 1681 1682 print "Exists\n" if exists &subroutine; 1683 print "Defined\n" if defined &subroutine; 1684 1685Note that the EXPR can be arbitrarily complicated as long as the final 1686operation is a hash or array key lookup or subroutine name: 1687 1688 if (exists $ref->{A}->{B}->{$key}) { } 1689 if (exists $hash{A}{B}{$key}) { } 1690 1691 if (exists $ref->{A}->{B}->[$ix]) { } 1692 if (exists $hash{A}{B}[$ix]) { } 1693 1694 if (exists &{$ref->{A}{B}{$key}}) { } 1695 1696Although the deepest nested array or hash will not spring into existence 1697just because its existence was tested, any intervening ones will. 1698Thus C<< $ref->{"A"} >> and C<< $ref->{"A"}->{"B"} >> will spring 1699into existence due to the existence test for the $key element above. 1700This happens anywhere the arrow operator is used, including even: 1701 1702 undef $ref; 1703 if (exists $ref->{"Some key"}) { } 1704 print $ref; # prints HASH(0x80d3d5c) 1705 1706This surprising autovivification in what does not at first--or even 1707second--glance appear to be an lvalue context may be fixed in a future 1708release. 1709 1710See L<perlref/"Pseudo-hashes: Using an array as a hash"> for specifics 1711on how exists() acts when used on a pseudo-hash. 1712 1713Use of a subroutine call, rather than a subroutine name, as an argument 1714to exists() is an error. 1715 1716 exists ⊂ # OK 1717 exists &sub(); # Error 1718 1719=item exit EXPR 1720X<exit> X<terminate> X<abort> 1721 1722=item exit 1723 1724Evaluates EXPR and exits immediately with that value. Example: 1725 1726 $ans = <STDIN>; 1727 exit 0 if $ans =~ /^[Xx]/; 1728 1729See also C<die>. If EXPR is omitted, exits with C<0> status. The only 1730universally recognized values for EXPR are C<0> for success and C<1> 1731for error; other values are subject to interpretation depending on the 1732environment in which the Perl program is running. For example, exiting 173369 (EX_UNAVAILABLE) from a I<sendmail> incoming-mail filter will cause 1734the mailer to return the item undelivered, but that's not true everywhere. 1735 1736Don't use C<exit> to abort a subroutine if there's any chance that 1737someone might want to trap whatever error happened. Use C<die> instead, 1738which can be trapped by an C<eval>. 1739 1740The exit() function does not always exit immediately. It calls any 1741defined C<END> routines first, but these C<END> routines may not 1742themselves abort the exit. Likewise any object destructors that need to 1743be called are called before the real exit. If this is a problem, you 1744can call C<POSIX:_exit($status)> to avoid END and destructor processing. 1745See L<perlmod> for details. 1746 1747=item exp EXPR 1748X<exp> X<exponential> X<antilog> X<antilogarithm> X<e> 1749 1750=item exp 1751 1752Returns I<e> (the natural logarithm base) to the power of EXPR. 1753If EXPR is omitted, gives C<exp($_)>. 1754 1755=item fcntl FILEHANDLE,FUNCTION,SCALAR 1756X<fcntl> 1757 1758Implements the fcntl(2) function. You'll probably have to say 1759 1760 use Fcntl; 1761 1762first to get the correct constant definitions. Argument processing and 1763value return works just like C<ioctl> below. 1764For example: 1765 1766 use Fcntl; 1767 fcntl($filehandle, F_GETFL, $packed_return_buffer) 1768 or die "can't fcntl F_GETFL: $!"; 1769 1770You don't have to check for C<defined> on the return from C<fcntl>. 1771Like C<ioctl>, it maps a C<0> return from the system call into 1772C<"0 but true"> in Perl. This string is true in boolean context and C<0> 1773in numeric context. It is also exempt from the normal B<-w> warnings 1774on improper numeric conversions. 1775 1776Note that C<fcntl> will produce a fatal error if used on a machine that 1777doesn't implement fcntl(2). See the Fcntl module or your fcntl(2) 1778manpage to learn what functions are available on your system. 1779 1780Here's an example of setting a filehandle named C<REMOTE> to be 1781non-blocking at the system level. You'll have to negotiate C<$|> 1782on your own, though. 1783 1784 use Fcntl qw(F_GETFL F_SETFL O_NONBLOCK); 1785 1786 $flags = fcntl(REMOTE, F_GETFL, 0) 1787 or die "Can't get flags for the socket: $!\n"; 1788 1789 $flags = fcntl(REMOTE, F_SETFL, $flags | O_NONBLOCK) 1790 or die "Can't set flags for the socket: $!\n"; 1791 1792=item fileno FILEHANDLE 1793X<fileno> 1794 1795Returns the file descriptor for a filehandle, or undefined if the 1796filehandle is not open. This is mainly useful for constructing 1797bitmaps for C<select> and low-level POSIX tty-handling operations. 1798If FILEHANDLE is an expression, the value is taken as an indirect 1799filehandle, generally its name. 1800 1801You can use this to find out whether two handles refer to the 1802same underlying descriptor: 1803 1804 if (fileno(THIS) == fileno(THAT)) { 1805 print "THIS and THAT are dups\n"; 1806 } 1807 1808(Filehandles connected to memory objects via new features of C<open> may 1809return undefined even though they are open.) 1810 1811 1812=item flock FILEHANDLE,OPERATION 1813X<flock> X<lock> X<locking> 1814 1815Calls flock(2), or an emulation of it, on FILEHANDLE. Returns true 1816for success, false on failure. Produces a fatal error if used on a 1817machine that doesn't implement flock(2), fcntl(2) locking, or lockf(3). 1818C<flock> is Perl's portable file locking interface, although it locks 1819only entire files, not records. 1820 1821Two potentially non-obvious but traditional C<flock> semantics are 1822that it waits indefinitely until the lock is granted, and that its locks 1823B<merely advisory>. Such discretionary locks are more flexible, but offer 1824fewer guarantees. This means that programs that do not also use C<flock> 1825may modify files locked with C<flock>. See L<perlport>, 1826your port's specific documentation, or your system-specific local manpages 1827for details. It's best to assume traditional behavior if you're writing 1828portable programs. (But if you're not, you should as always feel perfectly 1829free to write for your own system's idiosyncrasies (sometimes called 1830"features"). Slavish adherence to portability concerns shouldn't get 1831in the way of your getting your job done.) 1832 1833OPERATION is one of LOCK_SH, LOCK_EX, or LOCK_UN, possibly combined with 1834LOCK_NB. These constants are traditionally valued 1, 2, 8 and 4, but 1835you can use the symbolic names if you import them from the Fcntl module, 1836either individually, or as a group using the ':flock' tag. LOCK_SH 1837requests a shared lock, LOCK_EX requests an exclusive lock, and LOCK_UN 1838releases a previously requested lock. If LOCK_NB is bitwise-or'ed with 1839LOCK_SH or LOCK_EX then C<flock> will return immediately rather than blocking 1840waiting for the lock (check the return status to see if you got it). 1841 1842To avoid the possibility of miscoordination, Perl now flushes FILEHANDLE 1843before locking or unlocking it. 1844 1845Note that the emulation built with lockf(3) doesn't provide shared 1846locks, and it requires that FILEHANDLE be open with write intent. These 1847are the semantics that lockf(3) implements. Most if not all systems 1848implement lockf(3) in terms of fcntl(2) locking, though, so the 1849differing semantics shouldn't bite too many people. 1850 1851Note that the fcntl(2) emulation of flock(3) requires that FILEHANDLE 1852be open with read intent to use LOCK_SH and requires that it be open 1853with write intent to use LOCK_EX. 1854 1855Note also that some versions of C<flock> cannot lock things over the 1856network; you would need to use the more system-specific C<fcntl> for 1857that. If you like you can force Perl to ignore your system's flock(2) 1858function, and so provide its own fcntl(2)-based emulation, by passing 1859the switch C<-Ud_flock> to the F<Configure> program when you configure 1860perl. 1861 1862Here's a mailbox appender for BSD systems. 1863 1864 use Fcntl ':flock'; # import LOCK_* constants 1865 1866 sub lock { 1867 flock(MBOX,LOCK_EX); 1868 # and, in case someone appended 1869 # while we were waiting... 1870 seek(MBOX, 0, 2); 1871 } 1872 1873 sub unlock { 1874 flock(MBOX,LOCK_UN); 1875 } 1876 1877 open(MBOX, ">>/usr/spool/mail/$ENV{'USER'}") 1878 or die "Can't open mailbox: $!"; 1879 1880 lock(); 1881 print MBOX $msg,"\n\n"; 1882 unlock(); 1883 1884On systems that support a real flock(), locks are inherited across fork() 1885calls, whereas those that must resort to the more capricious fcntl() 1886function lose the locks, making it harder to write servers. 1887 1888See also L<DB_File> for other flock() examples. 1889 1890=item fork 1891X<fork> X<child> X<parent> 1892 1893Does a fork(2) system call to create a new process running the 1894same program at the same point. It returns the child pid to the 1895parent process, C<0> to the child process, or C<undef> if the fork is 1896unsuccessful. File descriptors (and sometimes locks on those descriptors) 1897are shared, while everything else is copied. On most systems supporting 1898fork(), great care has gone into making it extremely efficient (for 1899example, using copy-on-write technology on data pages), making it the 1900dominant paradigm for multitasking over the last few decades. 1901 1902Beginning with v5.6.0, Perl will attempt to flush all files opened for 1903output before forking the child process, but this may not be supported 1904on some platforms (see L<perlport>). To be safe, you may need to set 1905C<$|> ($AUTOFLUSH in English) or call the C<autoflush()> method of 1906C<IO::Handle> on any open handles in order to avoid duplicate output. 1907 1908If you C<fork> without ever waiting on your children, you will 1909accumulate zombies. On some systems, you can avoid this by setting 1910C<$SIG{CHLD}> to C<"IGNORE">. See also L<perlipc> for more examples of 1911forking and reaping moribund children. 1912 1913Note that if your forked child inherits system file descriptors like 1914STDIN and STDOUT that are actually connected by a pipe or socket, even 1915if you exit, then the remote server (such as, say, a CGI script or a 1916backgrounded job launched from a remote shell) won't think you're done. 1917You should reopen those to F</dev/null> if it's any issue. 1918 1919=item format 1920X<format> 1921 1922Declare a picture format for use by the C<write> function. For 1923example: 1924 1925 format Something = 1926 Test: @<<<<<<<< @||||| @>>>>> 1927 $str, $%, '$' . int($num) 1928 . 1929 1930 $str = "widget"; 1931 $num = $cost/$quantity; 1932 $~ = 'Something'; 1933 write; 1934 1935See L<perlform> for many details and examples. 1936 1937=item formline PICTURE,LIST 1938X<formline> 1939 1940This is an internal function used by C<format>s, though you may call it, 1941too. It formats (see L<perlform>) a list of values according to the 1942contents of PICTURE, placing the output into the format output 1943accumulator, C<$^A> (or C<$ACCUMULATOR> in English). 1944Eventually, when a C<write> is done, the contents of 1945C<$^A> are written to some filehandle. You could also read C<$^A> 1946and then set C<$^A> back to C<"">. Note that a format typically 1947does one C<formline> per line of form, but the C<formline> function itself 1948doesn't care how many newlines are embedded in the PICTURE. This means 1949that the C<~> and C<~~> tokens will treat the entire PICTURE as a single line. 1950You may therefore need to use multiple formlines to implement a single 1951record format, just like the format compiler. 1952 1953Be careful if you put double quotes around the picture, because an C<@> 1954character may be taken to mean the beginning of an array name. 1955C<formline> always returns true. See L<perlform> for other examples. 1956 1957=item getc FILEHANDLE 1958X<getc> X<getchar> 1959 1960=item getc 1961 1962Returns the next character from the input file attached to FILEHANDLE, 1963or the undefined value at end of file, or if there was an error (in 1964the latter case C<$!> is set). If FILEHANDLE is omitted, reads from 1965STDIN. This is not particularly efficient. However, it cannot be 1966used by itself to fetch single characters without waiting for the user 1967to hit enter. For that, try something more like: 1968 1969 if ($BSD_STYLE) { 1970 system "stty cbreak </dev/tty >/dev/tty 2>&1"; 1971 } 1972 else { 1973 system "stty", '-icanon', 'eol', "\001"; 1974 } 1975 1976 $key = getc(STDIN); 1977 1978 if ($BSD_STYLE) { 1979 system "stty -cbreak </dev/tty >/dev/tty 2>&1"; 1980 } 1981 else { 1982 system "stty", 'icanon', 'eol', '^@'; # ASCII null 1983 } 1984 print "\n"; 1985 1986Determination of whether $BSD_STYLE should be set 1987is left as an exercise to the reader. 1988 1989The C<POSIX::getattr> function can do this more portably on 1990systems purporting POSIX compliance. See also the C<Term::ReadKey> 1991module from your nearest CPAN site; details on CPAN can be found on 1992L<perlmodlib/CPAN>. 1993 1994=item getlogin 1995X<getlogin> X<login> 1996 1997This implements the C library function of the same name, which on most 1998systems returns the current login from F</etc/utmp>, if any. If null, 1999use C<getpwuid>. 2000 2001 $login = getlogin || getpwuid($<) || "Kilroy"; 2002 2003Do not consider C<getlogin> for authentication: it is not as 2004secure as C<getpwuid>. 2005 2006=item getpeername SOCKET 2007X<getpeername> X<peer> 2008 2009Returns the packed sockaddr address of other end of the SOCKET connection. 2010 2011 use Socket; 2012 $hersockaddr = getpeername(SOCK); 2013 ($port, $iaddr) = sockaddr_in($hersockaddr); 2014 $herhostname = gethostbyaddr($iaddr, AF_INET); 2015 $herstraddr = inet_ntoa($iaddr); 2016 2017=item getpgrp PID 2018X<getpgrp> X<group> 2019 2020Returns the current process group for the specified PID. Use 2021a PID of C<0> to get the current process group for the 2022current process. Will raise an exception if used on a machine that 2023doesn't implement getpgrp(2). If PID is omitted, returns process 2024group of current process. Note that the POSIX version of C<getpgrp> 2025does not accept a PID argument, so only C<PID==0> is truly portable. 2026 2027=item getppid 2028X<getppid> X<parent> X<pid> 2029 2030Returns the process id of the parent process. 2031 2032Note for Linux users: on Linux, the C functions C<getpid()> and 2033C<getppid()> return different values from different threads. In order to 2034be portable, this behavior is not reflected by the perl-level function 2035C<getppid()>, that returns a consistent value across threads. If you want 2036to call the underlying C<getppid()>, you may use the CPAN module 2037C<Linux::Pid>. 2038 2039=item getpriority WHICH,WHO 2040X<getpriority> X<priority> X<nice> 2041 2042Returns the current priority for a process, a process group, or a user. 2043(See L<getpriority(2)>.) Will raise a fatal exception if used on a 2044machine that doesn't implement getpriority(2). 2045 2046=item getpwnam NAME 2047X<getpwnam> X<getgrnam> X<gethostbyname> X<getnetbyname> X<getprotobyname> 2048X<getpwuid> X<getgrgid> X<getservbyname> X<gethostbyaddr> X<getnetbyaddr> 2049X<getprotobynumber> X<getservbyport> X<getpwent> X<getgrent> X<gethostent> 2050X<getnetent> X<getprotoent> X<getservent> X<setpwent> X<setgrent> X<sethostent> 2051X<setnetent> X<setprotoent> X<setservent> X<endpwent> X<endgrent> X<endhostent> 2052X<endnetent> X<endprotoent> X<endservent> 2053 2054=item getgrnam NAME 2055 2056=item gethostbyname NAME 2057 2058=item getnetbyname NAME 2059 2060=item getprotobyname NAME 2061 2062=item getpwuid UID 2063 2064=item getgrgid GID 2065 2066=item getservbyname NAME,PROTO 2067 2068=item gethostbyaddr ADDR,ADDRTYPE 2069 2070=item getnetbyaddr ADDR,ADDRTYPE 2071 2072=item getprotobynumber NUMBER 2073 2074=item getservbyport PORT,PROTO 2075 2076=item getpwent 2077 2078=item getgrent 2079 2080=item gethostent 2081 2082=item getnetent 2083 2084=item getprotoent 2085 2086=item getservent 2087 2088=item setpwent 2089 2090=item setgrent 2091 2092=item sethostent STAYOPEN 2093 2094=item setnetent STAYOPEN 2095 2096=item setprotoent STAYOPEN 2097 2098=item setservent STAYOPEN 2099 2100=item endpwent 2101 2102=item endgrent 2103 2104=item endhostent 2105 2106=item endnetent 2107 2108=item endprotoent 2109 2110=item endservent 2111 2112These routines perform the same functions as their counterparts in the 2113system library. In list context, the return values from the 2114various get routines are as follows: 2115 2116 ($name,$passwd,$uid,$gid, 2117 $quota,$comment,$gcos,$dir,$shell,$expire) = getpw* 2118 ($name,$passwd,$gid,$members) = getgr* 2119 ($name,$aliases,$addrtype,$length,@addrs) = gethost* 2120 ($name,$aliases,$addrtype,$net) = getnet* 2121 ($name,$aliases,$proto) = getproto* 2122 ($name,$aliases,$port,$proto) = getserv* 2123 2124(If the entry doesn't exist you get a null list.) 2125 2126The exact meaning of the $gcos field varies but it usually contains 2127the real name of the user (as opposed to the login name) and other 2128information pertaining to the user. Beware, however, that in many 2129system users are able to change this information and therefore it 2130cannot be trusted and therefore the $gcos is tainted (see 2131L<perlsec>). The $passwd and $shell, user's encrypted password and 2132login shell, are also tainted, because of the same reason. 2133 2134In scalar context, you get the name, unless the function was a 2135lookup by name, in which case you get the other thing, whatever it is. 2136(If the entry doesn't exist you get the undefined value.) For example: 2137 2138 $uid = getpwnam($name); 2139 $name = getpwuid($num); 2140 $name = getpwent(); 2141 $gid = getgrnam($name); 2142 $name = getgrgid($num); 2143 $name = getgrent(); 2144 #etc. 2145 2146In I<getpw*()> the fields $quota, $comment, and $expire are special 2147cases in the sense that in many systems they are unsupported. If the 2148$quota is unsupported, it is an empty scalar. If it is supported, it 2149usually encodes the disk quota. If the $comment field is unsupported, 2150it is an empty scalar. If it is supported it usually encodes some 2151administrative comment about the user. In some systems the $quota 2152field may be $change or $age, fields that have to do with password 2153aging. In some systems the $comment field may be $class. The $expire 2154field, if present, encodes the expiration period of the account or the 2155password. For the availability and the exact meaning of these fields 2156in your system, please consult your getpwnam(3) documentation and your 2157F<pwd.h> file. You can also find out from within Perl what your 2158$quota and $comment fields mean and whether you have the $expire field 2159by using the C<Config> module and the values C<d_pwquota>, C<d_pwage>, 2160C<d_pwchange>, C<d_pwcomment>, and C<d_pwexpire>. Shadow password 2161files are only supported if your vendor has implemented them in the 2162intuitive fashion that calling the regular C library routines gets the 2163shadow versions if you're running under privilege or if there exists 2164the shadow(3) functions as found in System V (this includes Solaris 2165and Linux.) Those systems that implement a proprietary shadow password 2166facility are unlikely to be supported. 2167 2168The $members value returned by I<getgr*()> is a space separated list of 2169the login names of the members of the group. 2170 2171For the I<gethost*()> functions, if the C<h_errno> variable is supported in 2172C, it will be returned to you via C<$?> if the function call fails. The 2173C<@addrs> value returned by a successful call is a list of the raw 2174addresses returned by the corresponding system library call. In the 2175Internet domain, each address is four bytes long and you can unpack it 2176by saying something like: 2177 2178 ($a,$b,$c,$d) = unpack('C4',$addr[0]); 2179 2180The Socket library makes this slightly easier: 2181 2182 use Socket; 2183 $iaddr = inet_aton("127.1"); # or whatever address 2184 $name = gethostbyaddr($iaddr, AF_INET); 2185 2186 # or going the other way 2187 $straddr = inet_ntoa($iaddr); 2188 2189If you get tired of remembering which element of the return list 2190contains which return value, by-name interfaces are provided 2191in standard modules: C<File::stat>, C<Net::hostent>, C<Net::netent>, 2192C<Net::protoent>, C<Net::servent>, C<Time::gmtime>, C<Time::localtime>, 2193and C<User::grent>. These override the normal built-ins, supplying 2194versions that return objects with the appropriate names 2195for each field. For example: 2196 2197 use File::stat; 2198 use User::pwent; 2199 $is_his = (stat($filename)->uid == pwent($whoever)->uid); 2200 2201Even though it looks like they're the same method calls (uid), 2202they aren't, because a C<File::stat> object is different from 2203a C<User::pwent> object. 2204 2205=item getsockname SOCKET 2206X<getsockname> 2207 2208Returns the packed sockaddr address of this end of the SOCKET connection, 2209in case you don't know the address because you have several different 2210IPs that the connection might have come in on. 2211 2212 use Socket; 2213 $mysockaddr = getsockname(SOCK); 2214 ($port, $myaddr) = sockaddr_in($mysockaddr); 2215 printf "Connect to %s [%s]\n", 2216 scalar gethostbyaddr($myaddr, AF_INET), 2217 inet_ntoa($myaddr); 2218 2219=item getsockopt SOCKET,LEVEL,OPTNAME 2220X<getsockopt> 2221 2222Queries the option named OPTNAME associated with SOCKET at a given LEVEL. 2223Options may exist at multiple protocol levels depending on the socket 2224type, but at least the uppermost socket level SOL_SOCKET (defined in the 2225C<Socket> module) will exist. To query options at another level the 2226protocol number of the appropriate protocol controlling the option 2227should be supplied. For example, to indicate that an option is to be 2228interpreted by the TCP protocol, LEVEL should be set to the protocol 2229number of TCP, which you can get using getprotobyname. 2230 2231The call returns a packed string representing the requested socket option, 2232or C<undef> if there is an error (the error reason will be in $!). What 2233exactly is in the packed string depends in the LEVEL and OPTNAME, consult 2234your system documentation for details. A very common case however is that 2235the option is an integer, in which case the result will be a packed 2236integer which you can decode using unpack with the C<i> (or C<I>) format. 2237 2238An example testing if Nagle's algorithm is turned on on a socket: 2239 2240 use Socket qw(:all); 2241 2242 defined(my $tcp = getprotobyname("tcp")) 2243 or die "Could not determine the protocol number for tcp"; 2244 # my $tcp = IPPROTO_TCP; # Alternative 2245 my $packed = getsockopt($socket, $tcp, TCP_NODELAY) 2246 or die "Could not query TCP_NODELAY socket option: $!"; 2247 my $nodelay = unpack("I", $packed); 2248 print "Nagle's algorithm is turned ", $nodelay ? "off\n" : "on\n"; 2249 2250 2251=item glob EXPR 2252X<glob> X<wildcard> X<filename, expansion> X<expand> 2253 2254=item glob 2255 2256In list context, returns a (possibly empty) list of filename expansions on 2257the value of EXPR such as the standard Unix shell F</bin/csh> would do. In 2258scalar context, glob iterates through such filename expansions, returning 2259undef when the list is exhausted. This is the internal function 2260implementing the C<< <*.c> >> operator, but you can use it directly. If 2261EXPR is omitted, C<$_> is used. The C<< <*.c> >> operator is discussed in 2262more detail in L<perlop/"I/O Operators">. 2263 2264Beginning with v5.6.0, this operator is implemented using the standard 2265C<File::Glob> extension. See L<File::Glob> for details. 2266 2267=item gmtime EXPR 2268X<gmtime> X<UTC> X<Greenwich> 2269 2270=item gmtime 2271 2272Converts a time as returned by the time function to an 9-element list 2273with the time localized for the standard Greenwich time zone. 2274Typically used as follows: 2275 2276 # 0 1 2 3 4 5 6 7 8 2277 ($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst) = 2278 gmtime(time); 2279 2280All list elements are numeric, and come straight out of the C `struct 2281tm'. $sec, $min, and $hour are the seconds, minutes, and hours of the 2282specified time. $mday is the day of the month, and $mon is the month 2283itself, in the range C<0..11> with 0 indicating January and 11 2284indicating December. $year is the number of years since 1900. That 2285is, $year is C<123> in year 2023. $wday is the day of the week, with 22860 indicating Sunday and 3 indicating Wednesday. $yday is the day of 2287the year, in the range C<0..364> (or C<0..365> in leap years). $isdst 2288is always C<0>. 2289 2290Note that the $year element is I<not> simply the last two digits of 2291the year. If you assume it is then you create non-Y2K-compliant 2292programs--and you wouldn't want to do that, would you? 2293 2294The proper way to get a complete 4-digit year is simply: 2295 2296 $year += 1900; 2297 2298And to get the last two digits of the year (e.g., '01' in 2001) do: 2299 2300 $year = sprintf("%02d", $year % 100); 2301 2302If EXPR is omitted, C<gmtime()> uses the current time (C<gmtime(time)>). 2303 2304In scalar context, C<gmtime()> returns the ctime(3) value: 2305 2306 $now_string = gmtime; # e.g., "Thu Oct 13 04:54:34 1994" 2307 2308If you need local time instead of GMT use the L</localtime> builtin. 2309See also the C<timegm> function provided by the C<Time::Local> module, 2310and the strftime(3) and mktime(3) functions available via the L<POSIX> module. 2311 2312This scalar value is B<not> locale dependent (see L<perllocale>), but is 2313instead a Perl builtin. To get somewhat similar but locale dependent date 2314strings, see the example in L</localtime>. 2315 2316See L<perlport/gmtime> for portability concerns. 2317 2318=item goto LABEL 2319X<goto> X<jump> X<jmp> 2320 2321=item goto EXPR 2322 2323=item goto &NAME 2324 2325The C<goto-LABEL> form finds the statement labeled with LABEL and resumes 2326execution there. It may not be used to go into any construct that 2327requires initialization, such as a subroutine or a C<foreach> loop. It 2328also can't be used to go into a construct that is optimized away, 2329or to get out of a block or subroutine given to C<sort>. 2330It can be used to go almost anywhere else within the dynamic scope, 2331including out of subroutines, but it's usually better to use some other 2332construct such as C<last> or C<die>. The author of Perl has never felt the 2333need to use this form of C<goto> (in Perl, that is--C is another matter). 2334(The difference being that C does not offer named loops combined with 2335loop control. Perl does, and this replaces most structured uses of C<goto> 2336in other languages.) 2337 2338The C<goto-EXPR> form expects a label name, whose scope will be resolved 2339dynamically. This allows for computed C<goto>s per FORTRAN, but isn't 2340necessarily recommended if you're optimizing for maintainability: 2341 2342 goto ("FOO", "BAR", "GLARCH")[$i]; 2343 2344The C<goto-&NAME> form is quite different from the other forms of 2345C<goto>. In fact, it isn't a goto in the normal sense at all, and 2346doesn't have the stigma associated with other gotos. Instead, it 2347exits the current subroutine (losing any changes set by local()) and 2348immediately calls in its place the named subroutine using the current 2349value of @_. This is used by C<AUTOLOAD> subroutines that wish to 2350load another subroutine and then pretend that the other subroutine had 2351been called in the first place (except that any modifications to C<@_> 2352in the current subroutine are propagated to the other subroutine.) 2353After the C<goto>, not even C<caller> will be able to tell that this 2354routine was called first. 2355 2356NAME needn't be the name of a subroutine; it can be a scalar variable 2357containing a code reference, or a block that evaluates to a code 2358reference. 2359 2360=item grep BLOCK LIST 2361X<grep> 2362 2363=item grep EXPR,LIST 2364 2365This is similar in spirit to, but not the same as, grep(1) and its 2366relatives. In particular, it is not limited to using regular expressions. 2367 2368Evaluates the BLOCK or EXPR for each element of LIST (locally setting 2369C<$_> to each element) and returns the list value consisting of those 2370elements for which the expression evaluated to true. In scalar 2371context, returns the number of times the expression was true. 2372 2373 @foo = grep(!/^#/, @bar); # weed out comments 2374 2375or equivalently, 2376 2377 @foo = grep {!/^#/} @bar; # weed out comments 2378 2379Note that C<$_> is an alias to the list value, so it can be used to 2380modify the elements of the LIST. While this is useful and supported, 2381it can cause bizarre results if the elements of LIST are not variables. 2382Similarly, grep returns aliases into the original list, much as a for 2383loop's index variable aliases the list elements. That is, modifying an 2384element of a list returned by grep (for example, in a C<foreach>, C<map> 2385or another C<grep>) actually modifies the element in the original list. 2386This is usually something to be avoided when writing clear code. 2387 2388See also L</map> for a list composed of the results of the BLOCK or EXPR. 2389 2390=item hex EXPR 2391X<hex> X<hexadecimal> 2392 2393=item hex 2394 2395Interprets EXPR as a hex string and returns the corresponding value. 2396(To convert strings that might start with either C<0>, C<0x>, or C<0b>, see 2397L</oct>.) If EXPR is omitted, uses C<$_>. 2398 2399 print hex '0xAf'; # prints '175' 2400 print hex 'aF'; # same 2401 2402Hex strings may only represent integers. Strings that would cause 2403integer overflow trigger a warning. Leading whitespace is not stripped, 2404unlike oct(). To present something as hex, look into L</printf>, 2405L</sprintf>, or L</unpack>. 2406 2407=item import LIST 2408X<import> 2409 2410There is no builtin C<import> function. It is just an ordinary 2411method (subroutine) defined (or inherited) by modules that wish to export 2412names to another module. The C<use> function calls the C<import> method 2413for the package used. See also L</use>, L<perlmod>, and L<Exporter>. 2414 2415=item index STR,SUBSTR,POSITION 2416X<index> X<indexOf> X<InStr> 2417 2418=item index STR,SUBSTR 2419 2420The index function searches for one string within another, but without 2421the wildcard-like behavior of a full regular-expression pattern match. 2422It returns the position of the first occurrence of SUBSTR in STR at 2423or after POSITION. If POSITION is omitted, starts searching from the 2424beginning of the string. POSITION before the beginning of the string 2425or after its end is treated as if it were the beginning or the end, 2426respectively. POSITION and the return value are based at C<0> (or whatever 2427you've set the C<$[> variable to--but don't do that). If the substring 2428is not found, C<index> returns one less than the base, ordinarily C<-1>. 2429 2430=item int EXPR 2431X<int> X<integer> X<truncate> X<trunc> 2432 2433=item int 2434 2435Returns the integer portion of EXPR. If EXPR is omitted, uses C<$_>. 2436You should not use this function for rounding: one because it truncates 2437towards C<0>, and two because machine representations of floating point 2438numbers can sometimes produce counterintuitive results. For example, 2439C<int(-6.725/0.025)> produces -268 rather than the correct -269; that's 2440because it's really more like -268.99999999999994315658 instead. Usually, 2441the C<sprintf>, C<printf>, or the C<POSIX::floor> and C<POSIX::ceil> 2442functions will serve you better than will int(). 2443 2444=item ioctl FILEHANDLE,FUNCTION,SCALAR 2445X<ioctl> 2446 2447Implements the ioctl(2) function. You'll probably first have to say 2448 2449 require "sys/ioctl.ph"; # probably in $Config{archlib}/sys/ioctl.ph 2450 2451to get the correct function definitions. If F<sys/ioctl.ph> doesn't 2452exist or doesn't have the correct definitions you'll have to roll your 2453own, based on your C header files such as F<< <sys/ioctl.h> >>. 2454(There is a Perl script called B<h2ph> that comes with the Perl kit that 2455may help you in this, but it's nontrivial.) SCALAR will be read and/or 2456written depending on the FUNCTION--a pointer to the string value of SCALAR 2457will be passed as the third argument of the actual C<ioctl> call. (If SCALAR 2458has no string value but does have a numeric value, that value will be 2459passed rather than a pointer to the string value. To guarantee this to be 2460true, add a C<0> to the scalar before using it.) The C<pack> and C<unpack> 2461functions may be needed to manipulate the values of structures used by 2462C<ioctl>. 2463 2464The return value of C<ioctl> (and C<fcntl>) is as follows: 2465 2466 if OS returns: then Perl returns: 2467 -1 undefined value 2468 0 string "0 but true" 2469 anything else that number 2470 2471Thus Perl returns true on success and false on failure, yet you can 2472still easily determine the actual value returned by the operating 2473system: 2474 2475 $retval = ioctl(...) || -1; 2476 printf "System returned %d\n", $retval; 2477 2478The special string C<"0 but true"> is exempt from B<-w> complaints 2479about improper numeric conversions. 2480 2481=item join EXPR,LIST 2482X<join> 2483 2484Joins the separate strings of LIST into a single string with fields 2485separated by the value of EXPR, and returns that new string. Example: 2486 2487 $rec = join(':', $login,$passwd,$uid,$gid,$gcos,$home,$shell); 2488 2489Beware that unlike C<split>, C<join> doesn't take a pattern as its 2490first argument. Compare L</split>. 2491 2492=item keys HASH 2493X<keys> X<key> 2494 2495Returns a list consisting of all the keys of the named hash. 2496(In scalar context, returns the number of keys.) 2497 2498The keys are returned in an apparently random order. The actual 2499random order is subject to change in future versions of perl, but it 2500is guaranteed to be the same order as either the C<values> or C<each> 2501function produces (given that the hash has not been modified). Since 2502Perl 5.8.1 the ordering is different even between different runs of 2503Perl for security reasons (see L<perlsec/"Algorithmic Complexity 2504Attacks">). 2505 2506As a side effect, calling keys() resets the HASH's internal iterator 2507(see L</each>). In particular, calling keys() in void context resets 2508the iterator with no other overhead. 2509 2510Here is yet another way to print your environment: 2511 2512 @keys = keys %ENV; 2513 @values = values %ENV; 2514 while (@keys) { 2515 print pop(@keys), '=', pop(@values), "\n"; 2516 } 2517 2518or how about sorted by key: 2519 2520 foreach $key (sort(keys %ENV)) { 2521 print $key, '=', $ENV{$key}, "\n"; 2522 } 2523 2524The returned values are copies of the original keys in the hash, so 2525modifying them will not affect the original hash. Compare L</values>. 2526 2527To sort a hash by value, you'll need to use a C<sort> function. 2528Here's a descending numeric sort of a hash by its values: 2529 2530 foreach $key (sort { $hash{$b} <=> $hash{$a} } keys %hash) { 2531 printf "%4d %s\n", $hash{$key}, $key; 2532 } 2533 2534As an lvalue C<keys> allows you to increase the number of hash buckets 2535allocated for the given hash. This can gain you a measure of efficiency if 2536you know the hash is going to get big. (This is similar to pre-extending 2537an array by assigning a larger number to $#array.) If you say 2538 2539 keys %hash = 200; 2540 2541then C<%hash> will have at least 200 buckets allocated for it--256 of them, 2542in fact, since it rounds up to the next power of two. These 2543buckets will be retained even if you do C<%hash = ()>, use C<undef 2544%hash> if you want to free the storage while C<%hash> is still in scope. 2545You can't shrink the number of buckets allocated for the hash using 2546C<keys> in this way (but you needn't worry about doing this by accident, 2547as trying has no effect). 2548 2549See also C<each>, C<values> and C<sort>. 2550 2551=item kill SIGNAL, LIST 2552X<kill> X<signal> 2553 2554Sends a signal to a list of processes. Returns the number of 2555processes successfully signaled (which is not necessarily the 2556same as the number actually killed). 2557 2558 $cnt = kill 1, $child1, $child2; 2559 kill 9, @goners; 2560 2561If SIGNAL is zero, no signal is sent to the process. This is a 2562useful way to check that a child process is alive and hasn't changed 2563its UID. See L<perlport> for notes on the portability of this 2564construct. 2565 2566Unlike in the shell, if SIGNAL is negative, it kills 2567process groups instead of processes. (On System V, a negative I<PROCESS> 2568number will also kill process groups, but that's not portable.) That 2569means you usually want to use positive not negative signals. You may also 2570use a signal name in quotes. 2571 2572See L<perlipc/"Signals"> for more details. 2573 2574=item last LABEL 2575X<last> X<break> 2576 2577=item last 2578 2579The C<last> command is like the C<break> statement in C (as used in 2580loops); it immediately exits the loop in question. If the LABEL is 2581omitted, the command refers to the innermost enclosing loop. The 2582C<continue> block, if any, is not executed: 2583 2584 LINE: while (<STDIN>) { 2585 last LINE if /^$/; # exit when done with header 2586 #... 2587 } 2588 2589C<last> cannot be used to exit a block which returns a value such as 2590C<eval {}>, C<sub {}> or C<do {}>, and should not be used to exit 2591a grep() or map() operation. 2592 2593Note that a block by itself is semantically identical to a loop 2594that executes once. Thus C<last> can be used to effect an early 2595exit out of such a block. 2596 2597See also L</continue> for an illustration of how C<last>, C<next>, and 2598C<redo> work. 2599 2600=item lc EXPR 2601X<lc> X<lowercase> 2602 2603=item lc 2604 2605Returns a lowercased version of EXPR. This is the internal function 2606implementing the C<\L> escape in double-quoted strings. Respects 2607current LC_CTYPE locale if C<use locale> in force. See L<perllocale> 2608and L<perlunicode> for more details about locale and Unicode support. 2609 2610If EXPR is omitted, uses C<$_>. 2611 2612=item lcfirst EXPR 2613X<lcfirst> X<lowercase> 2614 2615=item lcfirst 2616 2617Returns the value of EXPR with the first character lowercased. This 2618is the internal function implementing the C<\l> escape in 2619double-quoted strings. Respects current LC_CTYPE locale if C<use 2620locale> in force. See L<perllocale> and L<perlunicode> for more 2621details about locale and Unicode support. 2622 2623If EXPR is omitted, uses C<$_>. 2624 2625=item length EXPR 2626X<length> X<size> 2627 2628=item length 2629 2630Returns the length in I<characters> of the value of EXPR. If EXPR is 2631omitted, returns length of C<$_>. Note that this cannot be used on 2632an entire array or hash to find out how many elements these have. 2633For that, use C<scalar @array> and C<scalar keys %hash> respectively. 2634 2635Note the I<characters>: if the EXPR is in Unicode, you will get the 2636number of characters, not the number of bytes. To get the length 2637in bytes, use C<do { use bytes; length(EXPR) }>, see L<bytes>. 2638 2639=item link OLDFILE,NEWFILE 2640X<link> 2641 2642Creates a new filename linked to the old filename. Returns true for 2643success, false otherwise. 2644 2645=item listen SOCKET,QUEUESIZE 2646X<listen> 2647 2648Does the same thing that the listen system call does. Returns true if 2649it succeeded, false otherwise. See the example in 2650L<perlipc/"Sockets: Client/Server Communication">. 2651 2652=item local EXPR 2653X<local> 2654 2655You really probably want to be using C<my> instead, because C<local> isn't 2656what most people think of as "local". See 2657L<perlsub/"Private Variables via my()"> for details. 2658 2659A local modifies the listed variables to be local to the enclosing 2660block, file, or eval. If more than one value is listed, the list must 2661be placed in parentheses. See L<perlsub/"Temporary Values via local()"> 2662for details, including issues with tied arrays and hashes. 2663 2664=item localtime EXPR 2665X<localtime> 2666 2667=item localtime 2668 2669Converts a time as returned by the time function to a 9-element list 2670with the time analyzed for the local time zone. Typically used as 2671follows: 2672 2673 # 0 1 2 3 4 5 6 7 8 2674 ($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst) = 2675 localtime(time); 2676 2677All list elements are numeric, and come straight out of the C `struct 2678tm'. C<$sec>, C<$min>, and C<$hour> are the seconds, minutes, and hours 2679of the specified time. 2680 2681C<$mday> is the day of the month, and C<$mon> is the month itself, in 2682the range C<0..11> with 0 indicating January and 11 indicating December. 2683This makes it easy to get a month name from a list: 2684 2685 my @abbr = qw( Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec ); 2686 print "$abbr[$mon] $mday"; 2687 # $mon=9, $mday=18 gives "Oct 18" 2688 2689C<$year> is the number of years since 1900, not just the last two digits 2690of the year. That is, C<$year> is C<123> in year 2023. The proper way 2691to get a complete 4-digit year is simply: 2692 2693 $year += 1900; 2694 2695To get the last two digits of the year (e.g., '01' in 2001) do: 2696 2697 $year = sprintf("%02d", $year % 100); 2698 2699C<$wday> is the day of the week, with 0 indicating Sunday and 3 indicating 2700Wednesday. C<$yday> is the day of the year, in the range C<0..364> 2701(or C<0..365> in leap years.) 2702 2703C<$isdst> is true if the specified time occurs during Daylight Saving 2704Time, false otherwise. 2705 2706If EXPR is omitted, C<localtime()> uses the current time (C<localtime(time)>). 2707 2708In scalar context, C<localtime()> returns the ctime(3) value: 2709 2710 $now_string = localtime; # e.g., "Thu Oct 13 04:54:34 1994" 2711 2712This scalar value is B<not> locale dependent but is a Perl builtin. For GMT 2713instead of local time use the L</gmtime> builtin. See also the 2714C<Time::Local> module (to convert the second, minutes, hours, ... back to 2715the integer value returned by time()), and the L<POSIX> module's strftime(3) 2716and mktime(3) functions. 2717 2718To get somewhat similar but locale dependent date strings, set up your 2719locale environment variables appropriately (please see L<perllocale>) and 2720try for example: 2721 2722 use POSIX qw(strftime); 2723 $now_string = strftime "%a %b %e %H:%M:%S %Y", localtime; 2724 # or for GMT formatted appropriately for your locale: 2725 $now_string = strftime "%a %b %e %H:%M:%S %Y", gmtime; 2726 2727Note that the C<%a> and C<%b>, the short forms of the day of the week 2728and the month of the year, may not necessarily be three characters wide. 2729 2730See L<perlport/localtime> for portability concerns. 2731 2732=item lock THING 2733X<lock> 2734 2735This function places an advisory lock on a shared variable, or referenced 2736object contained in I<THING> until the lock goes out of scope. 2737 2738lock() is a "weak keyword" : this means that if you've defined a function 2739by this name (before any calls to it), that function will be called 2740instead. (However, if you've said C<use threads>, lock() is always a 2741keyword.) See L<threads>. 2742 2743=item log EXPR 2744X<log> X<logarithm> X<e> X<ln> X<base> 2745 2746=item log 2747 2748Returns the natural logarithm (base I<e>) of EXPR. If EXPR is omitted, 2749returns log of C<$_>. To get the log of another base, use basic algebra: 2750The base-N log of a number is equal to the natural log of that number 2751divided by the natural log of N. For example: 2752 2753 sub log10 { 2754 my $n = shift; 2755 return log($n)/log(10); 2756 } 2757 2758See also L</exp> for the inverse operation. 2759 2760=item lstat EXPR 2761X<lstat> 2762 2763=item lstat 2764 2765Does the same thing as the C<stat> function (including setting the 2766special C<_> filehandle) but stats a symbolic link instead of the file 2767the symbolic link points to. If symbolic links are unimplemented on 2768your system, a normal C<stat> is done. For much more detailed 2769information, please see the documentation for L</stat>. 2770 2771If EXPR is omitted, stats C<$_>. 2772 2773=item m// 2774 2775The match operator. See L<perlop>. 2776 2777=item map BLOCK LIST 2778X<map> 2779 2780=item map EXPR,LIST 2781 2782Evaluates the BLOCK or EXPR for each element of LIST (locally setting 2783C<$_> to each element) and returns the list value composed of the 2784results of each such evaluation. In scalar context, returns the 2785total number of elements so generated. Evaluates BLOCK or EXPR in 2786list context, so each element of LIST may produce zero, one, or 2787more elements in the returned value. 2788 2789 @chars = map(chr, @nums); 2790 2791translates a list of numbers to the corresponding characters. And 2792 2793 %hash = map { getkey($_) => $_ } @array; 2794 2795is just a funny way to write 2796 2797 %hash = (); 2798 foreach $_ (@array) { 2799 $hash{getkey($_)} = $_; 2800 } 2801 2802Note that C<$_> is an alias to the list value, so it can be used to 2803modify the elements of the LIST. While this is useful and supported, 2804it can cause bizarre results if the elements of LIST are not variables. 2805Using a regular C<foreach> loop for this purpose would be clearer in 2806most cases. See also L</grep> for an array composed of those items of 2807the original list for which the BLOCK or EXPR evaluates to true. 2808 2809C<{> starts both hash references and blocks, so C<map { ...> could be either 2810the start of map BLOCK LIST or map EXPR, LIST. Because perl doesn't look 2811ahead for the closing C<}> it has to take a guess at which its dealing with 2812based what it finds just after the C<{>. Usually it gets it right, but if it 2813doesn't it won't realize something is wrong until it gets to the C<}> and 2814encounters the missing (or unexpected) comma. The syntax error will be 2815reported close to the C<}> but you'll need to change something near the C<{> 2816such as using a unary C<+> to give perl some help: 2817 2818 %hash = map { "\L$_", 1 } @array # perl guesses EXPR. wrong 2819 %hash = map { +"\L$_", 1 } @array # perl guesses BLOCK. right 2820 %hash = map { ("\L$_", 1) } @array # this also works 2821 %hash = map { lc($_), 1 } @array # as does this. 2822 %hash = map +( lc($_), 1 ), @array # this is EXPR and works! 2823 2824 %hash = map ( lc($_), 1 ), @array # evaluates to (1, @array) 2825 2826or to force an anon hash constructor use C<+{> 2827 2828 @hashes = map +{ lc($_), 1 }, @array # EXPR, so needs , at end 2829 2830and you get list of anonymous hashes each with only 1 entry. 2831 2832=item mkdir FILENAME,MASK 2833X<mkdir> X<md> X<directory, create> 2834 2835=item mkdir FILENAME 2836 2837Creates the directory specified by FILENAME, with permissions 2838specified by MASK (as modified by C<umask>). If it succeeds it 2839returns true, otherwise it returns false and sets C<$!> (errno). 2840If omitted, MASK defaults to 0777. 2841 2842In general, it is better to create directories with permissive MASK, 2843and let the user modify that with their C<umask>, than it is to supply 2844a restrictive MASK and give the user no way to be more permissive. 2845The exceptions to this rule are when the file or directory should be 2846kept private (mail files, for instance). The perlfunc(1) entry on 2847C<umask> discusses the choice of MASK in more detail. 2848 2849Note that according to the POSIX 1003.1-1996 the FILENAME may have any 2850number of trailing slashes. Some operating and filesystems do not get 2851this right, so Perl automatically removes all trailing slashes to keep 2852everyone happy. 2853 2854=item msgctl ID,CMD,ARG 2855X<msgctl> 2856 2857Calls the System V IPC function msgctl(2). You'll probably have to say 2858 2859 use IPC::SysV; 2860 2861first to get the correct constant definitions. If CMD is C<IPC_STAT>, 2862then ARG must be a variable that will hold the returned C<msqid_ds> 2863structure. Returns like C<ioctl>: the undefined value for error, 2864C<"0 but true"> for zero, or the actual return value otherwise. See also 2865L<perlipc/"SysV IPC">, C<IPC::SysV>, and C<IPC::Semaphore> documentation. 2866 2867=item msgget KEY,FLAGS 2868X<msgget> 2869 2870Calls the System V IPC function msgget(2). Returns the message queue 2871id, or the undefined value if there is an error. See also 2872L<perlipc/"SysV IPC"> and C<IPC::SysV> and C<IPC::Msg> documentation. 2873 2874=item msgrcv ID,VAR,SIZE,TYPE,FLAGS 2875X<msgrcv> 2876 2877Calls the System V IPC function msgrcv to receive a message from 2878message queue ID into variable VAR with a maximum message size of 2879SIZE. Note that when a message is received, the message type as a 2880native long integer will be the first thing in VAR, followed by the 2881actual message. This packing may be opened with C<unpack("l! a*")>. 2882Taints the variable. Returns true if successful, or false if there is 2883an error. See also L<perlipc/"SysV IPC">, C<IPC::SysV>, and 2884C<IPC::SysV::Msg> documentation. 2885 2886=item msgsnd ID,MSG,FLAGS 2887X<msgsnd> 2888 2889Calls the System V IPC function msgsnd to send the message MSG to the 2890message queue ID. MSG must begin with the native long integer message 2891type, and be followed by the length of the actual message, and finally 2892the message itself. This kind of packing can be achieved with 2893C<pack("l! a*", $type, $message)>. Returns true if successful, 2894or false if there is an error. See also C<IPC::SysV> 2895and C<IPC::SysV::Msg> documentation. 2896 2897=item my EXPR 2898X<my> 2899 2900=item my TYPE EXPR 2901 2902=item my EXPR : ATTRS 2903 2904=item my TYPE EXPR : ATTRS 2905 2906A C<my> declares the listed variables to be local (lexically) to the 2907enclosing block, file, or C<eval>. If more than one value is listed, 2908the list must be placed in parentheses. 2909 2910The exact semantics and interface of TYPE and ATTRS are still 2911evolving. TYPE is currently bound to the use of C<fields> pragma, 2912and attributes are handled using the C<attributes> pragma, or starting 2913from Perl 5.8.0 also via the C<Attribute::Handlers> module. See 2914L<perlsub/"Private Variables via my()"> for details, and L<fields>, 2915L<attributes>, and L<Attribute::Handlers>. 2916 2917=item next LABEL 2918X<next> X<continue> 2919 2920=item next 2921 2922The C<next> command is like the C<continue> statement in C; it starts 2923the next iteration of the loop: 2924 2925 LINE: while (<STDIN>) { 2926 next LINE if /^#/; # discard comments 2927 #... 2928 } 2929 2930Note that if there were a C<continue> block on the above, it would get 2931executed even on discarded lines. If the LABEL is omitted, the command 2932refers to the innermost enclosing loop. 2933 2934C<next> cannot be used to exit a block which returns a value such as 2935C<eval {}>, C<sub {}> or C<do {}>, and should not be used to exit 2936a grep() or map() operation. 2937 2938Note that a block by itself is semantically identical to a loop 2939that executes once. Thus C<next> will exit such a block early. 2940 2941See also L</continue> for an illustration of how C<last>, C<next>, and 2942C<redo> work. 2943 2944=item no Module VERSION LIST 2945X<no> 2946 2947=item no Module VERSION 2948 2949=item no Module LIST 2950 2951=item no Module 2952 2953See the C<use> function, which C<no> is the opposite of. 2954 2955=item oct EXPR 2956X<oct> X<octal> X<hex> X<hexadecimal> X<binary> X<bin> 2957 2958=item oct 2959 2960Interprets EXPR as an octal string and returns the corresponding 2961value. (If EXPR happens to start off with C<0x>, interprets it as a 2962hex string. If EXPR starts off with C<0b>, it is interpreted as a 2963binary string. Leading whitespace is ignored in all three cases.) 2964The following will handle decimal, binary, octal, and hex in the standard 2965Perl or C notation: 2966 2967 $val = oct($val) if $val =~ /^0/; 2968 2969If EXPR is omitted, uses C<$_>. To go the other way (produce a number 2970in octal), use sprintf() or printf(): 2971 2972 $perms = (stat("filename"))[2] & 07777; 2973 $oct_perms = sprintf "%lo", $perms; 2974 2975The oct() function is commonly used when a string such as C<644> needs 2976to be converted into a file mode, for example. (Although perl will 2977automatically convert strings into numbers as needed, this automatic 2978conversion assumes base 10.) 2979 2980=item open FILEHANDLE,EXPR 2981X<open> X<pipe> X<file, open> X<fopen> 2982 2983=item open FILEHANDLE,MODE,EXPR 2984 2985=item open FILEHANDLE,MODE,EXPR,LIST 2986 2987=item open FILEHANDLE,MODE,REFERENCE 2988 2989=item open FILEHANDLE 2990 2991Opens the file whose filename is given by EXPR, and associates it with 2992FILEHANDLE. 2993 2994(The following is a comprehensive reference to open(): for a gentler 2995introduction you may consider L<perlopentut>.) 2996 2997If FILEHANDLE is an undefined scalar variable (or array or hash element) 2998the variable is assigned a reference to a new anonymous filehandle, 2999otherwise if FILEHANDLE is an expression, its value is used as the name of 3000the real filehandle wanted. (This is considered a symbolic reference, so 3001C<use strict 'refs'> should I<not> be in effect.) 3002 3003If EXPR is omitted, the scalar variable of the same name as the 3004FILEHANDLE contains the filename. (Note that lexical variables--those 3005declared with C<my>--will not work for this purpose; so if you're 3006using C<my>, specify EXPR in your call to open.) 3007 3008If three or more arguments are specified then the mode of opening and 3009the file name are separate. If MODE is C<< '<' >> or nothing, the file 3010is opened for input. If MODE is C<< '>' >>, the file is truncated and 3011opened for output, being created if necessary. If MODE is C<<< '>>' >>>, 3012the file is opened for appending, again being created if necessary. 3013 3014You can put a C<'+'> in front of the C<< '>' >> or C<< '<' >> to 3015indicate that you want both read and write access to the file; thus 3016C<< '+<' >> is almost always preferred for read/write updates--the C<< 3017'+>' >> mode would clobber the file first. You can't usually use 3018either read-write mode for updating textfiles, since they have 3019variable length records. See the B<-i> switch in L<perlrun> for a 3020better approach. The file is created with permissions of C<0666> 3021modified by the process' C<umask> value. 3022 3023These various prefixes correspond to the fopen(3) modes of C<'r'>, 3024C<'r+'>, C<'w'>, C<'w+'>, C<'a'>, and C<'a+'>. 3025 3026In the 2-arguments (and 1-argument) form of the call the mode and 3027filename should be concatenated (in this order), possibly separated by 3028spaces. It is possible to omit the mode in these forms if the mode is 3029C<< '<' >>. 3030 3031If the filename begins with C<'|'>, the filename is interpreted as a 3032command to which output is to be piped, and if the filename ends with a 3033C<'|'>, the filename is interpreted as a command which pipes output to 3034us. See L<perlipc/"Using open() for IPC"> 3035for more examples of this. (You are not allowed to C<open> to a command 3036that pipes both in I<and> out, but see L<IPC::Open2>, L<IPC::Open3>, 3037and L<perlipc/"Bidirectional Communication with Another Process"> 3038for alternatives.) 3039 3040For three or more arguments if MODE is C<'|-'>, the filename is 3041interpreted as a command to which output is to be piped, and if MODE 3042is C<'-|'>, the filename is interpreted as a command which pipes 3043output to us. In the 2-arguments (and 1-argument) form one should 3044replace dash (C<'-'>) with the command. 3045See L<perlipc/"Using open() for IPC"> for more examples of this. 3046(You are not allowed to C<open> to a command that pipes both in I<and> 3047out, but see L<IPC::Open2>, L<IPC::Open3>, and 3048L<perlipc/"Bidirectional Communication"> for alternatives.) 3049 3050In the three-or-more argument form of pipe opens, if LIST is specified 3051(extra arguments after the command name) then LIST becomes arguments 3052to the command invoked if the platform supports it. The meaning of 3053C<open> with more than three arguments for non-pipe modes is not yet 3054specified. Experimental "layers" may give extra LIST arguments 3055meaning. 3056 3057In the 2-arguments (and 1-argument) form opening C<'-'> opens STDIN 3058and opening C<< '>-' >> opens STDOUT. 3059 3060You may use the three-argument form of open to specify IO "layers" 3061(sometimes also referred to as "disciplines") to be applied to the handle 3062that affect how the input and output are processed (see L<open> and 3063L<PerlIO> for more details). For example 3064 3065 open(FH, "<:utf8", "file") 3066 3067will open the UTF-8 encoded file containing Unicode characters, 3068see L<perluniintro>. Note that if layers are specified in the 3069three-arg form then default layers stored in ${^OPEN} (see L<perlvar>; 3070usually set by the B<open> pragma or the switch B<-CioD>) are ignored. 3071 3072Open returns nonzero upon success, the undefined value otherwise. If 3073the C<open> involved a pipe, the return value happens to be the pid of 3074the subprocess. 3075 3076If you're running Perl on a system that distinguishes between text 3077files and binary files, then you should check out L</binmode> for tips 3078for dealing with this. The key distinction between systems that need 3079C<binmode> and those that don't is their text file formats. Systems 3080like Unix, Mac OS, and Plan 9, which delimit lines with a single 3081character, and which encode that character in C as C<"\n">, do not 3082need C<binmode>. The rest need it. 3083 3084When opening a file, it's usually a bad idea to continue normal execution 3085if the request failed, so C<open> is frequently used in connection with 3086C<die>. Even if C<die> won't do what you want (say, in a CGI script, 3087where you want to make a nicely formatted error message (but there are 3088modules that can help with that problem)) you should always check 3089the return value from opening a file. The infrequent exception is when 3090working with an unopened filehandle is actually what you want to do. 3091 3092As a special case the 3-arg form with a read/write mode and the third 3093argument being C<undef>: 3094 3095 open(TMP, "+>", undef) or die ... 3096 3097opens a filehandle to an anonymous temporary file. Also using "+<" 3098works for symmetry, but you really should consider writing something 3099to the temporary file first. You will need to seek() to do the 3100reading. 3101 3102Since v5.8.0, perl has built using PerlIO by default. Unless you've 3103changed this (i.e. Configure -Uuseperlio), you can open file handles to 3104"in memory" files held in Perl scalars via: 3105 3106 open($fh, '>', \$variable) || .. 3107 3108Though if you try to re-open C<STDOUT> or C<STDERR> as an "in memory" 3109file, you have to close it first: 3110 3111 close STDOUT; 3112 open STDOUT, '>', \$variable or die "Can't open STDOUT: $!"; 3113 3114Examples: 3115 3116 $ARTICLE = 100; 3117 open ARTICLE or die "Can't find article $ARTICLE: $!\n"; 3118 while (<ARTICLE>) {... 3119 3120 open(LOG, '>>/usr/spool/news/twitlog'); # (log is reserved) 3121 # if the open fails, output is discarded 3122 3123 open(DBASE, '+<', 'dbase.mine') # open for update 3124 or die "Can't open 'dbase.mine' for update: $!"; 3125 3126 open(DBASE, '+<dbase.mine') # ditto 3127 or die "Can't open 'dbase.mine' for update: $!"; 3128 3129 open(ARTICLE, '-|', "caesar <$article") # decrypt article 3130 or die "Can't start caesar: $!"; 3131 3132 open(ARTICLE, "caesar <$article |") # ditto 3133 or die "Can't start caesar: $!"; 3134 3135 open(EXTRACT, "|sort >Tmp$$") # $$ is our process id 3136 or die "Can't start sort: $!"; 3137 3138 # in memory files 3139 open(MEMORY,'>', \$var) 3140 or die "Can't open memory file: $!"; 3141 print MEMORY "foo!\n"; # output will end up in $var 3142 3143 # process argument list of files along with any includes 3144 3145 foreach $file (@ARGV) { 3146 process($file, 'fh00'); 3147 } 3148 3149 sub process { 3150 my($filename, $input) = @_; 3151 $input++; # this is a string increment 3152 unless (open($input, $filename)) { 3153 print STDERR "Can't open $filename: $!\n"; 3154 return; 3155 } 3156 3157 local $_; 3158 while (<$input>) { # note use of indirection 3159 if (/^#include "(.*)"/) { 3160 process($1, $input); 3161 next; 3162 } 3163 #... # whatever 3164 } 3165 } 3166 3167See L<perliol> for detailed info on PerlIO. 3168 3169You may also, in the Bourne shell tradition, specify an EXPR beginning 3170with C<< '>&' >>, in which case the rest of the string is interpreted 3171as the name of a filehandle (or file descriptor, if numeric) to be 3172duped (as L<dup(2)>) and opened. You may use C<&> after C<< > >>, 3173C<<< >> >>>, C<< < >>, C<< +> >>, C<<< +>> >>>, and C<< +< >>. 3174The mode you specify should match the mode of the original filehandle. 3175(Duping a filehandle does not take into account any existing contents 3176of IO buffers.) If you use the 3-arg form then you can pass either a 3177number, the name of a filehandle or the normal "reference to a glob". 3178 3179Here is a script that saves, redirects, and restores C<STDOUT> and 3180C<STDERR> using various methods: 3181 3182 #!/usr/bin/perl 3183 open my $oldout, ">&STDOUT" or die "Can't dup STDOUT: $!"; 3184 open OLDERR, ">&", \*STDERR or die "Can't dup STDERR: $!"; 3185 3186 open STDOUT, '>', "foo.out" or die "Can't redirect STDOUT: $!"; 3187 open STDERR, ">&STDOUT" or die "Can't dup STDOUT: $!"; 3188 3189 select STDERR; $| = 1; # make unbuffered 3190 select STDOUT; $| = 1; # make unbuffered 3191 3192 print STDOUT "stdout 1\n"; # this works for 3193 print STDERR "stderr 1\n"; # subprocesses too 3194 3195 open STDOUT, ">&", $oldout or die "Can't dup \$oldout: $!"; 3196 open STDERR, ">&OLDERR" or die "Can't dup OLDERR: $!"; 3197 3198 print STDOUT "stdout 2\n"; 3199 print STDERR "stderr 2\n"; 3200 3201If you specify C<< '<&=X' >>, where C<X> is a file descriptor number 3202or a filehandle, then Perl will do an equivalent of C's C<fdopen> of 3203that file descriptor (and not call L<dup(2)>); this is more 3204parsimonious of file descriptors. For example: 3205 3206 # open for input, reusing the fileno of $fd 3207 open(FILEHANDLE, "<&=$fd") 3208 3209or 3210 3211 open(FILEHANDLE, "<&=", $fd) 3212 3213or 3214 3215 # open for append, using the fileno of OLDFH 3216 open(FH, ">>&=", OLDFH) 3217 3218or 3219 3220 open(FH, ">>&=OLDFH") 3221 3222Being parsimonious on filehandles is also useful (besides being 3223parsimonious) for example when something is dependent on file 3224descriptors, like for example locking using flock(). If you do just 3225C<< open(A, '>>&B') >>, the filehandle A will not have the same file 3226descriptor as B, and therefore flock(A) will not flock(B), and vice 3227versa. But with C<< open(A, '>>&=B') >> the filehandles will share 3228the same file descriptor. 3229 3230Note that if you are using Perls older than 5.8.0, Perl will be using 3231the standard C libraries' fdopen() to implement the "=" functionality. 3232On many UNIX systems fdopen() fails when file descriptors exceed a 3233certain value, typically 255. For Perls 5.8.0 and later, PerlIO is 3234most often the default. 3235 3236You can see whether Perl has been compiled with PerlIO or not by 3237running C<perl -V> and looking for C<useperlio=> line. If C<useperlio> 3238is C<define>, you have PerlIO, otherwise you don't. 3239 3240If you open a pipe on the command C<'-'>, i.e., either C<'|-'> or C<'-|'> 3241with 2-arguments (or 1-argument) form of open(), then 3242there is an implicit fork done, and the return value of open is the pid 3243of the child within the parent process, and C<0> within the child 3244process. (Use C<defined($pid)> to determine whether the open was successful.) 3245The filehandle behaves normally for the parent, but i/o to that 3246filehandle is piped from/to the STDOUT/STDIN of the child process. 3247In the child process the filehandle isn't opened--i/o happens from/to 3248the new STDOUT or STDIN. Typically this is used like the normal 3249piped open when you want to exercise more control over just how the 3250pipe command gets executed, such as when you are running setuid, and 3251don't want to have to scan shell commands for metacharacters. 3252The following triples are more or less equivalent: 3253 3254 open(FOO, "|tr '[a-z]' '[A-Z]'"); 3255 open(FOO, '|-', "tr '[a-z]' '[A-Z]'"); 3256 open(FOO, '|-') || exec 'tr', '[a-z]', '[A-Z]'; 3257 open(FOO, '|-', "tr", '[a-z]', '[A-Z]'); 3258 3259 open(FOO, "cat -n '$file'|"); 3260 open(FOO, '-|', "cat -n '$file'"); 3261 open(FOO, '-|') || exec 'cat', '-n', $file; 3262 open(FOO, '-|', "cat", '-n', $file); 3263 3264The last example in each block shows the pipe as "list form", which is 3265not yet supported on all platforms. A good rule of thumb is that if 3266your platform has true C<fork()> (in other words, if your platform is 3267UNIX) you can use the list form. 3268 3269See L<perlipc/"Safe Pipe Opens"> for more examples of this. 3270 3271Beginning with v5.6.0, Perl will attempt to flush all files opened for 3272output before any operation that may do a fork, but this may not be 3273supported on some platforms (see L<perlport>). To be safe, you may need 3274to set C<$|> ($AUTOFLUSH in English) or call the C<autoflush()> method 3275of C<IO::Handle> on any open handles. 3276 3277On systems that support a close-on-exec flag on files, the flag will 3278be set for the newly opened file descriptor as determined by the value 3279of $^F. See L<perlvar/$^F>. 3280 3281Closing any piped filehandle causes the parent process to wait for the 3282child to finish, and returns the status value in C<$?>. 3283 3284The filename passed to 2-argument (or 1-argument) form of open() will 3285have leading and trailing whitespace deleted, and the normal 3286redirection characters honored. This property, known as "magic open", 3287can often be used to good effect. A user could specify a filename of 3288F<"rsh cat file |">, or you could change certain filenames as needed: 3289 3290 $filename =~ s/(.*\.gz)\s*$/gzip -dc < $1|/; 3291 open(FH, $filename) or die "Can't open $filename: $!"; 3292 3293Use 3-argument form to open a file with arbitrary weird characters in it, 3294 3295 open(FOO, '<', $file); 3296 3297otherwise it's necessary to protect any leading and trailing whitespace: 3298 3299 $file =~ s#^(\s)#./$1#; 3300 open(FOO, "< $file\0"); 3301 3302(this may not work on some bizarre filesystems). One should 3303conscientiously choose between the I<magic> and 3-arguments form 3304of open(): 3305 3306 open IN, $ARGV[0]; 3307 3308will allow the user to specify an argument of the form C<"rsh cat file |">, 3309but will not work on a filename which happens to have a trailing space, while 3310 3311 open IN, '<', $ARGV[0]; 3312 3313will have exactly the opposite restrictions. 3314 3315If you want a "real" C C<open> (see L<open(2)> on your system), then you 3316should use the C<sysopen> function, which involves no such magic (but 3317may use subtly different filemodes than Perl open(), which is mapped 3318to C fopen()). This is 3319another way to protect your filenames from interpretation. For example: 3320 3321 use IO::Handle; 3322 sysopen(HANDLE, $path, O_RDWR|O_CREAT|O_EXCL) 3323 or die "sysopen $path: $!"; 3324 $oldfh = select(HANDLE); $| = 1; select($oldfh); 3325 print HANDLE "stuff $$\n"; 3326 seek(HANDLE, 0, 0); 3327 print "File contains: ", <HANDLE>; 3328 3329Using the constructor from the C<IO::Handle> package (or one of its 3330subclasses, such as C<IO::File> or C<IO::Socket>), you can generate anonymous 3331filehandles that have the scope of whatever variables hold references to 3332them, and automatically close whenever and however you leave that scope: 3333 3334 use IO::File; 3335 #... 3336 sub read_myfile_munged { 3337 my $ALL = shift; 3338 my $handle = new IO::File; 3339 open($handle, "myfile") or die "myfile: $!"; 3340 $first = <$handle> 3341 or return (); # Automatically closed here. 3342 mung $first or die "mung failed"; # Or here. 3343 return $first, <$handle> if $ALL; # Or here. 3344 $first; # Or here. 3345 } 3346 3347See L</seek> for some details about mixing reading and writing. 3348 3349=item opendir DIRHANDLE,EXPR 3350X<opendir> 3351 3352Opens a directory named EXPR for processing by C<readdir>, C<telldir>, 3353C<seekdir>, C<rewinddir>, and C<closedir>. Returns true if successful. 3354DIRHANDLE may be an expression whose value can be used as an indirect 3355dirhandle, usually the real dirhandle name. If DIRHANDLE is an undefined 3356scalar variable (or array or hash element), the variable is assigned a 3357reference to a new anonymous dirhandle. 3358DIRHANDLEs have their own namespace separate from FILEHANDLEs. 3359 3360=item ord EXPR 3361X<ord> X<encoding> 3362 3363=item ord 3364 3365Returns the numeric (the native 8-bit encoding, like ASCII or EBCDIC, 3366or Unicode) value of the first character of EXPR. If EXPR is omitted, 3367uses C<$_>. 3368 3369For the reverse, see L</chr>. 3370See L<perlunicode> and L<encoding> for more about Unicode. 3371 3372=item our EXPR 3373X<our> X<global> 3374 3375=item our EXPR TYPE 3376 3377=item our EXPR : ATTRS 3378 3379=item our TYPE EXPR : ATTRS 3380 3381C<our> associates a simple name with a package variable in the current 3382package for use within the current scope. When C<use strict 'vars'> is in 3383effect, C<our> lets you use declared global variables without qualifying 3384them with package names, within the lexical scope of the C<our> declaration. 3385In this way C<our> differs from C<use vars>, which is package scoped. 3386 3387Unlike C<my>, which both allocates storage for a variable and associates 3388a simple name with that storage for use within the current scope, C<our> 3389associates a simple name with a package variable in the current package, 3390for use within the current scope. In other words, C<our> has the same 3391scoping rules as C<my>, but does not necessarily create a 3392variable. 3393 3394If more than one value is listed, the list must be placed 3395in parentheses. 3396 3397 our $foo; 3398 our($bar, $baz); 3399 3400An C<our> declaration declares a global variable that will be visible 3401across its entire lexical scope, even across package boundaries. The 3402package in which the variable is entered is determined at the point 3403of the declaration, not at the point of use. This means the following 3404behavior holds: 3405 3406 package Foo; 3407 our $bar; # declares $Foo::bar for rest of lexical scope 3408 $bar = 20; 3409 3410 package Bar; 3411 print $bar; # prints 20, as it refers to $Foo::bar 3412 3413Multiple C<our> declarations with the same name in the same lexical 3414scope are allowed if they are in different packages. If they happen 3415to be in the same package, Perl will emit warnings if you have asked 3416for them, just like multiple C<my> declarations. Unlike a second 3417C<my> declaration, which will bind the name to a fresh variable, a 3418second C<our> declaration in the same package, in the same scope, is 3419merely redundant. 3420 3421 use warnings; 3422 package Foo; 3423 our $bar; # declares $Foo::bar for rest of lexical scope 3424 $bar = 20; 3425 3426 package Bar; 3427 our $bar = 30; # declares $Bar::bar for rest of lexical scope 3428 print $bar; # prints 30 3429 3430 our $bar; # emits warning but has no other effect 3431 print $bar; # still prints 30 3432 3433An C<our> declaration may also have a list of attributes associated 3434with it. 3435 3436The exact semantics and interface of TYPE and ATTRS are still 3437evolving. TYPE is currently bound to the use of C<fields> pragma, 3438and attributes are handled using the C<attributes> pragma, or starting 3439from Perl 5.8.0 also via the C<Attribute::Handlers> module. See 3440L<perlsub/"Private Variables via my()"> for details, and L<fields>, 3441L<attributes>, and L<Attribute::Handlers>. 3442 3443The only currently recognized C<our()> attribute is C<unique> which 3444indicates that a single copy of the global is to be used by all 3445interpreters should the program happen to be running in a 3446multi-interpreter environment. (The default behaviour would be for 3447each interpreter to have its own copy of the global.) Examples: 3448 3449 our @EXPORT : unique = qw(foo); 3450 our %EXPORT_TAGS : unique = (bar => [qw(aa bb cc)]); 3451 our $VERSION : unique = "1.00"; 3452 3453Note that this attribute also has the effect of making the global 3454readonly when the first new interpreter is cloned (for example, 3455when the first new thread is created). 3456 3457Multi-interpreter environments can come to being either through the 3458fork() emulation on Windows platforms, or by embedding perl in a 3459multi-threaded application. The C<unique> attribute does nothing in 3460all other environments. 3461 3462Warning: the current implementation of this attribute operates on the 3463typeglob associated with the variable; this means that C<our $x : unique> 3464also has the effect of C<our @x : unique; our %x : unique>. This may be 3465subject to change. 3466 3467=item pack TEMPLATE,LIST 3468X<pack> 3469 3470Takes a LIST of values and converts it into a string using the rules 3471given by the TEMPLATE. The resulting string is the concatenation of 3472the converted values. Typically, each converted value looks 3473like its machine-level representation. For example, on 32-bit machines 3474a converted integer may be represented by a sequence of 4 bytes. 3475 3476The TEMPLATE is a sequence of characters that give the order and type 3477of values, as follows: 3478 3479 a A string with arbitrary binary data, will be null padded. 3480 A A text (ASCII) string, will be space padded. 3481 Z A null terminated (ASCIZ) string, will be null padded. 3482 3483 b A bit string (ascending bit order inside each byte, like vec()). 3484 B A bit string (descending bit order inside each byte). 3485 h A hex string (low nybble first). 3486 H A hex string (high nybble first). 3487 3488 c A signed char value. 3489 C An unsigned char value. Only does bytes. See U for Unicode. 3490 3491 s A signed short value. 3492 S An unsigned short value. 3493 (This 'short' is _exactly_ 16 bits, which may differ from 3494 what a local C compiler calls 'short'. If you want 3495 native-length shorts, use the '!' suffix.) 3496 3497 i A signed integer value. 3498 I An unsigned integer value. 3499 (This 'integer' is _at_least_ 32 bits wide. Its exact 3500 size depends on what a local C compiler calls 'int', 3501 and may even be larger than the 'long' described in 3502 the next item.) 3503 3504 l A signed long value. 3505 L An unsigned long value. 3506 (This 'long' is _exactly_ 32 bits, which may differ from 3507 what a local C compiler calls 'long'. If you want 3508 native-length longs, use the '!' suffix.) 3509 3510 n An unsigned short in "network" (big-endian) order. 3511 N An unsigned long in "network" (big-endian) order. 3512 v An unsigned short in "VAX" (little-endian) order. 3513 V An unsigned long in "VAX" (little-endian) order. 3514 (These 'shorts' and 'longs' are _exactly_ 16 bits and 3515 _exactly_ 32 bits, respectively.) 3516 3517 q A signed quad (64-bit) value. 3518 Q An unsigned quad value. 3519 (Quads are available only if your system supports 64-bit 3520 integer values _and_ if Perl has been compiled to support those. 3521 Causes a fatal error otherwise.) 3522 3523 j A signed integer value (a Perl internal integer, IV). 3524 J An unsigned integer value (a Perl internal unsigned integer, UV). 3525 3526 f A single-precision float in the native format. 3527 d A double-precision float in the native format. 3528 3529 F A floating point value in the native native format 3530 (a Perl internal floating point value, NV). 3531 D A long double-precision float in the native format. 3532 (Long doubles are available only if your system supports long 3533 double values _and_ if Perl has been compiled to support those. 3534 Causes a fatal error otherwise.) 3535 3536 p A pointer to a null-terminated string. 3537 P A pointer to a structure (fixed-length string). 3538 3539 u A uuencoded string. 3540 U A Unicode character number. Encodes to UTF-8 internally 3541 (or UTF-EBCDIC in EBCDIC platforms). 3542 3543 w A BER compressed integer (not an ASN.1 BER, see perlpacktut for 3544 details). Its bytes represent an unsigned integer in base 128, 3545 most significant digit first, with as few digits as possible. Bit 3546 eight (the high bit) is set on each byte except the last. 3547 3548 x A null byte. 3549 X Back up a byte. 3550 @ Null fill to absolute position, counted from the start of 3551 the innermost ()-group. 3552 ( Start of a ()-group. 3553 3554The following rules apply: 3555 3556=over 8 3557 3558=item * 3559 3560Each letter may optionally be followed by a number giving a repeat 3561count. With all types except C<a>, C<A>, C<Z>, C<b>, C<B>, C<h>, 3562C<H>, C<@>, C<x>, C<X> and C<P> the pack function will gobble up that 3563many values from the LIST. A C<*> for the repeat count means to use 3564however many items are left, except for C<@>, C<x>, C<X>, where it is 3565equivalent to C<0>, and C<u>, where it is equivalent to 1 (or 45, what 3566is the same). A numeric repeat count may optionally be enclosed in 3567brackets, as in C<pack 'C[80]', @arr>. 3568 3569One can replace the numeric repeat count by a template enclosed in brackets; 3570then the packed length of this template in bytes is used as a count. 3571For example, C<x[L]> skips a long (it skips the number of bytes in a long); 3572the template C<$t X[$t] $t> unpack()s twice what $t unpacks. 3573If the template in brackets contains alignment commands (such as C<x![d]>), 3574its packed length is calculated as if the start of the template has the maximal 3575possible alignment. 3576 3577When used with C<Z>, C<*> results in the addition of a trailing null 3578byte (so the packed result will be one longer than the byte C<length> 3579of the item). 3580 3581The repeat count for C<u> is interpreted as the maximal number of bytes 3582to encode per line of output, with 0 and 1 replaced by 45. 3583 3584=item * 3585 3586The C<a>, C<A>, and C<Z> types gobble just one value, but pack it as a 3587string of length count, padding with nulls or spaces as necessary. When 3588unpacking, C<A> strips trailing spaces and nulls, C<Z> strips everything 3589after the first null, and C<a> returns data verbatim. When packing, 3590C<a>, and C<Z> are equivalent. 3591 3592If the value-to-pack is too long, it is truncated. If too long and an 3593explicit count is provided, C<Z> packs only C<$count-1> bytes, followed 3594by a null byte. Thus C<Z> always packs a trailing null byte under 3595all circumstances. 3596 3597=item * 3598 3599Likewise, the C<b> and C<B> fields pack a string that many bits long. 3600Each byte of the input field of pack() generates 1 bit of the result. 3601Each result bit is based on the least-significant bit of the corresponding 3602input byte, i.e., on C<ord($byte)%2>. In particular, bytes C<"0"> and 3603C<"1"> generate bits 0 and 1, as do bytes C<"\0"> and C<"\1">. 3604 3605Starting from the beginning of the input string of pack(), each 8-tuple 3606of bytes is converted to 1 byte of output. With format C<b> 3607the first byte of the 8-tuple determines the least-significant bit of a 3608byte, and with format C<B> it determines the most-significant bit of 3609a byte. 3610 3611If the length of the input string is not exactly divisible by 8, the 3612remainder is packed as if the input string were padded by null bytes 3613at the end. Similarly, during unpack()ing the "extra" bits are ignored. 3614 3615If the input string of pack() is longer than needed, extra bytes are ignored. 3616A C<*> for the repeat count of pack() means to use all the bytes of 3617the input field. On unpack()ing the bits are converted to a string 3618of C<"0">s and C<"1">s. 3619 3620=item * 3621 3622The C<h> and C<H> fields pack a string that many nybbles (4-bit groups, 3623representable as hexadecimal digits, 0-9a-f) long. 3624 3625Each byte of the input field of pack() generates 4 bits of the result. 3626For non-alphabetical bytes the result is based on the 4 least-significant 3627bits of the input byte, i.e., on C<ord($byte)%16>. In particular, 3628bytes C<"0"> and C<"1"> generate nybbles 0 and 1, as do bytes 3629C<"\0"> and C<"\1">. For bytes C<"a".."f"> and C<"A".."F"> the result 3630is compatible with the usual hexadecimal digits, so that C<"a"> and 3631C<"A"> both generate the nybble C<0xa==10>. The result for bytes 3632C<"g".."z"> and C<"G".."Z"> is not well-defined. 3633 3634Starting from the beginning of the input string of pack(), each pair 3635of bytes is converted to 1 byte of output. With format C<h> the 3636first byte of the pair determines the least-significant nybble of the 3637output byte, and with format C<H> it determines the most-significant 3638nybble. 3639 3640If the length of the input string is not even, it behaves as if padded 3641by a null byte at the end. Similarly, during unpack()ing the "extra" 3642nybbles are ignored. 3643 3644If the input string of pack() is longer than needed, extra bytes are ignored. 3645A C<*> for the repeat count of pack() means to use all the bytes of 3646the input field. On unpack()ing the bits are converted to a string 3647of hexadecimal digits. 3648 3649=item * 3650 3651The C<p> type packs a pointer to a null-terminated string. You are 3652responsible for ensuring the string is not a temporary value (which can 3653potentially get deallocated before you get around to using the packed result). 3654The C<P> type packs a pointer to a structure of the size indicated by the 3655length. A NULL pointer is created if the corresponding value for C<p> or 3656C<P> is C<undef>, similarly for unpack(). 3657 3658=item * 3659 3660The C</> template character allows packing and unpacking of strings where 3661the packed structure contains a byte count followed by the string itself. 3662You write I<length-item>C</>I<string-item>. 3663 3664The I<length-item> can be any C<pack> template letter, and describes 3665how the length value is packed. The ones likely to be of most use are 3666integer-packing ones like C<n> (for Java strings), C<w> (for ASN.1 or 3667SNMP) and C<N> (for Sun XDR). 3668 3669For C<pack>, the I<string-item> must, at present, be C<"A*">, C<"a*"> or 3670C<"Z*">. For C<unpack> the length of the string is obtained from the 3671I<length-item>, but if you put in the '*' it will be ignored. For all other 3672codes, C<unpack> applies the length value to the next item, which must not 3673have a repeat count. 3674 3675 unpack 'C/a', "\04Gurusamy"; gives 'Guru' 3676 unpack 'a3/A* A*', '007 Bond J '; gives (' Bond','J') 3677 pack 'n/a* w/a*','hello,','world'; gives "\000\006hello,\005world" 3678 3679The I<length-item> is not returned explicitly from C<unpack>. 3680 3681Adding a count to the I<length-item> letter is unlikely to do anything 3682useful, unless that letter is C<A>, C<a> or C<Z>. Packing with a 3683I<length-item> of C<a> or C<Z> may introduce C<"\000"> characters, 3684which Perl does not regard as legal in numeric strings. 3685 3686=item * 3687 3688The integer types C<s>, C<S>, C<l>, and C<L> may be 3689immediately followed by a C<!> suffix to signify native shorts or 3690longs--as you can see from above for example a bare C<l> does mean 3691exactly 32 bits, the native C<long> (as seen by the local C compiler) 3692may be larger. This is an issue mainly in 64-bit platforms. You can 3693see whether using C<!> makes any difference by 3694 3695 print length(pack("s")), " ", length(pack("s!")), "\n"; 3696 print length(pack("l")), " ", length(pack("l!")), "\n"; 3697 3698C<i!> and C<I!> also work but only because of completeness; 3699they are identical to C<i> and C<I>. 3700 3701The actual sizes (in bytes) of native shorts, ints, longs, and long 3702longs on the platform where Perl was built are also available via 3703L<Config>: 3704 3705 use Config; 3706 print $Config{shortsize}, "\n"; 3707 print $Config{intsize}, "\n"; 3708 print $Config{longsize}, "\n"; 3709 print $Config{longlongsize}, "\n"; 3710 3711(The C<$Config{longlongsize}> will be undefined if your system does 3712not support long longs.) 3713 3714=item * 3715 3716The integer formats C<s>, C<S>, C<i>, C<I>, C<l>, C<L>, C<j>, and C<J> 3717are inherently non-portable between processors and operating systems 3718because they obey the native byteorder and endianness. For example a 37194-byte integer 0x12345678 (305419896 decimal) would be ordered natively 3720(arranged in and handled by the CPU registers) into bytes as 3721 3722 0x12 0x34 0x56 0x78 # big-endian 3723 0x78 0x56 0x34 0x12 # little-endian 3724 3725Basically, the Intel and VAX CPUs are little-endian, while everybody 3726else, for example Motorola m68k/88k, PPC, Sparc, HP PA, Power, and 3727Cray are big-endian. Alpha and MIPS can be either: Digital/Compaq 3728used/uses them in little-endian mode; SGI/Cray uses them in big-endian 3729mode. 3730 3731The names `big-endian' and `little-endian' are comic references to 3732the classic "Gulliver's Travels" (via the paper "On Holy Wars and a 3733Plea for Peace" by Danny Cohen, USC/ISI IEN 137, April 1, 1980) and 3734the egg-eating habits of the Lilliputians. 3735 3736Some systems may have even weirder byte orders such as 3737 3738 0x56 0x78 0x12 0x34 3739 0x34 0x12 0x78 0x56 3740 3741You can see your system's preference with 3742 3743 print join(" ", map { sprintf "%#02x", $_ } 3744 unpack("C*",pack("L",0x12345678))), "\n"; 3745 3746The byteorder on the platform where Perl was built is also available 3747via L<Config>: 3748 3749 use Config; 3750 print $Config{byteorder}, "\n"; 3751 3752Byteorders C<'1234'> and C<'12345678'> are little-endian, C<'4321'> 3753and C<'87654321'> are big-endian. 3754 3755If you want portable packed integers use the formats C<n>, C<N>, 3756C<v>, and C<V>, their byte endianness and size are known. 3757See also L<perlport>. 3758 3759=item * 3760 3761Real numbers (floats and doubles) are in the native machine format only; 3762due to the multiplicity of floating formats around, and the lack of a 3763standard "network" representation, no facility for interchange has been 3764made. This means that packed floating point data written on one machine 3765may not be readable on another - even if both use IEEE floating point 3766arithmetic (as the endian-ness of the memory representation is not part 3767of the IEEE spec). See also L<perlport>. 3768 3769Note that Perl uses doubles internally for all numeric calculation, and 3770converting from double into float and thence back to double again will 3771lose precision (i.e., C<unpack("f", pack("f", $foo)>) will not in general 3772equal $foo). 3773 3774=item * 3775 3776If the pattern begins with a C<U>, the resulting string will be 3777treated as UTF-8-encoded Unicode. You can force UTF-8 encoding on in a 3778string with an initial C<U0>, and the bytes that follow will be 3779interpreted as Unicode characters. If you don't want this to happen, 3780you can begin your pattern with C<C0> (or anything else) to force Perl 3781not to UTF-8 encode your string, and then follow this with a C<U*> 3782somewhere in your pattern. 3783 3784=item * 3785 3786You must yourself do any alignment or padding by inserting for example 3787enough C<'x'>es while packing. There is no way to pack() and unpack() 3788could know where the bytes are going to or coming from. Therefore 3789C<pack> (and C<unpack>) handle their output and input as flat 3790sequences of bytes. 3791 3792=item * 3793 3794A ()-group is a sub-TEMPLATE enclosed in parentheses. A group may 3795take a repeat count, both as postfix, and for unpack() also via the C</> 3796template character. Within each repetition of a group, positioning with 3797C<@> starts again at 0. Therefore, the result of 3798 3799 pack( '@1A((@2A)@3A)', 'a', 'b', 'c' ) 3800 3801is the string "\0a\0\0bc". 3802 3803 3804=item * 3805 3806C<x> and C<X> accept C<!> modifier. In this case they act as 3807alignment commands: they jump forward/back to the closest position 3808aligned at a multiple of C<count> bytes. For example, to pack() or 3809unpack() C's C<struct {char c; double d; char cc[2]}> one may need to 3810use the template C<C x![d] d C[2]>; this assumes that doubles must be 3811aligned on the double's size. 3812 3813For alignment commands C<count> of 0 is equivalent to C<count> of 1; 3814both result in no-ops. 3815 3816=item * 3817 3818A comment in a TEMPLATE starts with C<#> and goes to the end of line. 3819White space may be used to separate pack codes from each other, but 3820a C<!> modifier and a repeat count must follow immediately. 3821 3822=item * 3823 3824If TEMPLATE requires more arguments to pack() than actually given, pack() 3825assumes additional C<""> arguments. If TEMPLATE requires fewer arguments 3826to pack() than actually given, extra arguments are ignored. 3827 3828=back 3829 3830Examples: 3831 3832 $foo = pack("CCCC",65,66,67,68); 3833 # foo eq "ABCD" 3834 $foo = pack("C4",65,66,67,68); 3835 # same thing 3836 $foo = pack("U4",0x24b6,0x24b7,0x24b8,0x24b9); 3837 # same thing with Unicode circled letters 3838 3839 $foo = pack("ccxxcc",65,66,67,68); 3840 # foo eq "AB\0\0CD" 3841 3842 # note: the above examples featuring "C" and "c" are true 3843 # only on ASCII and ASCII-derived systems such as ISO Latin 1 3844 # and UTF-8. In EBCDIC the first example would be 3845 # $foo = pack("CCCC",193,194,195,196); 3846 3847 $foo = pack("s2",1,2); 3848 # "\1\0\2\0" on little-endian 3849 # "\0\1\0\2" on big-endian 3850 3851 $foo = pack("a4","abcd","x","y","z"); 3852 # "abcd" 3853 3854 $foo = pack("aaaa","abcd","x","y","z"); 3855 # "axyz" 3856 3857 $foo = pack("a14","abcdefg"); 3858 # "abcdefg\0\0\0\0\0\0\0" 3859 3860 $foo = pack("i9pl", gmtime); 3861 # a real struct tm (on my system anyway) 3862 3863 $utmp_template = "Z8 Z8 Z16 L"; 3864 $utmp = pack($utmp_template, @utmp1); 3865 # a struct utmp (BSDish) 3866 3867 @utmp2 = unpack($utmp_template, $utmp); 3868 # "@utmp1" eq "@utmp2" 3869 3870 sub bintodec { 3871 unpack("N", pack("B32", substr("0" x 32 . shift, -32))); 3872 } 3873 3874 $foo = pack('sx2l', 12, 34); 3875 # short 12, two zero bytes padding, long 34 3876 $bar = pack('s@4l', 12, 34); 3877 # short 12, zero fill to position 4, long 34 3878 # $foo eq $bar 3879 3880The same template may generally also be used in unpack(). 3881 3882=item package NAMESPACE 3883X<package> X<module> X<namespace> 3884 3885=item package 3886 3887Declares the compilation unit as being in the given namespace. The scope 3888of the package declaration is from the declaration itself through the end 3889of the enclosing block, file, or eval (the same as the C<my> operator). 3890All further unqualified dynamic identifiers will be in this namespace. 3891A package statement affects only dynamic variables--including those 3892you've used C<local> on--but I<not> lexical variables, which are created 3893with C<my>. Typically it would be the first declaration in a file to 3894be included by the C<require> or C<use> operator. You can switch into a 3895package in more than one place; it merely influences which symbol table 3896is used by the compiler for the rest of that block. You can refer to 3897variables and filehandles in other packages by prefixing the identifier 3898with the package name and a double colon: C<$Package::Variable>. 3899If the package name is null, the C<main> package as assumed. That is, 3900C<$::sail> is equivalent to C<$main::sail> (as well as to C<$main'sail>, 3901still seen in older code). 3902 3903If NAMESPACE is omitted, then there is no current package, and all 3904identifiers must be fully qualified or lexicals. However, you are 3905strongly advised not to make use of this feature. Its use can cause 3906unexpected behaviour, even crashing some versions of Perl. It is 3907deprecated, and will be removed from a future release. 3908 3909See L<perlmod/"Packages"> for more information about packages, modules, 3910and classes. See L<perlsub> for other scoping issues. 3911 3912=item pipe READHANDLE,WRITEHANDLE 3913X<pipe> 3914 3915Opens a pair of connected pipes like the corresponding system call. 3916Note that if you set up a loop of piped processes, deadlock can occur 3917unless you are very careful. In addition, note that Perl's pipes use 3918IO buffering, so you may need to set C<$|> to flush your WRITEHANDLE 3919after each command, depending on the application. 3920 3921See L<IPC::Open2>, L<IPC::Open3>, and L<perlipc/"Bidirectional Communication"> 3922for examples of such things. 3923 3924On systems that support a close-on-exec flag on files, the flag will be set 3925for the newly opened file descriptors as determined by the value of $^F. 3926See L<perlvar/$^F>. 3927 3928=item pop ARRAY 3929X<pop> X<stack> 3930 3931=item pop 3932 3933Pops and returns the last value of the array, shortening the array by 3934one element. Has an effect similar to 3935 3936 $ARRAY[$#ARRAY--] 3937 3938If there are no elements in the array, returns the undefined value 3939(although this may happen at other times as well). If ARRAY is 3940omitted, pops the C<@ARGV> array in the main program, and the C<@_> 3941array in subroutines, just like C<shift>. 3942 3943=item pos SCALAR 3944X<pos> X<match, position> 3945 3946=item pos 3947 3948Returns the offset of where the last C<m//g> search left off for the variable 3949in question (C<$_> is used when the variable is not specified). Note that 39500 is a valid match offset. C<undef> indicates that the search position 3951is reset (usually due to match failure, but can also be because no match has 3952yet been performed on the scalar). C<pos> directly accesses the location used 3953by the regexp engine to store the offset, so assigning to C<pos> will change 3954that offset, and so will also influence the C<\G> zero-width assertion in 3955regular expressions. Because a failed C<m//gc> match doesn't reset the offset, 3956the return from C<pos> won't change either in this case. See L<perlre> and 3957L<perlop>. 3958 3959=item print FILEHANDLE LIST 3960X<print> 3961 3962=item print LIST 3963 3964=item print 3965 3966Prints a string or a list of strings. Returns true if successful. 3967FILEHANDLE may be a scalar variable name, in which case the variable 3968contains the name of or a reference to the filehandle, thus introducing 3969one level of indirection. (NOTE: If FILEHANDLE is a variable and 3970the next token is a term, it may be misinterpreted as an operator 3971unless you interpose a C<+> or put parentheses around the arguments.) 3972If FILEHANDLE is omitted, prints by default to standard output (or 3973to the last selected output channel--see L</select>). If LIST is 3974also omitted, prints C<$_> to the currently selected output channel. 3975To set the default output channel to something other than STDOUT 3976use the select operation. The current value of C<$,> (if any) is 3977printed between each LIST item. The current value of C<$\> (if 3978any) is printed after the entire LIST has been printed. Because 3979print takes a LIST, anything in the LIST is evaluated in list 3980context, and any subroutine that you call will have one or more of 3981its expressions evaluated in list context. Also be careful not to 3982follow the print keyword with a left parenthesis unless you want 3983the corresponding right parenthesis to terminate the arguments to 3984the print--interpose a C<+> or put parentheses around all the 3985arguments. 3986 3987Note that if you're storing FILEHANDLEs in an array, or if you're using 3988any other expression more complex than a scalar variable to retrieve it, 3989you will have to use a block returning the filehandle value instead: 3990 3991 print { $files[$i] } "stuff\n"; 3992 print { $OK ? STDOUT : STDERR } "stuff\n"; 3993 3994=item printf FILEHANDLE FORMAT, LIST 3995X<printf> 3996 3997=item printf FORMAT, LIST 3998 3999Equivalent to C<print FILEHANDLE sprintf(FORMAT, LIST)>, except that C<$\> 4000(the output record separator) is not appended. The first argument 4001of the list will be interpreted as the C<printf> format. See C<sprintf> 4002for an explanation of the format argument. If C<use locale> is in effect, 4003the character used for the decimal point in formatted real numbers is 4004affected by the LC_NUMERIC locale. See L<perllocale>. 4005 4006Don't fall into the trap of using a C<printf> when a simple 4007C<print> would do. The C<print> is more efficient and less 4008error prone. 4009 4010=item prototype FUNCTION 4011X<prototype> 4012 4013Returns the prototype of a function as a string (or C<undef> if the 4014function has no prototype). FUNCTION is a reference to, or the name of, 4015the function whose prototype you want to retrieve. 4016 4017If FUNCTION is a string starting with C<CORE::>, the rest is taken as a 4018name for Perl builtin. If the builtin is not I<overridable> (such as 4019C<qw//>) or its arguments cannot be expressed by a prototype (such as 4020C<system>) returns C<undef> because the builtin does not really behave 4021like a Perl function. Otherwise, the string describing the equivalent 4022prototype is returned. 4023 4024=item push ARRAY,LIST 4025X<push>, X<stack> 4026 4027Treats ARRAY as a stack, and pushes the values of LIST 4028onto the end of ARRAY. The length of ARRAY increases by the length of 4029LIST. Has the same effect as 4030 4031 for $value (LIST) { 4032 $ARRAY[++$#ARRAY] = $value; 4033 } 4034 4035but is more efficient. Returns the number of elements in the array following 4036the completed C<push>. 4037 4038=item q/STRING/ 4039 4040=item qq/STRING/ 4041 4042=item qr/STRING/ 4043 4044=item qx/STRING/ 4045 4046=item qw/STRING/ 4047 4048Generalized quotes. See L<perlop/"Regexp Quote-Like Operators">. 4049 4050=item quotemeta EXPR 4051X<quotemeta> X<metacharacter> 4052 4053=item quotemeta 4054 4055Returns the value of EXPR with all non-"word" 4056characters backslashed. (That is, all characters not matching 4057C</[A-Za-z_0-9]/> will be preceded by a backslash in the 4058returned string, regardless of any locale settings.) 4059This is the internal function implementing 4060the C<\Q> escape in double-quoted strings. 4061 4062If EXPR is omitted, uses C<$_>. 4063 4064=item rand EXPR 4065X<rand> X<random> 4066 4067=item rand 4068 4069Returns a random fractional number greater than or equal to C<0> and less 4070than the value of EXPR. (EXPR should be positive.) If EXPR is 4071omitted, the value C<1> is used. Currently EXPR with the value C<0> is 4072also special-cased as C<1> - this has not been documented before perl 5.8.0 4073and is subject to change in future versions of perl. Automatically calls 4074C<srand> unless C<srand> has already been called. See also C<srand>. 4075 4076Apply C<int()> to the value returned by C<rand()> if you want random 4077integers instead of random fractional numbers. For example, 4078 4079 int(rand(10)) 4080 4081returns a random integer between C<0> and C<9>, inclusive. 4082 4083(Note: If your rand function consistently returns numbers that are too 4084large or too small, then your version of Perl was probably compiled 4085with the wrong number of RANDBITS.) 4086 4087=item read FILEHANDLE,SCALAR,LENGTH,OFFSET 4088X<read> 4089 4090=item read FILEHANDLE,SCALAR,LENGTH 4091 4092Attempts to read LENGTH I<characters> of data into variable SCALAR 4093from the specified FILEHANDLE. Returns the number of characters 4094actually read, C<0> at end of file, or undef if there was an error (in 4095the latter case C<$!> is also set). SCALAR will be grown or shrunk 4096so that the last character actually read is the last character of the 4097scalar after the read. 4098 4099An OFFSET may be specified to place the read data at some place in the 4100string other than the beginning. A negative OFFSET specifies 4101placement at that many characters counting backwards from the end of 4102the string. A positive OFFSET greater than the length of SCALAR 4103results in the string being padded to the required size with C<"\0"> 4104bytes before the result of the read is appended. 4105 4106The call is actually implemented in terms of either Perl's or system's 4107fread() call. To get a true read(2) system call, see C<sysread>. 4108 4109Note the I<characters>: depending on the status of the filehandle, 4110either (8-bit) bytes or characters are read. By default all 4111filehandles operate on bytes, but for example if the filehandle has 4112been opened with the C<:utf8> I/O layer (see L</open>, and the C<open> 4113pragma, L<open>), the I/O will operate on UTF-8 encoded Unicode 4114characters, not bytes. Similarly for the C<:encoding> pragma: 4115in that case pretty much any characters can be read. 4116 4117=item readdir DIRHANDLE 4118X<readdir> 4119 4120Returns the next directory entry for a directory opened by C<opendir>. 4121If used in list context, returns all the rest of the entries in the 4122directory. If there are no more entries, returns an undefined value in 4123scalar context or a null list in list context. 4124 4125If you're planning to filetest the return values out of a C<readdir>, you'd 4126better prepend the directory in question. Otherwise, because we didn't 4127C<chdir> there, it would have been testing the wrong file. 4128 4129 opendir(DIR, $some_dir) || die "can't opendir $some_dir: $!"; 4130 @dots = grep { /^\./ && -f "$some_dir/$_" } readdir(DIR); 4131 closedir DIR; 4132 4133=item readline EXPR 4134X<readline> X<gets> X<fgets> 4135 4136Reads from the filehandle whose typeglob is contained in EXPR. In scalar 4137context, each call reads and returns the next line, until end-of-file is 4138reached, whereupon the subsequent call returns undef. In list context, 4139reads until end-of-file is reached and returns a list of lines. Note that 4140the notion of "line" used here is however you may have defined it 4141with C<$/> or C<$INPUT_RECORD_SEPARATOR>). See L<perlvar/"$/">. 4142 4143When C<$/> is set to C<undef>, when readline() is in scalar 4144context (i.e. file slurp mode), and when an empty file is read, it 4145returns C<''> the first time, followed by C<undef> subsequently. 4146 4147This is the internal function implementing the C<< <EXPR> >> 4148operator, but you can use it directly. The C<< <EXPR> >> 4149operator is discussed in more detail in L<perlop/"I/O Operators">. 4150 4151 $line = <STDIN>; 4152 $line = readline(*STDIN); # same thing 4153 4154If readline encounters an operating system error, C<$!> will be set with the 4155corresponding error message. It can be helpful to check C<$!> when you are 4156reading from filehandles you don't trust, such as a tty or a socket. The 4157following example uses the operator form of C<readline>, and takes the necessary 4158steps to ensure that C<readline> was successful. 4159 4160 for (;;) { 4161 undef $!; 4162 unless (defined( $line = <> )) { 4163 die $! if $!; 4164 last; # reached EOF 4165 } 4166 # ... 4167 } 4168 4169=item readlink EXPR 4170X<readlink> 4171 4172=item readlink 4173 4174Returns the value of a symbolic link, if symbolic links are 4175implemented. If not, gives a fatal error. If there is some system 4176error, returns the undefined value and sets C<$!> (errno). If EXPR is 4177omitted, uses C<$_>. 4178 4179=item readpipe EXPR 4180X<readpipe> 4181 4182EXPR is executed as a system command. 4183The collected standard output of the command is returned. 4184In scalar context, it comes back as a single (potentially 4185multi-line) string. In list context, returns a list of lines 4186(however you've defined lines with C<$/> or C<$INPUT_RECORD_SEPARATOR>). 4187This is the internal function implementing the C<qx/EXPR/> 4188operator, but you can use it directly. The C<qx/EXPR/> 4189operator is discussed in more detail in L<perlop/"I/O Operators">. 4190 4191=item recv SOCKET,SCALAR,LENGTH,FLAGS 4192X<recv> 4193 4194Receives a message on a socket. Attempts to receive LENGTH characters 4195of data into variable SCALAR from the specified SOCKET filehandle. 4196SCALAR will be grown or shrunk to the length actually read. Takes the 4197same flags as the system call of the same name. Returns the address 4198of the sender if SOCKET's protocol supports this; returns an empty 4199string otherwise. If there's an error, returns the undefined value. 4200This call is actually implemented in terms of recvfrom(2) system call. 4201See L<perlipc/"UDP: Message Passing"> for examples. 4202 4203Note the I<characters>: depending on the status of the socket, either 4204(8-bit) bytes or characters are received. By default all sockets 4205operate on bytes, but for example if the socket has been changed using 4206binmode() to operate with the C<:utf8> I/O layer (see the C<open> 4207pragma, L<open>), the I/O will operate on UTF-8 encoded Unicode 4208characters, not bytes. Similarly for the C<:encoding> pragma: 4209in that case pretty much any characters can be read. 4210 4211=item redo LABEL 4212X<redo> 4213 4214=item redo 4215 4216The C<redo> command restarts the loop block without evaluating the 4217conditional again. The C<continue> block, if any, is not executed. If 4218the LABEL is omitted, the command refers to the innermost enclosing 4219loop. Programs that want to lie to themselves about what was just input 4220normally use this command: 4221 4222 # a simpleminded Pascal comment stripper 4223 # (warning: assumes no { or } in strings) 4224 LINE: while (<STDIN>) { 4225 while (s|({.*}.*){.*}|$1 |) {} 4226 s|{.*}| |; 4227 if (s|{.*| |) { 4228 $front = $_; 4229 while (<STDIN>) { 4230 if (/}/) { # end of comment? 4231 s|^|$front\{|; 4232 redo LINE; 4233 } 4234 } 4235 } 4236 print; 4237 } 4238 4239C<redo> cannot be used to retry a block which returns a value such as 4240C<eval {}>, C<sub {}> or C<do {}>, and should not be used to exit 4241a grep() or map() operation. 4242 4243Note that a block by itself is semantically identical to a loop 4244that executes once. Thus C<redo> inside such a block will effectively 4245turn it into a looping construct. 4246 4247See also L</continue> for an illustration of how C<last>, C<next>, and 4248C<redo> work. 4249 4250=item ref EXPR 4251X<ref> X<reference> 4252 4253=item ref 4254 4255Returns a non-empty string if EXPR is a reference, the empty 4256string otherwise. If EXPR 4257is not specified, C<$_> will be used. The value returned depends on the 4258type of thing the reference is a reference to. 4259Builtin types include: 4260 4261 SCALAR 4262 ARRAY 4263 HASH 4264 CODE 4265 REF 4266 GLOB 4267 LVALUE 4268 4269If the referenced object has been blessed into a package, then that package 4270name is returned instead. You can think of C<ref> as a C<typeof> operator. 4271 4272 if (ref($r) eq "HASH") { 4273 print "r is a reference to a hash.\n"; 4274 } 4275 unless (ref($r)) { 4276 print "r is not a reference at all.\n"; 4277 } 4278 4279See also L<perlref>. 4280 4281=item rename OLDNAME,NEWNAME 4282X<rename> X<move> X<mv> X<ren> 4283 4284Changes the name of a file; an existing file NEWNAME will be 4285clobbered. Returns true for success, false otherwise. 4286 4287Behavior of this function varies wildly depending on your system 4288implementation. For example, it will usually not work across file system 4289boundaries, even though the system I<mv> command sometimes compensates 4290for this. Other restrictions include whether it works on directories, 4291open files, or pre-existing files. Check L<perlport> and either the 4292rename(2) manpage or equivalent system documentation for details. 4293 4294=item require VERSION 4295X<require> 4296 4297=item require EXPR 4298 4299=item require 4300 4301Demands a version of Perl specified by VERSION, or demands some semantics 4302specified by EXPR or by C<$_> if EXPR is not supplied. 4303 4304VERSION may be either a numeric argument such as 5.006, which will be 4305compared to C<$]>, or a literal of the form v5.6.1, which will be compared 4306to C<$^V> (aka $PERL_VERSION). A fatal error is produced at run time if 4307VERSION is greater than the version of the current Perl interpreter. 4308Compare with L</use>, which can do a similar check at compile time. 4309 4310Specifying VERSION as a literal of the form v5.6.1 should generally be 4311avoided, because it leads to misleading error messages under earlier 4312versions of Perl that do not support this syntax. The equivalent numeric 4313version should be used instead. 4314 4315 require v5.6.1; # run time version check 4316 require 5.6.1; # ditto 4317 require 5.006_001; # ditto; preferred for backwards compatibility 4318 4319Otherwise, C<require> demands that a library file be included if it 4320hasn't already been included. The file is included via the do-FILE 4321mechanism, which is essentially just a variety of C<eval>. Has 4322semantics similar to the following subroutine: 4323 4324 sub require { 4325 my ($filename) = @_; 4326 if (exists $INC{$filename}) { 4327 return 1 if $INC{$filename}; 4328 die "Compilation failed in require"; 4329 } 4330 my ($realfilename,$result); 4331 ITER: { 4332 foreach $prefix (@INC) { 4333 $realfilename = "$prefix/$filename"; 4334 if (-f $realfilename) { 4335 $INC{$filename} = $realfilename; 4336 $result = do $realfilename; 4337 last ITER; 4338 } 4339 } 4340 die "Can't find $filename in \@INC"; 4341 } 4342 if ($@) { 4343 $INC{$filename} = undef; 4344 die $@; 4345 } elsif (!$result) { 4346 delete $INC{$filename}; 4347 die "$filename did not return true value"; 4348 } else { 4349 return $result; 4350 } 4351 } 4352 4353Note that the file will not be included twice under the same specified 4354name. 4355 4356The file must return true as the last statement to indicate 4357successful execution of any initialization code, so it's customary to 4358end such a file with C<1;> unless you're sure it'll return true 4359otherwise. But it's better just to put the C<1;>, in case you add more 4360statements. 4361 4362If EXPR is a bareword, the require assumes a "F<.pm>" extension and 4363replaces "F<::>" with "F</>" in the filename for you, 4364to make it easy to load standard modules. This form of loading of 4365modules does not risk altering your namespace. 4366 4367In other words, if you try this: 4368 4369 require Foo::Bar; # a splendid bareword 4370 4371The require function will actually look for the "F<Foo/Bar.pm>" file in the 4372directories specified in the C<@INC> array. 4373 4374But if you try this: 4375 4376 $class = 'Foo::Bar'; 4377 require $class; # $class is not a bareword 4378 #or 4379 require "Foo::Bar"; # not a bareword because of the "" 4380 4381The require function will look for the "F<Foo::Bar>" file in the @INC array and 4382will complain about not finding "F<Foo::Bar>" there. In this case you can do: 4383 4384 eval "require $class"; 4385 4386Now that you understand how C<require> looks for files in the case of 4387a bareword argument, there is a little extra functionality going on 4388behind the scenes. Before C<require> looks for a "F<.pm>" extension, 4389it will first look for a filename with a "F<.pmc>" extension. A file 4390with this extension is assumed to be Perl bytecode generated by 4391L<B::Bytecode|B::Bytecode>. If this file is found, and its modification 4392time is newer than a coinciding "F<.pm>" non-compiled file, it will be 4393loaded in place of that non-compiled file ending in a "F<.pm>" extension. 4394 4395You can also insert hooks into the import facility, by putting directly 4396Perl code into the @INC array. There are three forms of hooks: subroutine 4397references, array references and blessed objects. 4398 4399Subroutine references are the simplest case. When the inclusion system 4400walks through @INC and encounters a subroutine, this subroutine gets 4401called with two parameters, the first being a reference to itself, and the 4402second the name of the file to be included (e.g. "F<Foo/Bar.pm>"). The 4403subroutine should return C<undef> or a filehandle, from which the file to 4404include will be read. If C<undef> is returned, C<require> will look at 4405the remaining elements of @INC. 4406 4407If the hook is an array reference, its first element must be a subroutine 4408reference. This subroutine is called as above, but the first parameter is 4409the array reference. This enables to pass indirectly some arguments to 4410the subroutine. 4411 4412In other words, you can write: 4413 4414 push @INC, \&my_sub; 4415 sub my_sub { 4416 my ($coderef, $filename) = @_; # $coderef is \&my_sub 4417 ... 4418 } 4419 4420or: 4421 4422 push @INC, [ \&my_sub, $x, $y, ... ]; 4423 sub my_sub { 4424 my ($arrayref, $filename) = @_; 4425 # Retrieve $x, $y, ... 4426 my @parameters = @$arrayref[1..$#$arrayref]; 4427 ... 4428 } 4429 4430If the hook is an object, it must provide an INC method that will be 4431called as above, the first parameter being the object itself. (Note that 4432you must fully qualify the sub's name, as it is always forced into package 4433C<main>.) Here is a typical code layout: 4434 4435 # In Foo.pm 4436 package Foo; 4437 sub new { ... } 4438 sub Foo::INC { 4439 my ($self, $filename) = @_; 4440 ... 4441 } 4442 4443 # In the main program 4444 push @INC, new Foo(...); 4445 4446Note that these hooks are also permitted to set the %INC entry 4447corresponding to the files they have loaded. See L<perlvar/%INC>. 4448 4449For a yet-more-powerful import facility, see L</use> and L<perlmod>. 4450 4451=item reset EXPR 4452X<reset> 4453 4454=item reset 4455 4456Generally used in a C<continue> block at the end of a loop to clear 4457variables and reset C<??> searches so that they work again. The 4458expression is interpreted as a list of single characters (hyphens 4459allowed for ranges). All variables and arrays beginning with one of 4460those letters are reset to their pristine state. If the expression is 4461omitted, one-match searches (C<?pattern?>) are reset to match again. Resets 4462only variables or searches in the current package. Always returns 44631. Examples: 4464 4465 reset 'X'; # reset all X variables 4466 reset 'a-z'; # reset lower case variables 4467 reset; # just reset ?one-time? searches 4468 4469Resetting C<"A-Z"> is not recommended because you'll wipe out your 4470C<@ARGV> and C<@INC> arrays and your C<%ENV> hash. Resets only package 4471variables--lexical variables are unaffected, but they clean themselves 4472up on scope exit anyway, so you'll probably want to use them instead. 4473See L</my>. 4474 4475=item return EXPR 4476X<return> 4477 4478=item return 4479 4480Returns from a subroutine, C<eval>, or C<do FILE> with the value 4481given in EXPR. Evaluation of EXPR may be in list, scalar, or void 4482context, depending on how the return value will be used, and the context 4483may vary from one execution to the next (see C<wantarray>). If no EXPR 4484is given, returns an empty list in list context, the undefined value in 4485scalar context, and (of course) nothing at all in a void context. 4486 4487(Note that in the absence of an explicit C<return>, a subroutine, eval, 4488or do FILE will automatically return the value of the last expression 4489evaluated.) 4490 4491=item reverse LIST 4492X<reverse> X<rev> X<invert> 4493 4494In list context, returns a list value consisting of the elements 4495of LIST in the opposite order. In scalar context, concatenates the 4496elements of LIST and returns a string value with all characters 4497in the opposite order. 4498 4499 print reverse <>; # line tac, last line first 4500 4501 undef $/; # for efficiency of <> 4502 print scalar reverse <>; # character tac, last line tsrif 4503 4504Used without arguments in scalar context, reverse() reverses C<$_>. 4505 4506This operator is also handy for inverting a hash, although there are some 4507caveats. If a value is duplicated in the original hash, only one of those 4508can be represented as a key in the inverted hash. Also, this has to 4509unwind one hash and build a whole new one, which may take some time 4510on a large hash, such as from a DBM file. 4511 4512 %by_name = reverse %by_address; # Invert the hash 4513 4514=item rewinddir DIRHANDLE 4515X<rewinddir> 4516 4517Sets the current position to the beginning of the directory for the 4518C<readdir> routine on DIRHANDLE. 4519 4520=item rindex STR,SUBSTR,POSITION 4521X<rindex> 4522 4523=item rindex STR,SUBSTR 4524 4525Works just like index() except that it returns the position of the I<last> 4526occurrence of SUBSTR in STR. If POSITION is specified, returns the 4527last occurrence beginning at or before that position. 4528 4529=item rmdir FILENAME 4530X<rmdir> X<rd> X<directory, remove> 4531 4532=item rmdir 4533 4534Deletes the directory specified by FILENAME if that directory is 4535empty. If it succeeds it returns true, otherwise it returns false and 4536sets C<$!> (errno). If FILENAME is omitted, uses C<$_>. 4537 4538=item s/// 4539 4540The substitution operator. See L<perlop>. 4541 4542=item scalar EXPR 4543X<scalar> X<context> 4544 4545Forces EXPR to be interpreted in scalar context and returns the value 4546of EXPR. 4547 4548 @counts = ( scalar @a, scalar @b, scalar @c ); 4549 4550There is no equivalent operator to force an expression to 4551be interpolated in list context because in practice, this is never 4552needed. If you really wanted to do so, however, you could use 4553the construction C<@{[ (some expression) ]}>, but usually a simple 4554C<(some expression)> suffices. 4555 4556Because C<scalar> is unary operator, if you accidentally use for EXPR a 4557parenthesized list, this behaves as a scalar comma expression, evaluating 4558all but the last element in void context and returning the final element 4559evaluated in scalar context. This is seldom what you want. 4560 4561The following single statement: 4562 4563 print uc(scalar(&foo,$bar)),$baz; 4564 4565is the moral equivalent of these two: 4566 4567 &foo; 4568 print(uc($bar),$baz); 4569 4570See L<perlop> for more details on unary operators and the comma operator. 4571 4572=item seek FILEHANDLE,POSITION,WHENCE 4573X<seek> X<fseek> X<filehandle, position> 4574 4575Sets FILEHANDLE's position, just like the C<fseek> call of C<stdio>. 4576FILEHANDLE may be an expression whose value gives the name of the 4577filehandle. The values for WHENCE are C<0> to set the new position 4578I<in bytes> to POSITION, C<1> to set it to the current position plus 4579POSITION, and C<2> to set it to EOF plus POSITION (typically 4580negative). For WHENCE you may use the constants C<SEEK_SET>, 4581C<SEEK_CUR>, and C<SEEK_END> (start of the file, current position, end 4582of the file) from the Fcntl module. Returns C<1> upon success, C<0> 4583otherwise. 4584 4585Note the I<in bytes>: even if the filehandle has been set to 4586operate on characters (for example by using the C<:utf8> open 4587layer), tell() will return byte offsets, not character offsets 4588(because implementing that would render seek() and tell() rather slow). 4589 4590If you want to position file for C<sysread> or C<syswrite>, don't use 4591C<seek>--buffering makes its effect on the file's system position 4592unpredictable and non-portable. Use C<sysseek> instead. 4593 4594Due to the rules and rigors of ANSI C, on some systems you have to do a 4595seek whenever you switch between reading and writing. Amongst other 4596things, this may have the effect of calling stdio's clearerr(3). 4597A WHENCE of C<1> (C<SEEK_CUR>) is useful for not moving the file position: 4598 4599 seek(TEST,0,1); 4600 4601This is also useful for applications emulating C<tail -f>. Once you hit 4602EOF on your read, and then sleep for a while, you might have to stick in a 4603seek() to reset things. The C<seek> doesn't change the current position, 4604but it I<does> clear the end-of-file condition on the handle, so that the 4605next C<< <FILE> >> makes Perl try again to read something. We hope. 4606 4607If that doesn't work (some IO implementations are particularly 4608cantankerous), then you may need something more like this: 4609 4610 for (;;) { 4611 for ($curpos = tell(FILE); $_ = <FILE>; 4612 $curpos = tell(FILE)) { 4613 # search for some stuff and put it into files 4614 } 4615 sleep($for_a_while); 4616 seek(FILE, $curpos, 0); 4617 } 4618 4619=item seekdir DIRHANDLE,POS 4620X<seekdir> 4621 4622Sets the current position for the C<readdir> routine on DIRHANDLE. POS 4623must be a value returned by C<telldir>. C<seekdir> also has the same caveats 4624about possible directory compaction as the corresponding system library 4625routine. 4626 4627=item select FILEHANDLE 4628X<select> X<filehandle, default> 4629 4630=item select 4631 4632Returns the currently selected filehandle. Sets the current default 4633filehandle for output, if FILEHANDLE is supplied. This has two 4634effects: first, a C<write> or a C<print> without a filehandle will 4635default to this FILEHANDLE. Second, references to variables related to 4636output will refer to this output channel. For example, if you have to 4637set the top of form format for more than one output channel, you might 4638do the following: 4639 4640 select(REPORT1); 4641 $^ = 'report1_top'; 4642 select(REPORT2); 4643 $^ = 'report2_top'; 4644 4645FILEHANDLE may be an expression whose value gives the name of the 4646actual filehandle. Thus: 4647 4648 $oldfh = select(STDERR); $| = 1; select($oldfh); 4649 4650Some programmers may prefer to think of filehandles as objects with 4651methods, preferring to write the last example as: 4652 4653 use IO::Handle; 4654 STDERR->autoflush(1); 4655 4656=item select RBITS,WBITS,EBITS,TIMEOUT 4657X<select> 4658 4659This calls the select(2) system call with the bit masks specified, which 4660can be constructed using C<fileno> and C<vec>, along these lines: 4661 4662 $rin = $win = $ein = ''; 4663 vec($rin,fileno(STDIN),1) = 1; 4664 vec($win,fileno(STDOUT),1) = 1; 4665 $ein = $rin | $win; 4666 4667If you want to select on many filehandles you might wish to write a 4668subroutine: 4669 4670 sub fhbits { 4671 my(@fhlist) = split(' ',$_[0]); 4672 my($bits); 4673 for (@fhlist) { 4674 vec($bits,fileno($_),1) = 1; 4675 } 4676 $bits; 4677 } 4678 $rin = fhbits('STDIN TTY SOCK'); 4679 4680The usual idiom is: 4681 4682 ($nfound,$timeleft) = 4683 select($rout=$rin, $wout=$win, $eout=$ein, $timeout); 4684 4685or to block until something becomes ready just do this 4686 4687 $nfound = select($rout=$rin, $wout=$win, $eout=$ein, undef); 4688 4689Most systems do not bother to return anything useful in $timeleft, so 4690calling select() in scalar context just returns $nfound. 4691 4692Any of the bit masks can also be undef. The timeout, if specified, is 4693in seconds, which may be fractional. Note: not all implementations are 4694capable of returning the $timeleft. If not, they always return 4695$timeleft equal to the supplied $timeout. 4696 4697You can effect a sleep of 250 milliseconds this way: 4698 4699 select(undef, undef, undef, 0.25); 4700 4701Note that whether C<select> gets restarted after signals (say, SIGALRM) 4702is implementation-dependent. See also L<perlport> for notes on the 4703portability of C<select>. 4704 4705On error, C<select> behaves like the select(2) system call : it returns 4706-1 and sets C<$!>. 4707 4708Note: on some Unixes, the select(2) system call may report a socket file 4709descriptor as "ready for reading", when actually no data is available, 4710thus a subsequent read blocks. It can be avoided using always the 4711O_NONBLOCK flag on the socket. See select(2) and fcntl(2) for further 4712details. 4713 4714B<WARNING>: One should not attempt to mix buffered I/O (like C<read> 4715or <FH>) with C<select>, except as permitted by POSIX, and even 4716then only on POSIX systems. You have to use C<sysread> instead. 4717 4718=item semctl ID,SEMNUM,CMD,ARG 4719X<semctl> 4720 4721Calls the System V IPC function C<semctl>. You'll probably have to say 4722 4723 use IPC::SysV; 4724 4725first to get the correct constant definitions. If CMD is IPC_STAT or 4726GETALL, then ARG must be a variable that will hold the returned 4727semid_ds structure or semaphore value array. Returns like C<ioctl>: 4728the undefined value for error, "C<0 but true>" for zero, or the actual 4729return value otherwise. The ARG must consist of a vector of native 4730short integers, which may be created with C<pack("s!",(0)x$nsem)>. 4731See also L<perlipc/"SysV IPC">, C<IPC::SysV>, C<IPC::Semaphore> 4732documentation. 4733 4734=item semget KEY,NSEMS,FLAGS 4735X<semget> 4736 4737Calls the System V IPC function semget. Returns the semaphore id, or 4738the undefined value if there is an error. See also 4739L<perlipc/"SysV IPC">, C<IPC::SysV>, C<IPC::SysV::Semaphore> 4740documentation. 4741 4742=item semop KEY,OPSTRING 4743X<semop> 4744 4745Calls the System V IPC function semop to perform semaphore operations 4746such as signalling and waiting. OPSTRING must be a packed array of 4747semop structures. Each semop structure can be generated with 4748C<pack("s!3", $semnum, $semop, $semflag)>. The length of OPSTRING 4749implies the number of semaphore operations. Returns true if 4750successful, or false if there is an error. As an example, the 4751following code waits on semaphore $semnum of semaphore id $semid: 4752 4753 $semop = pack("s!3", $semnum, -1, 0); 4754 die "Semaphore trouble: $!\n" unless semop($semid, $semop); 4755 4756To signal the semaphore, replace C<-1> with C<1>. See also 4757L<perlipc/"SysV IPC">, C<IPC::SysV>, and C<IPC::SysV::Semaphore> 4758documentation. 4759 4760=item send SOCKET,MSG,FLAGS,TO 4761X<send> 4762 4763=item send SOCKET,MSG,FLAGS 4764 4765Sends a message on a socket. Attempts to send the scalar MSG to the 4766SOCKET filehandle. Takes the same flags as the system call of the 4767same name. On unconnected sockets you must specify a destination to 4768send TO, in which case it does a C C<sendto>. Returns the number of 4769characters sent, or the undefined value if there is an error. The C 4770system call sendmsg(2) is currently unimplemented. See 4771L<perlipc/"UDP: Message Passing"> for examples. 4772 4773Note the I<characters>: depending on the status of the socket, either 4774(8-bit) bytes or characters are sent. By default all sockets operate 4775on bytes, but for example if the socket has been changed using 4776binmode() to operate with the C<:utf8> I/O layer (see L</open>, or the 4777C<open> pragma, L<open>), the I/O will operate on UTF-8 encoded 4778Unicode characters, not bytes. Similarly for the C<:encoding> pragma: 4779in that case pretty much any characters can be sent. 4780 4781=item setpgrp PID,PGRP 4782X<setpgrp> X<group> 4783 4784Sets the current process group for the specified PID, C<0> for the current 4785process. Will produce a fatal error if used on a machine that doesn't 4786implement POSIX setpgid(2) or BSD setpgrp(2). If the arguments are omitted, 4787it defaults to C<0,0>. Note that the BSD 4.2 version of C<setpgrp> does not 4788accept any arguments, so only C<setpgrp(0,0)> is portable. See also 4789C<POSIX::setsid()>. 4790 4791=item setpriority WHICH,WHO,PRIORITY 4792X<setpriority> X<priority> X<nice> X<renice> 4793 4794Sets the current priority for a process, a process group, or a user. 4795(See setpriority(2).) Will produce a fatal error if used on a machine 4796that doesn't implement setpriority(2). 4797 4798=item setsockopt SOCKET,LEVEL,OPTNAME,OPTVAL 4799X<setsockopt> 4800 4801Sets the socket option requested. Returns undefined if there is an 4802error. Use integer constants provided by the C<Socket> module for 4803LEVEL and OPNAME. Values for LEVEL can also be obtained from 4804getprotobyname. OPTVAL might either be a packed string or an integer. 4805An integer OPTVAL is shorthand for pack("i", OPTVAL). 4806 4807An example disabling the Nagle's algorithm for a socket: 4808 4809 use Socket qw(IPPROTO_TCP TCP_NODELAY); 4810 setsockopt($socket, IPPROTO_TCP, TCP_NODELAY, 1); 4811 4812=item shift ARRAY 4813X<shift> 4814 4815=item shift 4816 4817Shifts the first value of the array off and returns it, shortening the 4818array by 1 and moving everything down. If there are no elements in the 4819array, returns the undefined value. If ARRAY is omitted, shifts the 4820C<@_> array within the lexical scope of subroutines and formats, and the 4821C<@ARGV> array at file scopes or within the lexical scopes established by 4822the C<eval ''>, C<BEGIN {}>, C<INIT {}>, C<CHECK {}>, and C<END {}> 4823constructs. 4824 4825See also C<unshift>, C<push>, and C<pop>. C<shift> and C<unshift> do the 4826same thing to the left end of an array that C<pop> and C<push> do to the 4827right end. 4828 4829=item shmctl ID,CMD,ARG 4830X<shmctl> 4831 4832Calls the System V IPC function shmctl. You'll probably have to say 4833 4834 use IPC::SysV; 4835 4836first to get the correct constant definitions. If CMD is C<IPC_STAT>, 4837then ARG must be a variable that will hold the returned C<shmid_ds> 4838structure. Returns like ioctl: the undefined value for error, "C<0> but 4839true" for zero, or the actual return value otherwise. 4840See also L<perlipc/"SysV IPC"> and C<IPC::SysV> documentation. 4841 4842=item shmget KEY,SIZE,FLAGS 4843X<shmget> 4844 4845Calls the System V IPC function shmget. Returns the shared memory 4846segment id, or the undefined value if there is an error. 4847See also L<perlipc/"SysV IPC"> and C<IPC::SysV> documentation. 4848 4849=item shmread ID,VAR,POS,SIZE 4850X<shmread> 4851X<shmwrite> 4852 4853=item shmwrite ID,STRING,POS,SIZE 4854 4855Reads or writes the System V shared memory segment ID starting at 4856position POS for size SIZE by attaching to it, copying in/out, and 4857detaching from it. When reading, VAR must be a variable that will 4858hold the data read. When writing, if STRING is too long, only SIZE 4859bytes are used; if STRING is too short, nulls are written to fill out 4860SIZE bytes. Return true if successful, or false if there is an error. 4861shmread() taints the variable. See also L<perlipc/"SysV IPC">, 4862C<IPC::SysV> documentation, and the C<IPC::Shareable> module from CPAN. 4863 4864=item shutdown SOCKET,HOW 4865X<shutdown> 4866 4867Shuts down a socket connection in the manner indicated by HOW, which 4868has the same interpretation as in the system call of the same name. 4869 4870 shutdown(SOCKET, 0); # I/we have stopped reading data 4871 shutdown(SOCKET, 1); # I/we have stopped writing data 4872 shutdown(SOCKET, 2); # I/we have stopped using this socket 4873 4874This is useful with sockets when you want to tell the other 4875side you're done writing but not done reading, or vice versa. 4876It's also a more insistent form of close because it also 4877disables the file descriptor in any forked copies in other 4878processes. 4879 4880=item sin EXPR 4881X<sin> X<sine> X<asin> X<arcsine> 4882 4883=item sin 4884 4885Returns the sine of EXPR (expressed in radians). If EXPR is omitted, 4886returns sine of C<$_>. 4887 4888For the inverse sine operation, you may use the C<Math::Trig::asin> 4889function, or use this relation: 4890 4891 sub asin { atan2($_[0], sqrt(1 - $_[0] * $_[0])) } 4892 4893=item sleep EXPR 4894X<sleep> X<pause> 4895 4896=item sleep 4897 4898Causes the script to sleep for EXPR seconds, or forever if no EXPR. 4899May be interrupted if the process receives a signal such as C<SIGALRM>. 4900Returns the number of seconds actually slept. You probably cannot 4901mix C<alarm> and C<sleep> calls, because C<sleep> is often implemented 4902using C<alarm>. 4903 4904On some older systems, it may sleep up to a full second less than what 4905you requested, depending on how it counts seconds. Most modern systems 4906always sleep the full amount. They may appear to sleep longer than that, 4907however, because your process might not be scheduled right away in a 4908busy multitasking system. 4909 4910For delays of finer granularity than one second, you may use Perl's 4911C<syscall> interface to access setitimer(2) if your system supports 4912it, or else see L</select> above. The Time::HiRes module (from CPAN, 4913and starting from Perl 5.8 part of the standard distribution) may also 4914help. 4915 4916See also the POSIX module's C<pause> function. 4917 4918=item socket SOCKET,DOMAIN,TYPE,PROTOCOL 4919X<socket> 4920 4921Opens a socket of the specified kind and attaches it to filehandle 4922SOCKET. DOMAIN, TYPE, and PROTOCOL are specified the same as for 4923the system call of the same name. You should C<use Socket> first 4924to get the proper definitions imported. See the examples in 4925L<perlipc/"Sockets: Client/Server Communication">. 4926 4927On systems that support a close-on-exec flag on files, the flag will 4928be set for the newly opened file descriptor, as determined by the 4929value of $^F. See L<perlvar/$^F>. 4930 4931=item socketpair SOCKET1,SOCKET2,DOMAIN,TYPE,PROTOCOL 4932X<socketpair> 4933 4934Creates an unnamed pair of sockets in the specified domain, of the 4935specified type. DOMAIN, TYPE, and PROTOCOL are specified the same as 4936for the system call of the same name. If unimplemented, yields a fatal 4937error. Returns true if successful. 4938 4939On systems that support a close-on-exec flag on files, the flag will 4940be set for the newly opened file descriptors, as determined by the value 4941of $^F. See L<perlvar/$^F>. 4942 4943Some systems defined C<pipe> in terms of C<socketpair>, in which a call 4944to C<pipe(Rdr, Wtr)> is essentially: 4945 4946 use Socket; 4947 socketpair(Rdr, Wtr, AF_UNIX, SOCK_STREAM, PF_UNSPEC); 4948 shutdown(Rdr, 1); # no more writing for reader 4949 shutdown(Wtr, 0); # no more reading for writer 4950 4951See L<perlipc> for an example of socketpair use. Perl 5.8 and later will 4952emulate socketpair using IP sockets to localhost if your system implements 4953sockets but not socketpair. 4954 4955=item sort SUBNAME LIST 4956X<sort> X<qsort> X<quicksort> X<mergesort> 4957 4958=item sort BLOCK LIST 4959 4960=item sort LIST 4961 4962In list context, this sorts the LIST and returns the sorted list value. 4963In scalar context, the behaviour of C<sort()> is undefined. 4964 4965If SUBNAME or BLOCK is omitted, C<sort>s in standard string comparison 4966order. If SUBNAME is specified, it gives the name of a subroutine 4967that returns an integer less than, equal to, or greater than C<0>, 4968depending on how the elements of the list are to be ordered. (The C<< 4969<=> >> and C<cmp> operators are extremely useful in such routines.) 4970SUBNAME may be a scalar variable name (unsubscripted), in which case 4971the value provides the name of (or a reference to) the actual 4972subroutine to use. In place of a SUBNAME, you can provide a BLOCK as 4973an anonymous, in-line sort subroutine. 4974 4975If the subroutine's prototype is C<($$)>, the elements to be compared 4976are passed by reference in C<@_>, as for a normal subroutine. This is 4977slower than unprototyped subroutines, where the elements to be 4978compared are passed into the subroutine 4979as the package global variables $a and $b (see example below). Note that 4980in the latter case, it is usually counter-productive to declare $a and 4981$b as lexicals. 4982 4983In either case, the subroutine may not be recursive. The values to be 4984compared are always passed by reference and should not be modified. 4985 4986You also cannot exit out of the sort block or subroutine using any of the 4987loop control operators described in L<perlsyn> or with C<goto>. 4988 4989When C<use locale> is in effect, C<sort LIST> sorts LIST according to the 4990current collation locale. See L<perllocale>. 4991 4992sort() returns aliases into the original list, much as a for loop's index 4993variable aliases the list elements. That is, modifying an element of a 4994list returned by sort() (for example, in a C<foreach>, C<map> or C<grep>) 4995actually modifies the element in the original list. This is usually 4996something to be avoided when writing clear code. 4997 4998Perl 5.6 and earlier used a quicksort algorithm to implement sort. 4999That algorithm was not stable, and I<could> go quadratic. (A I<stable> sort 5000preserves the input order of elements that compare equal. Although 5001quicksort's run time is O(NlogN) when averaged over all arrays of 5002length N, the time can be O(N**2), I<quadratic> behavior, for some 5003inputs.) In 5.7, the quicksort implementation was replaced with 5004a stable mergesort algorithm whose worst-case behavior is O(NlogN). 5005But benchmarks indicated that for some inputs, on some platforms, 5006the original quicksort was faster. 5.8 has a sort pragma for 5007limited control of the sort. Its rather blunt control of the 5008underlying algorithm may not persist into future Perls, but the 5009ability to characterize the input or output in implementation 5010independent ways quite probably will. See L<sort>. 5011 5012Examples: 5013 5014 # sort lexically 5015 @articles = sort @files; 5016 5017 # same thing, but with explicit sort routine 5018 @articles = sort {$a cmp $b} @files; 5019 5020 # now case-insensitively 5021 @articles = sort {uc($a) cmp uc($b)} @files; 5022 5023 # same thing in reversed order 5024 @articles = sort {$b cmp $a} @files; 5025 5026 # sort numerically ascending 5027 @articles = sort {$a <=> $b} @files; 5028 5029 # sort numerically descending 5030 @articles = sort {$b <=> $a} @files; 5031 5032 # this sorts the %age hash by value instead of key 5033 # using an in-line function 5034 @eldest = sort { $age{$b} <=> $age{$a} } keys %age; 5035 5036 # sort using explicit subroutine name 5037 sub byage { 5038 $age{$a} <=> $age{$b}; # presuming numeric 5039 } 5040 @sortedclass = sort byage @class; 5041 5042 sub backwards { $b cmp $a } 5043 @harry = qw(dog cat x Cain Abel); 5044 @george = qw(gone chased yz Punished Axed); 5045 print sort @harry; 5046 # prints AbelCaincatdogx 5047 print sort backwards @harry; 5048 # prints xdogcatCainAbel 5049 print sort @george, 'to', @harry; 5050 # prints AbelAxedCainPunishedcatchaseddoggonetoxyz 5051 5052 # inefficiently sort by descending numeric compare using 5053 # the first integer after the first = sign, or the 5054 # whole record case-insensitively otherwise 5055 5056 @new = sort { 5057 ($b =~ /=(\d+)/)[0] <=> ($a =~ /=(\d+)/)[0] 5058 || 5059 uc($a) cmp uc($b) 5060 } @old; 5061 5062 # same thing, but much more efficiently; 5063 # we'll build auxiliary indices instead 5064 # for speed 5065 @nums = @caps = (); 5066 for (@old) { 5067 push @nums, /=(\d+)/; 5068 push @caps, uc($_); 5069 } 5070 5071 @new = @old[ sort { 5072 $nums[$b] <=> $nums[$a] 5073 || 5074 $caps[$a] cmp $caps[$b] 5075 } 0..$#old 5076 ]; 5077 5078 # same thing, but without any temps 5079 @new = map { $_->[0] } 5080 sort { $b->[1] <=> $a->[1] 5081 || 5082 $a->[2] cmp $b->[2] 5083 } map { [$_, /=(\d+)/, uc($_)] } @old; 5084 5085 # using a prototype allows you to use any comparison subroutine 5086 # as a sort subroutine (including other package's subroutines) 5087 package other; 5088 sub backwards ($$) { $_[1] cmp $_[0]; } # $a and $b are not set here 5089 5090 package main; 5091 @new = sort other::backwards @old; 5092 5093 # guarantee stability, regardless of algorithm 5094 use sort 'stable'; 5095 @new = sort { substr($a, 3, 5) cmp substr($b, 3, 5) } @old; 5096 5097 # force use of mergesort (not portable outside Perl 5.8) 5098 use sort '_mergesort'; # note discouraging _ 5099 @new = sort { substr($a, 3, 5) cmp substr($b, 3, 5) } @old; 5100 5101If you're using strict, you I<must not> declare $a 5102and $b as lexicals. They are package globals. That means 5103if you're in the C<main> package and type 5104 5105 @articles = sort {$b <=> $a} @files; 5106 5107then C<$a> and C<$b> are C<$main::a> and C<$main::b> (or C<$::a> and C<$::b>), 5108but if you're in the C<FooPack> package, it's the same as typing 5109 5110 @articles = sort {$FooPack::b <=> $FooPack::a} @files; 5111 5112The comparison function is required to behave. If it returns 5113inconsistent results (sometimes saying C<$x[1]> is less than C<$x[2]> and 5114sometimes saying the opposite, for example) the results are not 5115well-defined. 5116 5117Because C<< <=> >> returns C<undef> when either operand is C<NaN> 5118(not-a-number), and because C<sort> will trigger a fatal error unless the 5119result of a comparison is defined, when sorting with a comparison function 5120like C<< $a <=> $b >>, be careful about lists that might contain a C<NaN>. 5121The following example takes advantage of the fact that C<NaN != NaN> to 5122eliminate any C<NaN>s from the input. 5123 5124 @result = sort { $a <=> $b } grep { $_ == $_ } @input; 5125 5126=item splice ARRAY,OFFSET,LENGTH,LIST 5127X<splice> 5128 5129=item splice ARRAY,OFFSET,LENGTH 5130 5131=item splice ARRAY,OFFSET 5132 5133=item splice ARRAY 5134 5135Removes the elements designated by OFFSET and LENGTH from an array, and 5136replaces them with the elements of LIST, if any. In list context, 5137returns the elements removed from the array. In scalar context, 5138returns the last element removed, or C<undef> if no elements are 5139removed. The array grows or shrinks as necessary. 5140If OFFSET is negative then it starts that far from the end of the array. 5141If LENGTH is omitted, removes everything from OFFSET onward. 5142If LENGTH is negative, removes the elements from OFFSET onward 5143except for -LENGTH elements at the end of the array. 5144If both OFFSET and LENGTH are omitted, removes everything. If OFFSET is 5145past the end of the array, perl issues a warning, and splices at the 5146end of the array. 5147 5148The following equivalences hold (assuming C<< $[ == 0 and $#a >= $i >> ) 5149 5150 push(@a,$x,$y) splice(@a,@a,0,$x,$y) 5151 pop(@a) splice(@a,-1) 5152 shift(@a) splice(@a,0,1) 5153 unshift(@a,$x,$y) splice(@a,0,0,$x,$y) 5154 $a[$i] = $y splice(@a,$i,1,$y) 5155 5156Example, assuming array lengths are passed before arrays: 5157 5158 sub aeq { # compare two list values 5159 my(@a) = splice(@_,0,shift); 5160 my(@b) = splice(@_,0,shift); 5161 return 0 unless @a == @b; # same len? 5162 while (@a) { 5163 return 0 if pop(@a) ne pop(@b); 5164 } 5165 return 1; 5166 } 5167 if (&aeq($len,@foo[1..$len],0+@bar,@bar)) { ... } 5168 5169=item split /PATTERN/,EXPR,LIMIT 5170X<split> 5171 5172=item split /PATTERN/,EXPR 5173 5174=item split /PATTERN/ 5175 5176=item split 5177 5178Splits the string EXPR into a list of strings and returns that list. By 5179default, empty leading fields are preserved, and empty trailing ones are 5180deleted. (If all fields are empty, they are considered to be trailing.) 5181 5182In scalar context, returns the number of fields found and splits into 5183the C<@_> array. Use of split in scalar context is deprecated, however, 5184because it clobbers your subroutine arguments. 5185 5186If EXPR is omitted, splits the C<$_> string. If PATTERN is also omitted, 5187splits on whitespace (after skipping any leading whitespace). Anything 5188matching PATTERN is taken to be a delimiter separating the fields. (Note 5189that the delimiter may be longer than one character.) 5190 5191If LIMIT is specified and positive, it represents the maximum number 5192of fields the EXPR will be split into, though the actual number of 5193fields returned depends on the number of times PATTERN matches within 5194EXPR. If LIMIT is unspecified or zero, trailing null fields are 5195stripped (which potential users of C<pop> would do well to remember). 5196If LIMIT is negative, it is treated as if an arbitrarily large LIMIT 5197had been specified. Note that splitting an EXPR that evaluates to the 5198empty string always returns the empty list, regardless of the LIMIT 5199specified. 5200 5201A pattern matching the null string (not to be confused with 5202a null pattern C<//>, which is just one member of the set of patterns 5203matching a null string) will split the value of EXPR into separate 5204characters at each point it matches that way. For example: 5205 5206 print join(':', split(/ */, 'hi there')); 5207 5208produces the output 'h:i:t:h:e:r:e'. 5209 5210As a special case for C<split>, using the empty pattern C<//> specifically 5211matches only the null string, and is not be confused with the regular use 5212of C<//> to mean "the last successful pattern match". So, for C<split>, 5213the following: 5214 5215 print join(':', split(//, 'hi there')); 5216 5217produces the output 'h:i: :t:h:e:r:e'. 5218 5219Empty leading (or trailing) fields are produced when there are positive 5220width matches at the beginning (or end) of the string; a zero-width match 5221at the beginning (or end) of the string does not produce an empty field. 5222For example: 5223 5224 print join(':', split(/(?=\w)/, 'hi there!')); 5225 5226produces the output 'h:i :t:h:e:r:e!'. 5227 5228The LIMIT parameter can be used to split a line partially 5229 5230 ($login, $passwd, $remainder) = split(/:/, $_, 3); 5231 5232When assigning to a list, if LIMIT is omitted, or zero, Perl supplies 5233a LIMIT one larger than the number of variables in the list, to avoid 5234unnecessary work. For the list above LIMIT would have been 4 by 5235default. In time critical applications it behooves you not to split 5236into more fields than you really need. 5237 5238If the PATTERN contains parentheses, additional list elements are 5239created from each matching substring in the delimiter. 5240 5241 split(/([,-])/, "1-10,20", 3); 5242 5243produces the list value 5244 5245 (1, '-', 10, ',', 20) 5246 5247If you had the entire header of a normal Unix email message in $header, 5248you could split it up into fields and their values this way: 5249 5250 $header =~ s/\n\s+/ /g; # fix continuation lines 5251 %hdrs = (UNIX_FROM => split /^(\S*?):\s*/m, $header); 5252 5253The pattern C</PATTERN/> may be replaced with an expression to specify 5254patterns that vary at runtime. (To do runtime compilation only once, 5255use C</$variable/o>.) 5256 5257As a special case, specifying a PATTERN of space (S<C<' '>>) will split on 5258white space just as C<split> with no arguments does. Thus, S<C<split(' ')>> can 5259be used to emulate B<awk>'s default behavior, whereas S<C<split(/ /)>> 5260will give you as many null initial fields as there are leading spaces. 5261A C<split> on C</\s+/> is like a S<C<split(' ')>> except that any leading 5262whitespace produces a null first field. A C<split> with no arguments 5263really does a S<C<split(' ', $_)>> internally. 5264 5265A PATTERN of C</^/> is treated as if it were C</^/m>, since it isn't 5266much use otherwise. 5267 5268Example: 5269 5270 open(PASSWD, '/etc/passwd'); 5271 while (<PASSWD>) { 5272 chomp; 5273 ($login, $passwd, $uid, $gid, 5274 $gcos, $home, $shell) = split(/:/); 5275 #... 5276 } 5277 5278As with regular pattern matching, any capturing parentheses that are not 5279matched in a C<split()> will be set to C<undef> when returned: 5280 5281 @fields = split /(A)|B/, "1A2B3"; 5282 # @fields is (1, 'A', 2, undef, 3) 5283 5284=item sprintf FORMAT, LIST 5285X<sprintf> 5286 5287Returns a string formatted by the usual C<printf> conventions of the C 5288library function C<sprintf>. See below for more details 5289and see L<sprintf(3)> or L<printf(3)> on your system for an explanation of 5290the general principles. 5291 5292For example: 5293 5294 # Format number with up to 8 leading zeroes 5295 $result = sprintf("%08d", $number); 5296 5297 # Round number to 3 digits after decimal point 5298 $rounded = sprintf("%.3f", $number); 5299 5300Perl does its own C<sprintf> formatting--it emulates the C 5301function C<sprintf>, but it doesn't use it (except for floating-point 5302numbers, and even then only the standard modifiers are allowed). As a 5303result, any non-standard extensions in your local C<sprintf> are not 5304available from Perl. 5305 5306Unlike C<printf>, C<sprintf> does not do what you probably mean when you 5307pass it an array as your first argument. The array is given scalar context, 5308and instead of using the 0th element of the array as the format, Perl will 5309use the count of elements in the array as the format, which is almost never 5310useful. 5311 5312Perl's C<sprintf> permits the following universally-known conversions: 5313 5314 %% a percent sign 5315 %c a character with the given number 5316 %s a string 5317 %d a signed integer, in decimal 5318 %u an unsigned integer, in decimal 5319 %o an unsigned integer, in octal 5320 %x an unsigned integer, in hexadecimal 5321 %e a floating-point number, in scientific notation 5322 %f a floating-point number, in fixed decimal notation 5323 %g a floating-point number, in %e or %f notation 5324 5325In addition, Perl permits the following widely-supported conversions: 5326 5327 %X like %x, but using upper-case letters 5328 %E like %e, but using an upper-case "E" 5329 %G like %g, but with an upper-case "E" (if applicable) 5330 %b an unsigned integer, in binary 5331 %p a pointer (outputs the Perl value's address in hexadecimal) 5332 %n special: *stores* the number of characters output so far 5333 into the next variable in the parameter list 5334 5335Finally, for backward (and we do mean "backward") compatibility, Perl 5336permits these unnecessary but widely-supported conversions: 5337 5338 %i a synonym for %d 5339 %D a synonym for %ld 5340 %U a synonym for %lu 5341 %O a synonym for %lo 5342 %F a synonym for %f 5343 5344Note that the number of exponent digits in the scientific notation produced 5345by C<%e>, C<%E>, C<%g> and C<%G> for numbers with the modulus of the 5346exponent less than 100 is system-dependent: it may be three or less 5347(zero-padded as necessary). In other words, 1.23 times ten to the 534899th may be either "1.23e99" or "1.23e099". 5349 5350Between the C<%> and the format letter, you may specify a number of 5351additional attributes controlling the interpretation of the format. 5352In order, these are: 5353 5354=over 4 5355 5356=item format parameter index 5357 5358An explicit format parameter index, such as C<2$>. By default sprintf 5359will format the next unused argument in the list, but this allows you 5360to take the arguments out of order, e.g.: 5361 5362 printf '%2$d %1$d', 12, 34; # prints "34 12" 5363 printf '%3$d %d %1$d', 1, 2, 3; # prints "3 1 1" 5364 5365=item flags 5366 5367one or more of: 5368 space prefix positive number with a space 5369 + prefix positive number with a plus sign 5370 - left-justify within the field 5371 0 use zeros, not spaces, to right-justify 5372 # prefix non-zero octal with "0", non-zero hex with "0x", 5373 non-zero binary with "0b" 5374 5375For example: 5376 5377 printf '<% d>', 12; # prints "< 12>" 5378 printf '<%+d>', 12; # prints "<+12>" 5379 printf '<%6s>', 12; # prints "< 12>" 5380 printf '<%-6s>', 12; # prints "<12 >" 5381 printf '<%06s>', 12; # prints "<000012>" 5382 printf '<%#x>', 12; # prints "<0xc>" 5383 5384=item vector flag 5385 5386This flag tells perl to interpret the supplied string as a vector of 5387integers, one for each character in the string. Perl applies the format to 5388each integer in turn, then joins the resulting strings with a separator (a 5389dot C<.> by default). This can be useful for displaying ordinal values of 5390characters in arbitrary strings: 5391 5392 printf "%vd", "AB\x{100}"; # prints "65.66.256" 5393 printf "version is v%vd\n", $^V; # Perl's version 5394 5395Put an asterisk C<*> before the C<v> to override the string to 5396use to separate the numbers: 5397 5398 printf "address is %*vX\n", ":", $addr; # IPv6 address 5399 printf "bits are %0*v8b\n", " ", $bits; # random bitstring 5400 5401You can also explicitly specify the argument number to use for 5402the join string using e.g. C<*2$v>: 5403 5404 printf '%*4$vX %*4$vX %*4$vX', @addr[1..3], ":"; # 3 IPv6 addresses 5405 5406=item (minimum) width 5407 5408Arguments are usually formatted to be only as wide as required to 5409display the given value. You can override the width by putting 5410a number here, or get the width from the next argument (with C<*>) 5411or from a specified argument (with e.g. C<*2$>): 5412 5413 printf '<%s>', "a"; # prints "<a>" 5414 printf '<%6s>', "a"; # prints "< a>" 5415 printf '<%*s>', 6, "a"; # prints "< a>" 5416 printf '<%*2$s>', "a", 6; # prints "< a>" 5417 printf '<%2s>', "long"; # prints "<long>" (does not truncate) 5418 5419If a field width obtained through C<*> is negative, it has the same 5420effect as the C<-> flag: left-justification. 5421 5422=item precision, or maximum width 5423X<precision> 5424 5425You can specify a precision (for numeric conversions) or a maximum 5426width (for string conversions) by specifying a C<.> followed by a number. 5427For floating point formats, with the exception of 'g' and 'G', this specifies 5428the number of decimal places to show (the default being 6), e.g.: 5429 5430 # these examples are subject to system-specific variation 5431 printf '<%f>', 1; # prints "<1.000000>" 5432 printf '<%.1f>', 1; # prints "<1.0>" 5433 printf '<%.0f>', 1; # prints "<1>" 5434 printf '<%e>', 10; # prints "<1.000000e+01>" 5435 printf '<%.1e>', 10; # prints "<1.0e+01>" 5436 5437For 'g' and 'G', this specifies the maximum number of digits to show, 5438including prior to the decimal point as well as after it, e.g.: 5439 5440 # these examples are subject to system-specific variation 5441 printf '<%g>', 1; # prints "<1>" 5442 printf '<%.10g>', 1; # prints "<1>" 5443 printf '<%g>', 100; # prints "<100>" 5444 printf '<%.1g>', 100; # prints "<1e+02>" 5445 printf '<%.2g>', 100.01; # prints "<1e+02>" 5446 printf '<%.5g>', 100.01; # prints "<100.01>" 5447 printf '<%.4g>', 100.01; # prints "<100>" 5448 5449For integer conversions, specifying a precision implies that the 5450output of the number itself should be zero-padded to this width: 5451 5452 printf '<%.6x>', 1; # prints "<000001>" 5453 printf '<%#.6x>', 1; # prints "<0x000001>" 5454 printf '<%-10.6x>', 1; # prints "<000001 >" 5455 5456For string conversions, specifying a precision truncates the string 5457to fit in the specified width: 5458 5459 printf '<%.5s>', "truncated"; # prints "<trunc>" 5460 printf '<%10.5s>', "truncated"; # prints "< trunc>" 5461 5462You can also get the precision from the next argument using C<.*>: 5463 5464 printf '<%.6x>', 1; # prints "<000001>" 5465 printf '<%.*x>', 6, 1; # prints "<000001>" 5466 5467You cannot currently get the precision from a specified number, 5468but it is intended that this will be possible in the future using 5469e.g. C<.*2$>: 5470 5471 printf '<%.*2$x>', 1, 6; # INVALID, but in future will print "<000001>" 5472 5473=item size 5474 5475For numeric conversions, you can specify the size to interpret the 5476number as using C<l>, C<h>, C<V>, C<q>, C<L>, or C<ll>. For integer 5477conversions (C<d u o x X b i D U O>), numbers are usually assumed to be 5478whatever the default integer size is on your platform (usually 32 or 64 5479bits), but you can override this to use instead one of the standard C types, 5480as supported by the compiler used to build Perl: 5481 5482 l interpret integer as C type "long" or "unsigned long" 5483 h interpret integer as C type "short" or "unsigned short" 5484 q, L or ll interpret integer as C type "long long", "unsigned long long". 5485 or "quads" (typically 64-bit integers) 5486 5487The last will produce errors if Perl does not understand "quads" in your 5488installation. (This requires that either the platform natively supports quads 5489or Perl was specifically compiled to support quads.) You can find out 5490whether your Perl supports quads via L<Config>: 5491 5492 use Config; 5493 ($Config{use64bitint} eq 'define' || $Config{longsize} >= 8) && 5494 print "quads\n"; 5495 5496For floating point conversions (C<e f g E F G>), numbers are usually assumed 5497to be the default floating point size on your platform (double or long double), 5498but you can force 'long double' with C<q>, C<L>, or C<ll> if your 5499platform supports them. You can find out whether your Perl supports long 5500doubles via L<Config>: 5501 5502 use Config; 5503 $Config{d_longdbl} eq 'define' && print "long doubles\n"; 5504 5505You can find out whether Perl considers 'long double' to be the default 5506floating point size to use on your platform via L<Config>: 5507 5508 use Config; 5509 ($Config{uselongdouble} eq 'define') && 5510 print "long doubles by default\n"; 5511 5512It can also be the case that long doubles and doubles are the same thing: 5513 5514 use Config; 5515 ($Config{doublesize} == $Config{longdblsize}) && 5516 print "doubles are long doubles\n"; 5517 5518The size specifier C<V> has no effect for Perl code, but it is supported 5519for compatibility with XS code; it means 'use the standard size for 5520a Perl integer (or floating-point number)', which is already the 5521default for Perl code. 5522 5523=item order of arguments 5524 5525Normally, sprintf takes the next unused argument as the value to 5526format for each format specification. If the format specification 5527uses C<*> to require additional arguments, these are consumed from 5528the argument list in the order in which they appear in the format 5529specification I<before> the value to format. Where an argument is 5530specified using an explicit index, this does not affect the normal 5531order for the arguments (even when the explicitly specified index 5532would have been the next argument in any case). 5533 5534So: 5535 5536 printf '<%*.*s>', $a, $b, $c; 5537 5538would use C<$a> for the width, C<$b> for the precision and C<$c> 5539as the value to format, while: 5540 5541 print '<%*1$.*s>', $a, $b; 5542 5543would use C<$a> for the width and the precision, and C<$b> as the 5544value to format. 5545 5546Here are some more examples - beware that when using an explicit 5547index, the C<$> may need to be escaped: 5548 5549 printf "%2\$d %d\n", 12, 34; # will print "34 12\n" 5550 printf "%2\$d %d %d\n", 12, 34; # will print "34 12 34\n" 5551 printf "%3\$d %d %d\n", 12, 34, 56; # will print "56 12 34\n" 5552 printf "%2\$*3\$d %d\n", 12, 34, 3; # will print " 34 12\n" 5553 5554=back 5555 5556If C<use locale> is in effect, the character used for the decimal 5557point in formatted real numbers is affected by the LC_NUMERIC locale. 5558See L<perllocale>. 5559 5560=item sqrt EXPR 5561X<sqrt> X<root> X<square root> 5562 5563=item sqrt 5564 5565Return the square root of EXPR. If EXPR is omitted, returns square 5566root of C<$_>. Only works on non-negative operands, unless you've 5567loaded the standard Math::Complex module. 5568 5569 use Math::Complex; 5570 print sqrt(-2); # prints 1.4142135623731i 5571 5572=item srand EXPR 5573X<srand> X<seed> X<randseed> 5574 5575=item srand 5576 5577Sets the random number seed for the C<rand> operator. 5578 5579The point of the function is to "seed" the C<rand> function so that 5580C<rand> can produce a different sequence each time you run your 5581program. 5582 5583If srand() is not called explicitly, it is called implicitly at the 5584first use of the C<rand> operator. However, this was not the case in 5585versions of Perl before 5.004, so if your script will run under older 5586Perl versions, it should call C<srand>. 5587 5588Most programs won't even call srand() at all, except those that 5589need a cryptographically-strong starting point rather than the 5590generally acceptable default, which is based on time of day, 5591process ID, and memory allocation, or the F</dev/urandom> device, 5592if available. 5593 5594You can call srand($seed) with the same $seed to reproduce the 5595I<same> sequence from rand(), but this is usually reserved for 5596generating predictable results for testing or debugging. 5597Otherwise, don't call srand() more than once in your program. 5598 5599Do B<not> call srand() (i.e. without an argument) more than once in 5600a script. The internal state of the random number generator should 5601contain more entropy than can be provided by any seed, so calling 5602srand() again actually I<loses> randomness. 5603 5604Most implementations of C<srand> take an integer and will silently 5605truncate decimal numbers. This means C<srand(42)> will usually 5606produce the same results as C<srand(42.1)>. To be safe, always pass 5607C<srand> an integer. 5608 5609In versions of Perl prior to 5.004 the default seed was just the 5610current C<time>. This isn't a particularly good seed, so many old 5611programs supply their own seed value (often C<time ^ $$> or C<time ^ 5612($$ + ($$ << 15))>), but that isn't necessary any more. 5613 5614For cryptographic purposes, however, you need something much more random 5615than the default seed. Checksumming the compressed output of one or more 5616rapidly changing operating system status programs is the usual method. For 5617example: 5618 5619 srand (time ^ $$ ^ unpack "%L*", `ps axww | gzip`); 5620 5621If you're particularly concerned with this, see the C<Math::TrulyRandom> 5622module in CPAN. 5623 5624Frequently called programs (like CGI scripts) that simply use 5625 5626 time ^ $$ 5627 5628for a seed can fall prey to the mathematical property that 5629 5630 a^b == (a+1)^(b+1) 5631 5632one-third of the time. So don't do that. 5633 5634=item stat FILEHANDLE 5635X<stat> X<file, status> 5636 5637=item stat EXPR 5638 5639=item stat 5640 5641Returns a 13-element list giving the status info for a file, either 5642the file opened via FILEHANDLE, or named by EXPR. If EXPR is omitted, 5643it stats C<$_>. Returns a null list if the stat fails. Typically used 5644as follows: 5645 5646 ($dev,$ino,$mode,$nlink,$uid,$gid,$rdev,$size, 5647 $atime,$mtime,$ctime,$blksize,$blocks) 5648 = stat($filename); 5649 5650Not all fields are supported on all filesystem types. Here are the 5651meanings of the fields: 5652 5653 0 dev device number of filesystem 5654 1 ino inode number 5655 2 mode file mode (type and permissions) 5656 3 nlink number of (hard) links to the file 5657 4 uid numeric user ID of file's owner 5658 5 gid numeric group ID of file's owner 5659 6 rdev the device identifier (special files only) 5660 7 size total size of file, in bytes 5661 8 atime last access time in seconds since the epoch 5662 9 mtime last modify time in seconds since the epoch 5663 10 ctime inode change time in seconds since the epoch (*) 5664 11 blksize preferred block size for file system I/O 5665 12 blocks actual number of blocks allocated 5666 5667(The epoch was at 00:00 January 1, 1970 GMT.) 5668 5669(*) Not all fields are supported on all filesystem types. Notably, the 5670ctime field is non-portable. In particular, you cannot expect it to be a 5671"creation time", see L<perlport/"Files and Filesystems"> for details. 5672 5673If C<stat> is passed the special filehandle consisting of an underline, no 5674stat is done, but the current contents of the stat structure from the 5675last C<stat>, C<lstat>, or filetest are returned. Example: 5676 5677 if (-x $file && (($d) = stat(_)) && $d < 0) { 5678 print "$file is executable NFS file\n"; 5679 } 5680 5681(This works on machines only for which the device number is negative 5682under NFS.) 5683 5684Because the mode contains both the file type and its permissions, you 5685should mask off the file type portion and (s)printf using a C<"%o"> 5686if you want to see the real permissions. 5687 5688 $mode = (stat($filename))[2]; 5689 printf "Permissions are %04o\n", $mode & 07777; 5690 5691In scalar context, C<stat> returns a boolean value indicating success 5692or failure, and, if successful, sets the information associated with 5693the special filehandle C<_>. 5694 5695The File::stat module provides a convenient, by-name access mechanism: 5696 5697 use File::stat; 5698 $sb = stat($filename); 5699 printf "File is %s, size is %s, perm %04o, mtime %s\n", 5700 $filename, $sb->size, $sb->mode & 07777, 5701 scalar localtime $sb->mtime; 5702 5703You can import symbolic mode constants (C<S_IF*>) and functions 5704(C<S_IS*>) from the Fcntl module: 5705 5706 use Fcntl ':mode'; 5707 5708 $mode = (stat($filename))[2]; 5709 5710 $user_rwx = ($mode & S_IRWXU) >> 6; 5711 $group_read = ($mode & S_IRGRP) >> 3; 5712 $other_execute = $mode & S_IXOTH; 5713 5714 printf "Permissions are %04o\n", S_IMODE($mode), "\n"; 5715 5716 $is_setuid = $mode & S_ISUID; 5717 $is_setgid = S_ISDIR($mode); 5718 5719You could write the last two using the C<-u> and C<-d> operators. 5720The commonly available C<S_IF*> constants are 5721 5722 # Permissions: read, write, execute, for user, group, others. 5723 5724 S_IRWXU S_IRUSR S_IWUSR S_IXUSR 5725 S_IRWXG S_IRGRP S_IWGRP S_IXGRP 5726 S_IRWXO S_IROTH S_IWOTH S_IXOTH 5727 5728 # Setuid/Setgid/Stickiness/SaveText. 5729 # Note that the exact meaning of these is system dependent. 5730 5731 S_ISUID S_ISGID S_ISVTX S_ISTXT 5732 5733 # File types. Not necessarily all are available on your system. 5734 5735 S_IFREG S_IFDIR S_IFLNK S_IFBLK S_IFCHR S_IFIFO S_IFSOCK S_IFWHT S_ENFMT 5736 5737 # The following are compatibility aliases for S_IRUSR, S_IWUSR, S_IXUSR. 5738 5739 S_IREAD S_IWRITE S_IEXEC 5740 5741and the C<S_IF*> functions are 5742 5743 S_IMODE($mode) the part of $mode containing the permission bits 5744 and the setuid/setgid/sticky bits 5745 5746 S_IFMT($mode) the part of $mode containing the file type 5747 which can be bit-anded with e.g. S_IFREG 5748 or with the following functions 5749 5750 # The operators -f, -d, -l, -b, -c, -p, and -S. 5751 5752 S_ISREG($mode) S_ISDIR($mode) S_ISLNK($mode) 5753 S_ISBLK($mode) S_ISCHR($mode) S_ISFIFO($mode) S_ISSOCK($mode) 5754 5755 # No direct -X operator counterpart, but for the first one 5756 # the -g operator is often equivalent. The ENFMT stands for 5757 # record flocking enforcement, a platform-dependent feature. 5758 5759 S_ISENFMT($mode) S_ISWHT($mode) 5760 5761See your native chmod(2) and stat(2) documentation for more details 5762about the C<S_*> constants. To get status info for a symbolic link 5763instead of the target file behind the link, use the C<lstat> function. 5764 5765=item study SCALAR 5766X<study> 5767 5768=item study 5769 5770Takes extra time to study SCALAR (C<$_> if unspecified) in anticipation of 5771doing many pattern matches on the string before it is next modified. 5772This may or may not save time, depending on the nature and number of 5773patterns you are searching on, and on the distribution of character 5774frequencies in the string to be searched--you probably want to compare 5775run times with and without it to see which runs faster. Those loops 5776that scan for many short constant strings (including the constant 5777parts of more complex patterns) will benefit most. You may have only 5778one C<study> active at a time--if you study a different scalar the first 5779is "unstudied". (The way C<study> works is this: a linked list of every 5780character in the string to be searched is made, so we know, for 5781example, where all the C<'k'> characters are. From each search string, 5782the rarest character is selected, based on some static frequency tables 5783constructed from some C programs and English text. Only those places 5784that contain this "rarest" character are examined.) 5785 5786For example, here is a loop that inserts index producing entries 5787before any line containing a certain pattern: 5788 5789 while (<>) { 5790 study; 5791 print ".IX foo\n" if /\bfoo\b/; 5792 print ".IX bar\n" if /\bbar\b/; 5793 print ".IX blurfl\n" if /\bblurfl\b/; 5794 # ... 5795 print; 5796 } 5797 5798In searching for C</\bfoo\b/>, only those locations in C<$_> that contain C<f> 5799will be looked at, because C<f> is rarer than C<o>. In general, this is 5800a big win except in pathological cases. The only question is whether 5801it saves you more time than it took to build the linked list in the 5802first place. 5803 5804Note that if you have to look for strings that you don't know till 5805runtime, you can build an entire loop as a string and C<eval> that to 5806avoid recompiling all your patterns all the time. Together with 5807undefining C<$/> to input entire files as one record, this can be very 5808fast, often faster than specialized programs like fgrep(1). The following 5809scans a list of files (C<@files>) for a list of words (C<@words>), and prints 5810out the names of those files that contain a match: 5811 5812 $search = 'while (<>) { study;'; 5813 foreach $word (@words) { 5814 $search .= "++\$seen{\$ARGV} if /\\b$word\\b/;\n"; 5815 } 5816 $search .= "}"; 5817 @ARGV = @files; 5818 undef $/; 5819 eval $search; # this screams 5820 $/ = "\n"; # put back to normal input delimiter 5821 foreach $file (sort keys(%seen)) { 5822 print $file, "\n"; 5823 } 5824 5825=item sub NAME BLOCK 5826X<sub> 5827 5828=item sub NAME (PROTO) BLOCK 5829 5830=item sub NAME : ATTRS BLOCK 5831 5832=item sub NAME (PROTO) : ATTRS BLOCK 5833 5834This is subroutine definition, not a real function I<per se>. 5835Without a BLOCK it's just a forward declaration. Without a NAME, 5836it's an anonymous function declaration, and does actually return 5837a value: the CODE ref of the closure you just created. 5838 5839See L<perlsub> and L<perlref> for details about subroutines and 5840references, and L<attributes> and L<Attribute::Handlers> for more 5841information about attributes. 5842 5843=item substr EXPR,OFFSET,LENGTH,REPLACEMENT 5844X<substr> X<substring> X<mid> X<left> X<right> 5845 5846=item substr EXPR,OFFSET,LENGTH 5847 5848=item substr EXPR,OFFSET 5849 5850Extracts a substring out of EXPR and returns it. First character is at 5851offset C<0>, or whatever you've set C<$[> to (but don't do that). 5852If OFFSET is negative (or more precisely, less than C<$[>), starts 5853that far from the end of the string. If LENGTH is omitted, returns 5854everything to the end of the string. If LENGTH is negative, leaves that 5855many characters off the end of the string. 5856 5857You can use the substr() function as an lvalue, in which case EXPR 5858must itself be an lvalue. If you assign something shorter than LENGTH, 5859the string will shrink, and if you assign something longer than LENGTH, 5860the string will grow to accommodate it. To keep the string the same 5861length you may need to pad or chop your value using C<sprintf>. 5862 5863If OFFSET and LENGTH specify a substring that is partly outside the 5864string, only the part within the string is returned. If the substring 5865is beyond either end of the string, substr() returns the undefined 5866value and produces a warning. When used as an lvalue, specifying a 5867substring that is entirely outside the string is a fatal error. 5868Here's an example showing the behavior for boundary cases: 5869 5870 my $name = 'fred'; 5871 substr($name, 4) = 'dy'; # $name is now 'freddy' 5872 my $null = substr $name, 6, 2; # returns '' (no warning) 5873 my $oops = substr $name, 7; # returns undef, with warning 5874 substr($name, 7) = 'gap'; # fatal error 5875 5876An alternative to using substr() as an lvalue is to specify the 5877replacement string as the 4th argument. This allows you to replace 5878parts of the EXPR and return what was there before in one operation, 5879just as you can with splice(). 5880 5881 5882=item symlink OLDFILE,NEWFILE 5883X<symlink> X<link> X<symbolic link> X<link, symbolic> 5884 5885Creates a new filename symbolically linked to the old filename. 5886Returns C<1> for success, C<0> otherwise. On systems that don't support 5887symbolic links, produces a fatal error at run time. To check for that, 5888use eval: 5889 5890 $symlink_exists = eval { symlink("",""); 1 }; 5891 5892=item syscall NUMBER, LIST 5893X<syscall> X<system call> 5894 5895Calls the system call specified as the first element of the list, 5896passing the remaining elements as arguments to the system call. If 5897unimplemented, produces a fatal error. The arguments are interpreted 5898as follows: if a given argument is numeric, the argument is passed as 5899an int. If not, the pointer to the string value is passed. You are 5900responsible to make sure a string is pre-extended long enough to 5901receive any result that might be written into a string. You can't use a 5902string literal (or other read-only string) as an argument to C<syscall> 5903because Perl has to assume that any string pointer might be written 5904through. If your 5905integer arguments are not literals and have never been interpreted in a 5906numeric context, you may need to add C<0> to them to force them to look 5907like numbers. This emulates the C<syswrite> function (or vice versa): 5908 5909 require 'syscall.ph'; # may need to run h2ph 5910 $s = "hi there\n"; 5911 syscall(&SYS_write, fileno(STDOUT), $s, length $s); 5912 5913Note that Perl supports passing of up to only 14 arguments to your system call, 5914which in practice should usually suffice. 5915 5916Syscall returns whatever value returned by the system call it calls. 5917If the system call fails, C<syscall> returns C<-1> and sets C<$!> (errno). 5918Note that some system calls can legitimately return C<-1>. The proper 5919way to handle such calls is to assign C<$!=0;> before the call and 5920check the value of C<$!> if syscall returns C<-1>. 5921 5922There's a problem with C<syscall(&SYS_pipe)>: it returns the file 5923number of the read end of the pipe it creates. There is no way 5924to retrieve the file number of the other end. You can avoid this 5925problem by using C<pipe> instead. 5926 5927=item sysopen FILEHANDLE,FILENAME,MODE 5928X<sysopen> 5929 5930=item sysopen FILEHANDLE,FILENAME,MODE,PERMS 5931 5932Opens the file whose filename is given by FILENAME, and associates it 5933with FILEHANDLE. If FILEHANDLE is an expression, its value is used as 5934the name of the real filehandle wanted. This function calls the 5935underlying operating system's C<open> function with the parameters 5936FILENAME, MODE, PERMS. 5937 5938The possible values and flag bits of the MODE parameter are 5939system-dependent; they are available via the standard module C<Fcntl>. 5940See the documentation of your operating system's C<open> to see which 5941values and flag bits are available. You may combine several flags 5942using the C<|>-operator. 5943 5944Some of the most common values are C<O_RDONLY> for opening the file in 5945read-only mode, C<O_WRONLY> for opening the file in write-only mode, 5946and C<O_RDWR> for opening the file in read-write mode. 5947X<O_RDONLY> X<O_RDWR> X<O_WRONLY> 5948 5949For historical reasons, some values work on almost every system 5950supported by perl: zero means read-only, one means write-only, and two 5951means read/write. We know that these values do I<not> work under 5952OS/390 & VM/ESA Unix and on the Macintosh; you probably don't want to 5953use them in new code. 5954 5955If the file named by FILENAME does not exist and the C<open> call creates 5956it (typically because MODE includes the C<O_CREAT> flag), then the value of 5957PERMS specifies the permissions of the newly created file. If you omit 5958the PERMS argument to C<sysopen>, Perl uses the octal value C<0666>. 5959These permission values need to be in octal, and are modified by your 5960process's current C<umask>. 5961X<O_CREAT> 5962 5963In many systems the C<O_EXCL> flag is available for opening files in 5964exclusive mode. This is B<not> locking: exclusiveness means here that 5965if the file already exists, sysopen() fails. C<O_EXCL> may not work 5966on network filesystems, and has no effect unless the C<O_CREAT> flag 5967is set as well. Setting C<O_CREAT|O_EXCL> prevents the file from 5968being opened if it is a symbolic link. It does not protect against 5969symbolic links in the file's path. 5970X<O_EXCL> 5971 5972Sometimes you may want to truncate an already-existing file. This 5973can be done using the C<O_TRUNC> flag. The behavior of 5974C<O_TRUNC> with C<O_RDONLY> is undefined. 5975X<O_TRUNC> 5976 5977You should seldom if ever use C<0644> as argument to C<sysopen>, because 5978that takes away the user's option to have a more permissive umask. 5979Better to omit it. See the perlfunc(1) entry on C<umask> for more 5980on this. 5981 5982Note that C<sysopen> depends on the fdopen() C library function. 5983On many UNIX systems, fdopen() is known to fail when file descriptors 5984exceed a certain value, typically 255. If you need more file 5985descriptors than that, consider rebuilding Perl to use the C<sfio> 5986library, or perhaps using the POSIX::open() function. 5987 5988See L<perlopentut> for a kinder, gentler explanation of opening files. 5989 5990=item sysread FILEHANDLE,SCALAR,LENGTH,OFFSET 5991X<sysread> 5992 5993=item sysread FILEHANDLE,SCALAR,LENGTH 5994 5995Attempts to read LENGTH bytes of data into variable SCALAR from the 5996specified FILEHANDLE, using the system call read(2). It bypasses 5997buffered IO, so mixing this with other kinds of reads, C<print>, 5998C<write>, C<seek>, C<tell>, or C<eof> can cause confusion because the 5999perlio or stdio layers usually buffers data. Returns the number of 6000bytes actually read, C<0> at end of file, or undef if there was an 6001error (in the latter case C<$!> is also set). SCALAR will be grown or 6002shrunk so that the last byte actually read is the last byte of the 6003scalar after the read. 6004 6005An OFFSET may be specified to place the read data at some place in the 6006string other than the beginning. A negative OFFSET specifies 6007placement at that many characters counting backwards from the end of 6008the string. A positive OFFSET greater than the length of SCALAR 6009results in the string being padded to the required size with C<"\0"> 6010bytes before the result of the read is appended. 6011 6012There is no syseof() function, which is ok, since eof() doesn't work 6013very well on device files (like ttys) anyway. Use sysread() and check 6014for a return value for 0 to decide whether you're done. 6015 6016Note that if the filehandle has been marked as C<:utf8> Unicode 6017characters are read instead of bytes (the LENGTH, OFFSET, and the 6018return value of sysread() are in Unicode characters). 6019The C<:encoding(...)> layer implicitly introduces the C<:utf8> layer. 6020See L</binmode>, L</open>, and the C<open> pragma, L<open>. 6021 6022=item sysseek FILEHANDLE,POSITION,WHENCE 6023X<sysseek> X<lseek> 6024 6025Sets FILEHANDLE's system position in bytes using the system call 6026lseek(2). FILEHANDLE may be an expression whose value gives the name 6027of the filehandle. The values for WHENCE are C<0> to set the new 6028position to POSITION, C<1> to set the it to the current position plus 6029POSITION, and C<2> to set it to EOF plus POSITION (typically 6030negative). 6031 6032Note the I<in bytes>: even if the filehandle has been set to operate 6033on characters (for example by using the C<:utf8> I/O layer), tell() 6034will return byte offsets, not character offsets (because implementing 6035that would render sysseek() very slow). 6036 6037sysseek() bypasses normal buffered IO, so mixing this with reads (other 6038than C<sysread>, for example C<< <> >> or read()) C<print>, C<write>, 6039C<seek>, C<tell>, or C<eof> may cause confusion. 6040 6041For WHENCE, you may also use the constants C<SEEK_SET>, C<SEEK_CUR>, 6042and C<SEEK_END> (start of the file, current position, end of the file) 6043from the Fcntl module. Use of the constants is also more portable 6044than relying on 0, 1, and 2. For example to define a "systell" function: 6045 6046 use Fcntl 'SEEK_CUR'; 6047 sub systell { sysseek($_[0], 0, SEEK_CUR) } 6048 6049Returns the new position, or the undefined value on failure. A position 6050of zero is returned as the string C<"0 but true">; thus C<sysseek> returns 6051true on success and false on failure, yet you can still easily determine 6052the new position. 6053 6054=item system LIST 6055X<system> X<shell> 6056 6057=item system PROGRAM LIST 6058 6059Does exactly the same thing as C<exec LIST>, except that a fork is 6060done first, and the parent process waits for the child process to 6061complete. Note that argument processing varies depending on the 6062number of arguments. If there is more than one argument in LIST, 6063or if LIST is an array with more than one value, starts the program 6064given by the first element of the list with arguments given by the 6065rest of the list. If there is only one scalar argument, the argument 6066is checked for shell metacharacters, and if there are any, the 6067entire argument is passed to the system's command shell for parsing 6068(this is C</bin/sh -c> on Unix platforms, but varies on other 6069platforms). If there are no shell metacharacters in the argument, 6070it is split into words and passed directly to C<execvp>, which is 6071more efficient. 6072 6073Beginning with v5.6.0, Perl will attempt to flush all files opened for 6074output before any operation that may do a fork, but this may not be 6075supported on some platforms (see L<perlport>). To be safe, you may need 6076to set C<$|> ($AUTOFLUSH in English) or call the C<autoflush()> method 6077of C<IO::Handle> on any open handles. 6078 6079The return value is the exit status of the program as returned by the 6080C<wait> call. To get the actual exit value, shift right by eight (see 6081below). See also L</exec>. This is I<not> what you want to use to capture 6082the output from a command, for that you should use merely backticks or 6083C<qx//>, as described in L<perlop/"`STRING`">. Return value of -1 6084indicates a failure to start the program or an error of the wait(2) system 6085call (inspect $! for the reason). 6086 6087Like C<exec>, C<system> allows you to lie to a program about its name if 6088you use the C<system PROGRAM LIST> syntax. Again, see L</exec>. 6089 6090Since C<SIGINT> and C<SIGQUIT> are ignored during the execution of 6091C<system>, if you expect your program to terminate on receipt of these 6092signals you will need to arrange to do so yourself based on the return 6093value. 6094 6095 @args = ("command", "arg1", "arg2"); 6096 system(@args) == 0 6097 or die "system @args failed: $?" 6098 6099You can check all the failure possibilities by inspecting 6100C<$?> like this: 6101 6102 if ($? == -1) { 6103 print "failed to execute: $!\n"; 6104 } 6105 elsif ($? & 127) { 6106 printf "child died with signal %d, %s coredump\n", 6107 ($? & 127), ($? & 128) ? 'with' : 'without'; 6108 } 6109 else { 6110 printf "child exited with value %d\n", $? >> 8; 6111 } 6112 6113or more portably by using the W*() calls of the POSIX extension; 6114see L<perlport> for more information. 6115 6116When the arguments get executed via the system shell, results 6117and return codes will be subject to its quirks and capabilities. 6118See L<perlop/"`STRING`"> and L</exec> for details. 6119 6120=item syswrite FILEHANDLE,SCALAR,LENGTH,OFFSET 6121X<syswrite> 6122 6123=item syswrite FILEHANDLE,SCALAR,LENGTH 6124 6125=item syswrite FILEHANDLE,SCALAR 6126 6127Attempts to write LENGTH bytes of data from variable SCALAR to the 6128specified FILEHANDLE, using the system call write(2). If LENGTH is 6129not specified, writes whole SCALAR. It bypasses buffered IO, so 6130mixing this with reads (other than C<sysread())>, C<print>, C<write>, 6131C<seek>, C<tell>, or C<eof> may cause confusion because the perlio and 6132stdio layers usually buffers data. Returns the number of bytes 6133actually written, or C<undef> if there was an error (in this case the 6134errno variable C<$!> is also set). If the LENGTH is greater than the 6135available data in the SCALAR after the OFFSET, only as much data as is 6136available will be written. 6137 6138An OFFSET may be specified to write the data from some part of the 6139string other than the beginning. A negative OFFSET specifies writing 6140that many characters counting backwards from the end of the string. 6141In the case the SCALAR is empty you can use OFFSET but only zero offset. 6142 6143Note that if the filehandle has been marked as C<:utf8>, Unicode 6144characters are written instead of bytes (the LENGTH, OFFSET, and the 6145return value of syswrite() are in UTF-8 encoded Unicode characters). 6146The C<:encoding(...)> layer implicitly introduces the C<:utf8> layer. 6147See L</binmode>, L</open>, and the C<open> pragma, L<open>. 6148 6149=item tell FILEHANDLE 6150X<tell> 6151 6152=item tell 6153 6154Returns the current position I<in bytes> for FILEHANDLE, or -1 on 6155error. FILEHANDLE may be an expression whose value gives the name of 6156the actual filehandle. If FILEHANDLE is omitted, assumes the file 6157last read. 6158 6159Note the I<in bytes>: even if the filehandle has been set to 6160operate on characters (for example by using the C<:utf8> open 6161layer), tell() will return byte offsets, not character offsets 6162(because that would render seek() and tell() rather slow). 6163 6164The return value of tell() for the standard streams like the STDIN 6165depends on the operating system: it may return -1 or something else. 6166tell() on pipes, fifos, and sockets usually returns -1. 6167 6168There is no C<systell> function. Use C<sysseek(FH, 0, 1)> for that. 6169 6170Do not use tell() (or other buffered I/O operations) on a file handle 6171that has been manipulated by sysread(), syswrite() or sysseek(). 6172Those functions ignore the buffering, while tell() does not. 6173 6174=item telldir DIRHANDLE 6175X<telldir> 6176 6177Returns the current position of the C<readdir> routines on DIRHANDLE. 6178Value may be given to C<seekdir> to access a particular location in a 6179directory. C<telldir> has the same caveats about possible directory 6180compaction as the corresponding system library routine. 6181 6182=item tie VARIABLE,CLASSNAME,LIST 6183X<tie> 6184 6185This function binds a variable to a package class that will provide the 6186implementation for the variable. VARIABLE is the name of the variable 6187to be enchanted. CLASSNAME is the name of a class implementing objects 6188of correct type. Any additional arguments are passed to the C<new> 6189method of the class (meaning C<TIESCALAR>, C<TIEHANDLE>, C<TIEARRAY>, 6190or C<TIEHASH>). Typically these are arguments such as might be passed 6191to the C<dbm_open()> function of C. The object returned by the C<new> 6192method is also returned by the C<tie> function, which would be useful 6193if you want to access other methods in CLASSNAME. 6194 6195Note that functions such as C<keys> and C<values> may return huge lists 6196when used on large objects, like DBM files. You may prefer to use the 6197C<each> function to iterate over such. Example: 6198 6199 # print out history file offsets 6200 use NDBM_File; 6201 tie(%HIST, 'NDBM_File', '/usr/lib/news/history', 1, 0); 6202 while (($key,$val) = each %HIST) { 6203 print $key, ' = ', unpack('L',$val), "\n"; 6204 } 6205 untie(%HIST); 6206 6207A class implementing a hash should have the following methods: 6208 6209 TIEHASH classname, LIST 6210 FETCH this, key 6211 STORE this, key, value 6212 DELETE this, key 6213 CLEAR this 6214 EXISTS this, key 6215 FIRSTKEY this 6216 NEXTKEY this, lastkey 6217 SCALAR this 6218 DESTROY this 6219 UNTIE this 6220 6221A class implementing an ordinary array should have the following methods: 6222 6223 TIEARRAY classname, LIST 6224 FETCH this, key 6225 STORE this, key, value 6226 FETCHSIZE this 6227 STORESIZE this, count 6228 CLEAR this 6229 PUSH this, LIST 6230 POP this 6231 SHIFT this 6232 UNSHIFT this, LIST 6233 SPLICE this, offset, length, LIST 6234 EXTEND this, count 6235 DESTROY this 6236 UNTIE this 6237 6238A class implementing a file handle should have the following methods: 6239 6240 TIEHANDLE classname, LIST 6241 READ this, scalar, length, offset 6242 READLINE this 6243 GETC this 6244 WRITE this, scalar, length, offset 6245 PRINT this, LIST 6246 PRINTF this, format, LIST 6247 BINMODE this 6248 EOF this 6249 FILENO this 6250 SEEK this, position, whence 6251 TELL this 6252 OPEN this, mode, LIST 6253 CLOSE this 6254 DESTROY this 6255 UNTIE this 6256 6257A class implementing a scalar should have the following methods: 6258 6259 TIESCALAR classname, LIST 6260 FETCH this, 6261 STORE this, value 6262 DESTROY this 6263 UNTIE this 6264 6265Not all methods indicated above need be implemented. See L<perltie>, 6266L<Tie::Hash>, L<Tie::Array>, L<Tie::Scalar>, and L<Tie::Handle>. 6267 6268Unlike C<dbmopen>, the C<tie> function will not use or require a module 6269for you--you need to do that explicitly yourself. See L<DB_File> 6270or the F<Config> module for interesting C<tie> implementations. 6271 6272For further details see L<perltie>, L<"tied VARIABLE">. 6273 6274=item tied VARIABLE 6275X<tied> 6276 6277Returns a reference to the object underlying VARIABLE (the same value 6278that was originally returned by the C<tie> call that bound the variable 6279to a package.) Returns the undefined value if VARIABLE isn't tied to a 6280package. 6281 6282=item time 6283X<time> X<epoch> 6284 6285Returns the number of non-leap seconds since whatever time the system 6286considers to be the epoch, suitable for feeding to C<gmtime> and 6287C<localtime>. On most systems the epoch is 00:00:00 UTC, January 1, 1970; 6288a prominent exception being Mac OS Classic which uses 00:00:00, January 1, 62891904 in the current local time zone for its epoch. 6290 6291For measuring time in better granularity than one second, 6292you may use either the Time::HiRes module (from CPAN, and starting from 6293Perl 5.8 part of the standard distribution), or if you have 6294gettimeofday(2), you may be able to use the C<syscall> interface of Perl. 6295See L<perlfaq8> for details. 6296 6297=item times 6298X<times> 6299 6300Returns a four-element list giving the user and system times, in 6301seconds, for this process and the children of this process. 6302 6303 ($user,$system,$cuser,$csystem) = times; 6304 6305In scalar context, C<times> returns C<$user>. 6306 6307=item tr/// 6308 6309The transliteration operator. Same as C<y///>. See L<perlop>. 6310 6311=item truncate FILEHANDLE,LENGTH 6312X<truncate> 6313 6314=item truncate EXPR,LENGTH 6315 6316Truncates the file opened on FILEHANDLE, or named by EXPR, to the 6317specified length. Produces a fatal error if truncate isn't implemented 6318on your system. Returns true if successful, the undefined value 6319otherwise. 6320 6321The behavior is undefined if LENGTH is greater than the length of the 6322file. 6323 6324=item uc EXPR 6325X<uc> X<uppercase> X<toupper> 6326 6327=item uc 6328 6329Returns an uppercased version of EXPR. This is the internal function 6330implementing the C<\U> escape in double-quoted strings. Respects 6331current LC_CTYPE locale if C<use locale> in force. See L<perllocale> 6332and L<perlunicode> for more details about locale and Unicode support. 6333It does not attempt to do titlecase mapping on initial letters. See 6334C<ucfirst> for that. 6335 6336If EXPR is omitted, uses C<$_>. 6337 6338=item ucfirst EXPR 6339X<ucfirst> X<uppercase> 6340 6341=item ucfirst 6342 6343Returns the value of EXPR with the first character in uppercase 6344(titlecase in Unicode). This is the internal function implementing 6345the C<\u> escape in double-quoted strings. Respects current LC_CTYPE 6346locale if C<use locale> in force. See L<perllocale> and L<perlunicode> 6347for more details about locale and Unicode support. 6348 6349If EXPR is omitted, uses C<$_>. 6350 6351=item umask EXPR 6352X<umask> 6353 6354=item umask 6355 6356Sets the umask for the process to EXPR and returns the previous value. 6357If EXPR is omitted, merely returns the current umask. 6358 6359The Unix permission C<rwxr-x---> is represented as three sets of three 6360bits, or three octal digits: C<0750> (the leading 0 indicates octal 6361and isn't one of the digits). The C<umask> value is such a number 6362representing disabled permissions bits. The permission (or "mode") 6363values you pass C<mkdir> or C<sysopen> are modified by your umask, so 6364even if you tell C<sysopen> to create a file with permissions C<0777>, 6365if your umask is C<0022> then the file will actually be created with 6366permissions C<0755>. If your C<umask> were C<0027> (group can't 6367write; others can't read, write, or execute), then passing 6368C<sysopen> C<0666> would create a file with mode C<0640> (C<0666 &~ 6369027> is C<0640>). 6370 6371Here's some advice: supply a creation mode of C<0666> for regular 6372files (in C<sysopen>) and one of C<0777> for directories (in 6373C<mkdir>) and executable files. This gives users the freedom of 6374choice: if they want protected files, they might choose process umasks 6375of C<022>, C<027>, or even the particularly antisocial mask of C<077>. 6376Programs should rarely if ever make policy decisions better left to 6377the user. The exception to this is when writing files that should be 6378kept private: mail files, web browser cookies, I<.rhosts> files, and 6379so on. 6380 6381If umask(2) is not implemented on your system and you are trying to 6382restrict access for I<yourself> (i.e., (EXPR & 0700) > 0), produces a 6383fatal error at run time. If umask(2) is not implemented and you are 6384not trying to restrict access for yourself, returns C<undef>. 6385 6386Remember that a umask is a number, usually given in octal; it is I<not> a 6387string of octal digits. See also L</oct>, if all you have is a string. 6388 6389=item undef EXPR 6390X<undef> X<undefine> 6391 6392=item undef 6393 6394Undefines the value of EXPR, which must be an lvalue. Use only on a 6395scalar value, an array (using C<@>), a hash (using C<%>), a subroutine 6396(using C<&>), or a typeglob (using C<*>). (Saying C<undef $hash{$key}> 6397will probably not do what you expect on most predefined variables or 6398DBM list values, so don't do that; see L<delete>.) Always returns the 6399undefined value. You can omit the EXPR, in which case nothing is 6400undefined, but you still get an undefined value that you could, for 6401instance, return from a subroutine, assign to a variable or pass as a 6402parameter. Examples: 6403 6404 undef $foo; 6405 undef $bar{'blurfl'}; # Compare to: delete $bar{'blurfl'}; 6406 undef @ary; 6407 undef %hash; 6408 undef &mysub; 6409 undef *xyz; # destroys $xyz, @xyz, %xyz, &xyz, etc. 6410 return (wantarray ? (undef, $errmsg) : undef) if $they_blew_it; 6411 select undef, undef, undef, 0.25; 6412 ($a, $b, undef, $c) = &foo; # Ignore third value returned 6413 6414Note that this is a unary operator, not a list operator. 6415 6416=item unlink LIST 6417X<unlink> X<delete> X<remove> X<rm> 6418 6419=item unlink 6420 6421Deletes a list of files. Returns the number of files successfully 6422deleted. 6423 6424 $cnt = unlink 'a', 'b', 'c'; 6425 unlink @goners; 6426 unlink <*.bak>; 6427 6428Note: C<unlink> will not attempt to delete directories unless you are superuser 6429and the B<-U> flag is supplied to Perl. Even if these conditions are 6430met, be warned that unlinking a directory can inflict damage on your 6431filesystem. Finally, using C<unlink> on directories is not supported on 6432many operating systems. Use C<rmdir> instead. 6433 6434If LIST is omitted, uses C<$_>. 6435 6436=item unpack TEMPLATE,EXPR 6437X<unpack> 6438 6439C<unpack> does the reverse of C<pack>: it takes a string 6440and expands it out into a list of values. 6441(In scalar context, it returns merely the first value produced.) 6442 6443The string is broken into chunks described by the TEMPLATE. Each chunk 6444is converted separately to a value. Typically, either the string is a result 6445of C<pack>, or the bytes of the string represent a C structure of some 6446kind. 6447 6448The TEMPLATE has the same format as in the C<pack> function. 6449Here's a subroutine that does substring: 6450 6451 sub substr { 6452 my($what,$where,$howmuch) = @_; 6453 unpack("x$where a$howmuch", $what); 6454 } 6455 6456and then there's 6457 6458 sub ordinal { unpack("c",$_[0]); } # same as ord() 6459 6460In addition to fields allowed in pack(), you may prefix a field with 6461a %<number> to indicate that 6462you want a <number>-bit checksum of the items instead of the items 6463themselves. Default is a 16-bit checksum. Checksum is calculated by 6464summing numeric values of expanded values (for string fields the sum of 6465C<ord($char)> is taken, for bit fields the sum of zeroes and ones). 6466 6467For example, the following 6468computes the same number as the System V sum program: 6469 6470 $checksum = do { 6471 local $/; # slurp! 6472 unpack("%32C*",<>) % 65535; 6473 }; 6474 6475The following efficiently counts the number of set bits in a bit vector: 6476 6477 $setbits = unpack("%32b*", $selectmask); 6478 6479The C<p> and C<P> formats should be used with care. Since Perl 6480has no way of checking whether the value passed to C<unpack()> 6481corresponds to a valid memory location, passing a pointer value that's 6482not known to be valid is likely to have disastrous consequences. 6483 6484If there are more pack codes or if the repeat count of a field or a group 6485is larger than what the remainder of the input string allows, the result 6486is not well defined: in some cases, the repeat count is decreased, or 6487C<unpack()> will produce null strings or zeroes, or terminate with an 6488error. If the input string is longer than one described by the TEMPLATE, 6489the rest is ignored. 6490 6491See L</pack> for more examples and notes. 6492 6493=item untie VARIABLE 6494X<untie> 6495 6496Breaks the binding between a variable and a package. (See C<tie>.) 6497Has no effect if the variable is not tied. 6498 6499=item unshift ARRAY,LIST 6500X<unshift> 6501 6502Does the opposite of a C<shift>. Or the opposite of a C<push>, 6503depending on how you look at it. Prepends list to the front of the 6504array, and returns the new number of elements in the array. 6505 6506 unshift(@ARGV, '-e') unless $ARGV[0] =~ /^-/; 6507 6508Note the LIST is prepended whole, not one element at a time, so the 6509prepended elements stay in the same order. Use C<reverse> to do the 6510reverse. 6511 6512=item use Module VERSION LIST 6513X<use> X<module> X<import> 6514 6515=item use Module VERSION 6516 6517=item use Module LIST 6518 6519=item use Module 6520 6521=item use VERSION 6522 6523Imports some semantics into the current package from the named module, 6524generally by aliasing certain subroutine or variable names into your 6525package. It is exactly equivalent to 6526 6527 BEGIN { require Module; import Module LIST; } 6528 6529except that Module I<must> be a bareword. 6530 6531VERSION may be either a numeric argument such as 5.006, which will be 6532compared to C<$]>, or a literal of the form v5.6.1, which will be compared 6533to C<$^V> (aka $PERL_VERSION. A fatal error is produced if VERSION is 6534greater than the version of the current Perl interpreter; Perl will not 6535attempt to parse the rest of the file. Compare with L</require>, which can 6536do a similar check at run time. 6537 6538Specifying VERSION as a literal of the form v5.6.1 should generally be 6539avoided, because it leads to misleading error messages under earlier 6540versions of Perl that do not support this syntax. The equivalent numeric 6541version should be used instead. 6542 6543 use v5.6.1; # compile time version check 6544 use 5.6.1; # ditto 6545 use 5.006_001; # ditto; preferred for backwards compatibility 6546 6547This is often useful if you need to check the current Perl version before 6548C<use>ing library modules that have changed in incompatible ways from 6549older versions of Perl. (We try not to do this more than we have to.) 6550 6551The C<BEGIN> forces the C<require> and C<import> to happen at compile time. The 6552C<require> makes sure the module is loaded into memory if it hasn't been 6553yet. The C<import> is not a builtin--it's just an ordinary static method 6554call into the C<Module> package to tell the module to import the list of 6555features back into the current package. The module can implement its 6556C<import> method any way it likes, though most modules just choose to 6557derive their C<import> method via inheritance from the C<Exporter> class that 6558is defined in the C<Exporter> module. See L<Exporter>. If no C<import> 6559method can be found then the call is skipped. 6560 6561If you do not want to call the package's C<import> method (for instance, 6562to stop your namespace from being altered), explicitly supply the empty list: 6563 6564 use Module (); 6565 6566That is exactly equivalent to 6567 6568 BEGIN { require Module } 6569 6570If the VERSION argument is present between Module and LIST, then the 6571C<use> will call the VERSION method in class Module with the given 6572version as an argument. The default VERSION method, inherited from 6573the UNIVERSAL class, croaks if the given version is larger than the 6574value of the variable C<$Module::VERSION>. 6575 6576Again, there is a distinction between omitting LIST (C<import> called 6577with no arguments) and an explicit empty LIST C<()> (C<import> not 6578called). Note that there is no comma after VERSION! 6579 6580Because this is a wide-open interface, pragmas (compiler directives) 6581are also implemented this way. Currently implemented pragmas are: 6582 6583 use constant; 6584 use diagnostics; 6585 use integer; 6586 use sigtrap qw(SEGV BUS); 6587 use strict qw(subs vars refs); 6588 use subs qw(afunc blurfl); 6589 use warnings qw(all); 6590 use sort qw(stable _quicksort _mergesort); 6591 6592Some of these pseudo-modules import semantics into the current 6593block scope (like C<strict> or C<integer>, unlike ordinary modules, 6594which import symbols into the current package (which are effective 6595through the end of the file). 6596 6597There's a corresponding C<no> command that unimports meanings imported 6598by C<use>, i.e., it calls C<unimport Module LIST> instead of C<import>. 6599 6600 no integer; 6601 no strict 'refs'; 6602 no warnings; 6603 6604See L<perlmodlib> for a list of standard modules and pragmas. See L<perlrun> 6605for the C<-M> and C<-m> command-line options to perl that give C<use> 6606functionality from the command-line. 6607 6608=item utime LIST 6609X<utime> 6610 6611Changes the access and modification times on each file of a list of 6612files. The first two elements of the list must be the NUMERICAL access 6613and modification times, in that order. Returns the number of files 6614successfully changed. The inode change time of each file is set 6615to the current time. For example, this code has the same effect as the 6616Unix touch(1) command when the files I<already exist> and belong to 6617the user running the program: 6618 6619 #!/usr/bin/perl 6620 $atime = $mtime = time; 6621 utime $atime, $mtime, @ARGV; 6622 6623Since perl 5.7.2, if the first two elements of the list are C<undef>, then 6624the utime(2) function in the C library will be called with a null second 6625argument. On most systems, this will set the file's access and 6626modification times to the current time (i.e. equivalent to the example 6627above) and will even work on other users' files where you have write 6628permission: 6629 6630 utime undef, undef, @ARGV; 6631 6632Under NFS this will use the time of the NFS server, not the time of 6633the local machine. If there is a time synchronization problem, the 6634NFS server and local machine will have different times. The Unix 6635touch(1) command will in fact normally use this form instead of the 6636one shown in the first example. 6637 6638Note that only passing one of the first two elements as C<undef> will 6639be equivalent of passing it as 0 and will not have the same effect as 6640described when they are both C<undef>. This case will also trigger an 6641uninitialized warning. 6642 6643=item values HASH 6644X<values> 6645 6646Returns a list consisting of all the values of the named hash. 6647(In a scalar context, returns the number of values.) 6648 6649The values are returned in an apparently random order. The actual 6650random order is subject to change in future versions of perl, but it 6651is guaranteed to be the same order as either the C<keys> or C<each> 6652function would produce on the same (unmodified) hash. Since Perl 66535.8.1 the ordering is different even between different runs of Perl 6654for security reasons (see L<perlsec/"Algorithmic Complexity Attacks">). 6655 6656As a side effect, calling values() resets the HASH's internal iterator, 6657see L</each>. (In particular, calling values() in void context resets 6658the iterator with no other overhead.) 6659 6660Note that the values are not copied, which means modifying them will 6661modify the contents of the hash: 6662 6663 for (values %hash) { s/foo/bar/g } # modifies %hash values 6664 for (@hash{keys %hash}) { s/foo/bar/g } # same 6665 6666See also C<keys>, C<each>, and C<sort>. 6667 6668=item vec EXPR,OFFSET,BITS 6669X<vec> X<bit> X<bit vector> 6670 6671Treats the string in EXPR as a bit vector made up of elements of 6672width BITS, and returns the value of the element specified by OFFSET 6673as an unsigned integer. BITS therefore specifies the number of bits 6674that are reserved for each element in the bit vector. This must 6675be a power of two from 1 to 32 (or 64, if your platform supports 6676that). 6677 6678If BITS is 8, "elements" coincide with bytes of the input string. 6679 6680If BITS is 16 or more, bytes of the input string are grouped into chunks 6681of size BITS/8, and each group is converted to a number as with 6682pack()/unpack() with big-endian formats C<n>/C<N> (and analogously 6683for BITS==64). See L<"pack"> for details. 6684 6685If bits is 4 or less, the string is broken into bytes, then the bits 6686of each byte are broken into 8/BITS groups. Bits of a byte are 6687numbered in a little-endian-ish way, as in C<0x01>, C<0x02>, 6688C<0x04>, C<0x08>, C<0x10>, C<0x20>, C<0x40>, C<0x80>. For example, 6689breaking the single input byte C<chr(0x36)> into two groups gives a list 6690C<(0x6, 0x3)>; breaking it into 4 groups gives C<(0x2, 0x1, 0x3, 0x0)>. 6691 6692C<vec> may also be assigned to, in which case parentheses are needed 6693to give the expression the correct precedence as in 6694 6695 vec($image, $max_x * $x + $y, 8) = 3; 6696 6697If the selected element is outside the string, the value 0 is returned. 6698If an element off the end of the string is written to, Perl will first 6699extend the string with sufficiently many zero bytes. It is an error 6700to try to write off the beginning of the string (i.e. negative OFFSET). 6701 6702The string should not contain any character with the value > 255 (which 6703can only happen if you're using UTF-8 encoding). If it does, it will be 6704treated as something that is not UTF-8 encoded. When the C<vec> was 6705assigned to, other parts of your program will also no longer consider the 6706string to be UTF-8 encoded. In other words, if you do have such characters 6707in your string, vec() will operate on the actual byte string, and not the 6708conceptual character string. 6709 6710Strings created with C<vec> can also be manipulated with the logical 6711operators C<|>, C<&>, C<^>, and C<~>. These operators will assume a bit 6712vector operation is desired when both operands are strings. 6713See L<perlop/"Bitwise String Operators">. 6714 6715The following code will build up an ASCII string saying C<'PerlPerlPerl'>. 6716The comments show the string after each step. Note that this code works 6717in the same way on big-endian or little-endian machines. 6718 6719 my $foo = ''; 6720 vec($foo, 0, 32) = 0x5065726C; # 'Perl' 6721 6722 # $foo eq "Perl" eq "\x50\x65\x72\x6C", 32 bits 6723 print vec($foo, 0, 8); # prints 80 == 0x50 == ord('P') 6724 6725 vec($foo, 2, 16) = 0x5065; # 'PerlPe' 6726 vec($foo, 3, 16) = 0x726C; # 'PerlPerl' 6727 vec($foo, 8, 8) = 0x50; # 'PerlPerlP' 6728 vec($foo, 9, 8) = 0x65; # 'PerlPerlPe' 6729 vec($foo, 20, 4) = 2; # 'PerlPerlPe' . "\x02" 6730 vec($foo, 21, 4) = 7; # 'PerlPerlPer' 6731 # 'r' is "\x72" 6732 vec($foo, 45, 2) = 3; # 'PerlPerlPer' . "\x0c" 6733 vec($foo, 93, 1) = 1; # 'PerlPerlPer' . "\x2c" 6734 vec($foo, 94, 1) = 1; # 'PerlPerlPerl' 6735 # 'l' is "\x6c" 6736 6737To transform a bit vector into a string or list of 0's and 1's, use these: 6738 6739 $bits = unpack("b*", $vector); 6740 @bits = split(//, unpack("b*", $vector)); 6741 6742If you know the exact length in bits, it can be used in place of the C<*>. 6743 6744Here is an example to illustrate how the bits actually fall in place: 6745 6746 #!/usr/bin/perl -wl 6747 6748 print <<'EOT'; 6749 0 1 2 3 6750 unpack("V",$_) 01234567890123456789012345678901 6751 ------------------------------------------------------------------ 6752 EOT 6753 6754 for $w (0..3) { 6755 $width = 2**$w; 6756 for ($shift=0; $shift < $width; ++$shift) { 6757 for ($off=0; $off < 32/$width; ++$off) { 6758 $str = pack("B*", "0"x32); 6759 $bits = (1<<$shift); 6760 vec($str, $off, $width) = $bits; 6761 $res = unpack("b*",$str); 6762 $val = unpack("V", $str); 6763 write; 6764 } 6765 } 6766 } 6767 6768 format STDOUT = 6769 vec($_,@#,@#) = @<< == @######### @>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> 6770 $off, $width, $bits, $val, $res 6771 . 6772 __END__ 6773 6774Regardless of the machine architecture on which it is run, the above 6775example should print the following table: 6776 6777 0 1 2 3 6778 unpack("V",$_) 01234567890123456789012345678901 6779 ------------------------------------------------------------------ 6780 vec($_, 0, 1) = 1 == 1 10000000000000000000000000000000 6781 vec($_, 1, 1) = 1 == 2 01000000000000000000000000000000 6782 vec($_, 2, 1) = 1 == 4 00100000000000000000000000000000 6783 vec($_, 3, 1) = 1 == 8 00010000000000000000000000000000 6784 vec($_, 4, 1) = 1 == 16 00001000000000000000000000000000 6785 vec($_, 5, 1) = 1 == 32 00000100000000000000000000000000 6786 vec($_, 6, 1) = 1 == 64 00000010000000000000000000000000 6787 vec($_, 7, 1) = 1 == 128 00000001000000000000000000000000 6788 vec($_, 8, 1) = 1 == 256 00000000100000000000000000000000 6789 vec($_, 9, 1) = 1 == 512 00000000010000000000000000000000 6790 vec($_,10, 1) = 1 == 1024 00000000001000000000000000000000 6791 vec($_,11, 1) = 1 == 2048 00000000000100000000000000000000 6792 vec($_,12, 1) = 1 == 4096 00000000000010000000000000000000 6793 vec($_,13, 1) = 1 == 8192 00000000000001000000000000000000 6794 vec($_,14, 1) = 1 == 16384 00000000000000100000000000000000 6795 vec($_,15, 1) = 1 == 32768 00000000000000010000000000000000 6796 vec($_,16, 1) = 1 == 65536 00000000000000001000000000000000 6797 vec($_,17, 1) = 1 == 131072 00000000000000000100000000000000 6798 vec($_,18, 1) = 1 == 262144 00000000000000000010000000000000 6799 vec($_,19, 1) = 1 == 524288 00000000000000000001000000000000 6800 vec($_,20, 1) = 1 == 1048576 00000000000000000000100000000000 6801 vec($_,21, 1) = 1 == 2097152 00000000000000000000010000000000 6802 vec($_,22, 1) = 1 == 4194304 00000000000000000000001000000000 6803 vec($_,23, 1) = 1 == 8388608 00000000000000000000000100000000 6804 vec($_,24, 1) = 1 == 16777216 00000000000000000000000010000000 6805 vec($_,25, 1) = 1 == 33554432 00000000000000000000000001000000 6806 vec($_,26, 1) = 1 == 67108864 00000000000000000000000000100000 6807 vec($_,27, 1) = 1 == 134217728 00000000000000000000000000010000 6808 vec($_,28, 1) = 1 == 268435456 00000000000000000000000000001000 6809 vec($_,29, 1) = 1 == 536870912 00000000000000000000000000000100 6810 vec($_,30, 1) = 1 == 1073741824 00000000000000000000000000000010 6811 vec($_,31, 1) = 1 == 2147483648 00000000000000000000000000000001 6812 vec($_, 0, 2) = 1 == 1 10000000000000000000000000000000 6813 vec($_, 1, 2) = 1 == 4 00100000000000000000000000000000 6814 vec($_, 2, 2) = 1 == 16 00001000000000000000000000000000 6815 vec($_, 3, 2) = 1 == 64 00000010000000000000000000000000 6816 vec($_, 4, 2) = 1 == 256 00000000100000000000000000000000 6817 vec($_, 5, 2) = 1 == 1024 00000000001000000000000000000000 6818 vec($_, 6, 2) = 1 == 4096 00000000000010000000000000000000 6819 vec($_, 7, 2) = 1 == 16384 00000000000000100000000000000000 6820 vec($_, 8, 2) = 1 == 65536 00000000000000001000000000000000 6821 vec($_, 9, 2) = 1 == 262144 00000000000000000010000000000000 6822 vec($_,10, 2) = 1 == 1048576 00000000000000000000100000000000 6823 vec($_,11, 2) = 1 == 4194304 00000000000000000000001000000000 6824 vec($_,12, 2) = 1 == 16777216 00000000000000000000000010000000 6825 vec($_,13, 2) = 1 == 67108864 00000000000000000000000000100000 6826 vec($_,14, 2) = 1 == 268435456 00000000000000000000000000001000 6827 vec($_,15, 2) = 1 == 1073741824 00000000000000000000000000000010 6828 vec($_, 0, 2) = 2 == 2 01000000000000000000000000000000 6829 vec($_, 1, 2) = 2 == 8 00010000000000000000000000000000 6830 vec($_, 2, 2) = 2 == 32 00000100000000000000000000000000 6831 vec($_, 3, 2) = 2 == 128 00000001000000000000000000000000 6832 vec($_, 4, 2) = 2 == 512 00000000010000000000000000000000 6833 vec($_, 5, 2) = 2 == 2048 00000000000100000000000000000000 6834 vec($_, 6, 2) = 2 == 8192 00000000000001000000000000000000 6835 vec($_, 7, 2) = 2 == 32768 00000000000000010000000000000000 6836 vec($_, 8, 2) = 2 == 131072 00000000000000000100000000000000 6837 vec($_, 9, 2) = 2 == 524288 00000000000000000001000000000000 6838 vec($_,10, 2) = 2 == 2097152 00000000000000000000010000000000 6839 vec($_,11, 2) = 2 == 8388608 00000000000000000000000100000000 6840 vec($_,12, 2) = 2 == 33554432 00000000000000000000000001000000 6841 vec($_,13, 2) = 2 == 134217728 00000000000000000000000000010000 6842 vec($_,14, 2) = 2 == 536870912 00000000000000000000000000000100 6843 vec($_,15, 2) = 2 == 2147483648 00000000000000000000000000000001 6844 vec($_, 0, 4) = 1 == 1 10000000000000000000000000000000 6845 vec($_, 1, 4) = 1 == 16 00001000000000000000000000000000 6846 vec($_, 2, 4) = 1 == 256 00000000100000000000000000000000 6847 vec($_, 3, 4) = 1 == 4096 00000000000010000000000000000000 6848 vec($_, 4, 4) = 1 == 65536 00000000000000001000000000000000 6849 vec($_, 5, 4) = 1 == 1048576 00000000000000000000100000000000 6850 vec($_, 6, 4) = 1 == 16777216 00000000000000000000000010000000 6851 vec($_, 7, 4) = 1 == 268435456 00000000000000000000000000001000 6852 vec($_, 0, 4) = 2 == 2 01000000000000000000000000000000 6853 vec($_, 1, 4) = 2 == 32 00000100000000000000000000000000 6854 vec($_, 2, 4) = 2 == 512 00000000010000000000000000000000 6855 vec($_, 3, 4) = 2 == 8192 00000000000001000000000000000000 6856 vec($_, 4, 4) = 2 == 131072 00000000000000000100000000000000 6857 vec($_, 5, 4) = 2 == 2097152 00000000000000000000010000000000 6858 vec($_, 6, 4) = 2 == 33554432 00000000000000000000000001000000 6859 vec($_, 7, 4) = 2 == 536870912 00000000000000000000000000000100 6860 vec($_, 0, 4) = 4 == 4 00100000000000000000000000000000 6861 vec($_, 1, 4) = 4 == 64 00000010000000000000000000000000 6862 vec($_, 2, 4) = 4 == 1024 00000000001000000000000000000000 6863 vec($_, 3, 4) = 4 == 16384 00000000000000100000000000000000 6864 vec($_, 4, 4) = 4 == 262144 00000000000000000010000000000000 6865 vec($_, 5, 4) = 4 == 4194304 00000000000000000000001000000000 6866 vec($_, 6, 4) = 4 == 67108864 00000000000000000000000000100000 6867 vec($_, 7, 4) = 4 == 1073741824 00000000000000000000000000000010 6868 vec($_, 0, 4) = 8 == 8 00010000000000000000000000000000 6869 vec($_, 1, 4) = 8 == 128 00000001000000000000000000000000 6870 vec($_, 2, 4) = 8 == 2048 00000000000100000000000000000000 6871 vec($_, 3, 4) = 8 == 32768 00000000000000010000000000000000 6872 vec($_, 4, 4) = 8 == 524288 00000000000000000001000000000000 6873 vec($_, 5, 4) = 8 == 8388608 00000000000000000000000100000000 6874 vec($_, 6, 4) = 8 == 134217728 00000000000000000000000000010000 6875 vec($_, 7, 4) = 8 == 2147483648 00000000000000000000000000000001 6876 vec($_, 0, 8) = 1 == 1 10000000000000000000000000000000 6877 vec($_, 1, 8) = 1 == 256 00000000100000000000000000000000 6878 vec($_, 2, 8) = 1 == 65536 00000000000000001000000000000000 6879 vec($_, 3, 8) = 1 == 16777216 00000000000000000000000010000000 6880 vec($_, 0, 8) = 2 == 2 01000000000000000000000000000000 6881 vec($_, 1, 8) = 2 == 512 00000000010000000000000000000000 6882 vec($_, 2, 8) = 2 == 131072 00000000000000000100000000000000 6883 vec($_, 3, 8) = 2 == 33554432 00000000000000000000000001000000 6884 vec($_, 0, 8) = 4 == 4 00100000000000000000000000000000 6885 vec($_, 1, 8) = 4 == 1024 00000000001000000000000000000000 6886 vec($_, 2, 8) = 4 == 262144 00000000000000000010000000000000 6887 vec($_, 3, 8) = 4 == 67108864 00000000000000000000000000100000 6888 vec($_, 0, 8) = 8 == 8 00010000000000000000000000000000 6889 vec($_, 1, 8) = 8 == 2048 00000000000100000000000000000000 6890 vec($_, 2, 8) = 8 == 524288 00000000000000000001000000000000 6891 vec($_, 3, 8) = 8 == 134217728 00000000000000000000000000010000 6892 vec($_, 0, 8) = 16 == 16 00001000000000000000000000000000 6893 vec($_, 1, 8) = 16 == 4096 00000000000010000000000000000000 6894 vec($_, 2, 8) = 16 == 1048576 00000000000000000000100000000000 6895 vec($_, 3, 8) = 16 == 268435456 00000000000000000000000000001000 6896 vec($_, 0, 8) = 32 == 32 00000100000000000000000000000000 6897 vec($_, 1, 8) = 32 == 8192 00000000000001000000000000000000 6898 vec($_, 2, 8) = 32 == 2097152 00000000000000000000010000000000 6899 vec($_, 3, 8) = 32 == 536870912 00000000000000000000000000000100 6900 vec($_, 0, 8) = 64 == 64 00000010000000000000000000000000 6901 vec($_, 1, 8) = 64 == 16384 00000000000000100000000000000000 6902 vec($_, 2, 8) = 64 == 4194304 00000000000000000000001000000000 6903 vec($_, 3, 8) = 64 == 1073741824 00000000000000000000000000000010 6904 vec($_, 0, 8) = 128 == 128 00000001000000000000000000000000 6905 vec($_, 1, 8) = 128 == 32768 00000000000000010000000000000000 6906 vec($_, 2, 8) = 128 == 8388608 00000000000000000000000100000000 6907 vec($_, 3, 8) = 128 == 2147483648 00000000000000000000000000000001 6908 6909=item wait 6910X<wait> 6911 6912Behaves like the wait(2) system call on your system: it waits for a child 6913process to terminate and returns the pid of the deceased process, or 6914C<-1> if there are no child processes. The status is returned in C<$?>. 6915Note that a return value of C<-1> could mean that child processes are 6916being automatically reaped, as described in L<perlipc>. 6917 6918=item waitpid PID,FLAGS 6919X<waitpid> 6920 6921Waits for a particular child process to terminate and returns the pid of 6922the deceased process, or C<-1> if there is no such child process. On some 6923systems, a value of 0 indicates that there are processes still running. 6924The status is returned in C<$?>. If you say 6925 6926 use POSIX ":sys_wait_h"; 6927 #... 6928 do { 6929 $kid = waitpid(-1, WNOHANG); 6930 } until $kid > 0; 6931 6932then you can do a non-blocking wait for all pending zombie processes. 6933Non-blocking wait is available on machines supporting either the 6934waitpid(2) or wait4(2) system calls. However, waiting for a particular 6935pid with FLAGS of C<0> is implemented everywhere. (Perl emulates the 6936system call by remembering the status values of processes that have 6937exited but have not been harvested by the Perl script yet.) 6938 6939Note that on some systems, a return value of C<-1> could mean that child 6940processes are being automatically reaped. See L<perlipc> for details, 6941and for other examples. 6942 6943=item wantarray 6944X<wantarray> X<context> 6945 6946Returns true if the context of the currently executing subroutine or 6947C<eval> is looking for a list value. Returns false if the context is 6948looking for a scalar. Returns the undefined value if the context is 6949looking for no value (void context). 6950 6951 return unless defined wantarray; # don't bother doing more 6952 my @a = complex_calculation(); 6953 return wantarray ? @a : "@a"; 6954 6955C<wantarray()>'s result is unspecified in the top level of a file, 6956in a C<BEGIN>, C<CHECK>, C<INIT> or C<END> block, or in a C<DESTROY> 6957method. 6958 6959This function should have been named wantlist() instead. 6960 6961=item warn LIST 6962X<warn> X<warning> X<STDERR> 6963 6964Produces a message on STDERR just like C<die>, but doesn't exit or throw 6965an exception. 6966 6967If LIST is empty and C<$@> already contains a value (typically from a 6968previous eval) that value is used after appending C<"\t...caught"> 6969to C<$@>. This is useful for staying almost, but not entirely similar to 6970C<die>. 6971 6972If C<$@> is empty then the string C<"Warning: Something's wrong"> is used. 6973 6974No message is printed if there is a C<$SIG{__WARN__}> handler 6975installed. It is the handler's responsibility to deal with the message 6976as it sees fit (like, for instance, converting it into a C<die>). Most 6977handlers must therefore make arrangements to actually display the 6978warnings that they are not prepared to deal with, by calling C<warn> 6979again in the handler. Note that this is quite safe and will not 6980produce an endless loop, since C<__WARN__> hooks are not called from 6981inside one. 6982 6983You will find this behavior is slightly different from that of 6984C<$SIG{__DIE__}> handlers (which don't suppress the error text, but can 6985instead call C<die> again to change it). 6986 6987Using a C<__WARN__> handler provides a powerful way to silence all 6988warnings (even the so-called mandatory ones). An example: 6989 6990 # wipe out *all* compile-time warnings 6991 BEGIN { $SIG{'__WARN__'} = sub { warn $_[0] if $DOWARN } } 6992 my $foo = 10; 6993 my $foo = 20; # no warning about duplicate my $foo, 6994 # but hey, you asked for it! 6995 # no compile-time or run-time warnings before here 6996 $DOWARN = 1; 6997 6998 # run-time warnings enabled after here 6999 warn "\$foo is alive and $foo!"; # does show up 7000 7001See L<perlvar> for details on setting C<%SIG> entries, and for more 7002examples. See the Carp module for other kinds of warnings using its 7003carp() and cluck() functions. 7004 7005=item write FILEHANDLE 7006X<write> 7007 7008=item write EXPR 7009 7010=item write 7011 7012Writes a formatted record (possibly multi-line) to the specified FILEHANDLE, 7013using the format associated with that file. By default the format for 7014a file is the one having the same name as the filehandle, but the 7015format for the current output channel (see the C<select> function) may be set 7016explicitly by assigning the name of the format to the C<$~> variable. 7017 7018Top of form processing is handled automatically: if there is 7019insufficient room on the current page for the formatted record, the 7020page is advanced by writing a form feed, a special top-of-page format 7021is used to format the new page header, and then the record is written. 7022By default the top-of-page format is the name of the filehandle with 7023"_TOP" appended, but it may be dynamically set to the format of your 7024choice by assigning the name to the C<$^> variable while the filehandle is 7025selected. The number of lines remaining on the current page is in 7026variable C<$->, which can be set to C<0> to force a new page. 7027 7028If FILEHANDLE is unspecified, output goes to the current default output 7029channel, which starts out as STDOUT but may be changed by the 7030C<select> operator. If the FILEHANDLE is an EXPR, then the expression 7031is evaluated and the resulting string is used to look up the name of 7032the FILEHANDLE at run time. For more on formats, see L<perlform>. 7033 7034Note that write is I<not> the opposite of C<read>. Unfortunately. 7035 7036=item y/// 7037 7038The transliteration operator. Same as C<tr///>. See L<perlop>. 7039 7040=back 7041