Provided by: libparanoid-perl_2.10-1_all bug

NAME

       Paranoid::IO - Paranoid IO support

VERSION

       $Id: lib/Paranoid/IO.pm, 2.10 2022/03/08 00:01:04 acorliss Exp $

SYNOPSIS

         use Fcntl qw(:DEFAULT :flock :mode :seek);
         use Paranoid::IO qw(:all);

         # Implicit open
         $chars = pread("./foo.log", $in, 2048);

         # Implcit write/append
         $chars = pwrite("./bar.log", $out);
         $chars = pappend("./bar.log", $out);

         # Adjust block read size
         PIOBLKSIZE = 8192;

         # Adjust max file size for file scans
         PIOMAXFSIZE = 65536;

         # Enable flock lock stack
         PIOLOCKSTACK = 1;

         # Explicit open with explicit locking
         $fh = popen($filename, O_RDWR | O_CREAT | O_TRUNC, 0600);
         $rv = pseek($filename, 0, SEEK_END);
         $rv = pflock($filename, LOCK_EX);
         if ($rv > 0) {
           pseek($filename, 0, SEEK_SET) && ptruncate($filename);
         }
         $rv = pwrite($filename, $text)
         $rv = ptell($filename);
         $rv = plockstat($filename);

         # Calls that ignore file locks
         $rv = pnlwrite($filename, $text)
         $rv = pnlappend($filename, $text)
         $rv = pnlread($filename, $text)

         # After fork
         $rv = preopen();

         $rv = pclose($filename);
         $rv = pcloseAll();

DESCRIPTION

       Paranoid::IO is intended to make basic file I/O access easier while keeping with the tenets of paranoid
       programming.  Most of these calls are essentially wrappers for the basic system calls (exposed as
       sysopen, syswrite, etc.) with some additional logic to reduce the need to explicitly code every step of
       normal safe access rules, such as file locking.  In the most basic of usage patterns, even explicitly
       opening files isn't necessary.

       For the most part the system calls that are wrapped here act identically as the underlying calls, both in
       the arguments they take and the values they return.  The one notable difference, however, is the popen
       function itself.  A glob variable isn't passed for assignation since this module stores those references
       internally along with some meta data, so popen returns file handles directly.

       That semantic, however, is what gives the rest of the functions the flexibility of accepting either a
       file name or a file handle to work on.  In the case of file names some of these functions can open files
       automatically, and the rest of the features are granted automatically.

       In the case of passing file handles the full feature set of this module is only available if the file
       handle was originally opened with popen.  The calls will still work even if it wasn't, but some of the
       safety features, like being fork-safe, won't have the meta data to work properly.

       The features provided by this module are:

       •   Opportunistic file access

       •   File handle caching

       •   Fork-safe file access

       •   Inherent file locking

       •   O_APPEND access patterns where needed even for files not opened with O_APPEND

       •   Intelligent file tracking

       •   Optional flock lock stack for transactional I/O patterns

       The following sections will explain each feature in more depth:

   Opportunistic file access
       Opportunistic  file  access  is  defined  here  as  not needing the explicit I/O handling for what can be
       implied.  For instance, to read content from a file one can simply use the pread function without  having
       to  open  and  apply  a shared file lock.  In a similar manner one should be able to write or append to a
       file.  Files are automatically opened (with the file mode being intuited by the type of call) as  needed.
       Only  where  more  complicated access patterns (such as read/write file handles) should an explicit popen
       call be needed.

       Opportunism is limited to where it makes sense, however.  Files are not opportunistically opened  if  the
       first I/O call is to pseek or ptell.  The intent of the file I/O (in regards to read/write file modes) is
       impossible to tell within those calls.

   File handle caching
       This  module  provides  a  replacement  for  Perl's  internal  sysopen,  which  should be used even where
       read/write file access is necessary.  One key benefit for doing so is  that  it  provides  internal  file
       handle  caching  based  on  the  file  name.  All the additional functions provided by this module use it
       internally to retrieve that cached file handle to avoid the overhead of repetitive opening and closing of
       files.

   Fork-safe file access
       A greater benefit of popen, however, is in it's fork-safe behavior.  Every call checks to see if the file
       handle it has was inherited from its parent, and if so, transparently closed  and  reopened  so  I/O  can
       continue  without both processes conflicting over cursor positions and buffers.  After files are reopened
       read cursors are placed at the same location they were prior to the first I/O access in the child.

       File modes are preserved without the obvious conflicts of intent as well.  Files  opened  in  the  parent
       with O_TRUNC are reopened without that flag to prevent content from being clobbered.

   Inherent file locking
       Except  where  explicitly  ignored  (like for pnlread) all read, write, and append operations use locking
       internally, alleviating the need for the developer to do so explicitly.  Locks are applied and removed as
       quickly as possible to facilitate concurrent access.

       If you're managing flocks directly, however, all of the read/write functions  in  this  module  not  only
       support  an  option  boolean  argument to disable internal flocking, but also have pnl* wrapper functions
       that set that argument for you.

   O_APPEND access patterns
       pappend allows you to mimic O_APPEND access patterns even for files that weren't explicitly  opened  with
       O_APPEND.   If you open a file with O_RDWR you can still call pappend and the content will be appended to
       the end of the file, without moving the file's cursor position for regular reads and writes.

   Intelligent file tracking
       popen caches file handles by file name.  If files are opened with relative paths this has  the  potential
       to  cause  some  confusion  if  the  process  or  children  are  changing  their working directories.  In
       anticipation of this popen also tracks the real path (as resolved by the realpath system call)  and  file
       name.   This way you can still access the same file regardless of the process or its children's movements
       on the file system.

       This could be, however, a double-edged sword if your program intends to open identically named  files  in
       multiple  locations.   If  that  is your intent you would be cautioned to avoid using relative paths with
       popen.

   Optional flock lock stack for transactional I/O patterns
       Complex I/O patterns on file I/O can sometimes extensive  nested  function  calls  that  each  manipulate
       flocks  independently.   Those  nested  calls can come into conflict when one call degrades a needed lock
       applied by a previous call.

       For instance, a pattern where a new block needs to be allocated to an opened file, but an index of blocks
       must be maintained within the same file.  One might have a function which retrieves  the  list  of  block
       addresses from the index, and that function rationally applies a shared flock before reading, and removes
       it  afterwards.   One  might  try to get an exclusive lock on the file, then retrieve the index using the
       existing function. That function, however, would end up replacing your exclusive  lock  with  the  shared
       lock,  potentially making it impossible to reacquire that exclusive lock depending on other processes and
       their I/O.

       The lock stack attempts to solve those kinds of problems by maintaining a stack  of  flocks,  and  making
       sure  that  no new locks degrade the previous locks.  In previous example, it would notice that the stack
       was opened with an exclusive lock, and when the index retrieval function attempts  to  apply  the  shared
       lock,  it  would simply upgrade that lock to preserve the exclusive lock.  Since a stack tracks each call
       to pflock(), once that function attempts to release the shared lock, the lock stack would simply pop  off
       it's upgraded call from the stack, and make sure the preceding lock stays in place.

       Another way to describe this in psuedo code:

           # Enable the lock stack
           PIOLOCKSTACK = 1;

           sub readIdx {
               pflock($file, LOCK_SH);
               # ... read data
               pflock($file, LOCK_UN);
               # ... return data
           }

           sub writeIdx {
               pflock($file, LOCK_EX);
               # ... write data
               pflock($file, LOCK_UN);
           }

           sub writeData {
               pflock($file, LOCK_EX);
               # ... write data
               pflock($file, LOCK_UN);
           }

           sub writeTx {
               pflock($file, LOCK_EX);
               readIdx();
               writeData();
               writeIdx();
               pflock($file, LOCK_UN);
           }

           # Execute the transaction
           writeTx();

       Without the lock stack, executing the transaction function would cause the following to happen:

           writeTx:
               # apply LOCK_EX
               # readIdx:
                   # apply LOCK_SH
                   # read data
                   # release all locks w/LOCK_UN
               # writeData:
                   # apply LOCK_EX
                   # ERROR: any write decisions at this point based on the previous
                   # ERROR: index read may cause file corruption because the index
                   # ERROR: may have changed while this process was waiting to
                   # ERROR: reacquire the exclusive lock!

       With the lock stack in place, however, it goes like this:

           writeTx:
               # apply LOCK_EX
                   # lock stack: (LOCK_EX)
               # readIdx:
                   # asks for LOCK_SH, but maintains LOCK_EX
                       # lock stack: (LOCK_EX, LOCK_EX)
                   # read data
                   # deletes its lock from the stack, but preserves the previous lock
                       # lock stack: (LOCK_EX)
               # writeData:
                   # asks for LOCK_EX
                       # lock stack: (LOCK_EX, LOCK_EX)
                   # writes data
                   # deletes its lock from the stack, but preserves the previous lock
                       # lock stack: (LOCK_EX)
               # writeIdx:
                   # asks for LOCK_EX
                       # lock stack: (LOCK_EX, LOCK_EX)
                   # writes data
                   # deletes its lock from the stack, but preserves the previous lock
                       # lock stack: (LOCK_EX)
               # release lock
                   # lock stack: ()

       At  no  point  was  the  advisory  lock  lost,  and  hence, transactional integrity was preserved for all
       compliant processes.

       The lock stack is off by default to allow the developer complete control over locking and  I/O  patterns,
       but  it's there to make functions easier to write without having to worry about any locks applied outside
       of their code scope.

       One downside of the lock stack is that affects all I/O performed via the Paranoid::IO  framework,  it  is
       not  locallized to specific file handles.  For that reason, one must be confident that flocks are applied
       as atomically as possible throughout the code space leveraging it.

IMPORT LISTS

       This module exports the following symbols by default:

           pclose pcloseAll popen preopen ptell pseek pflock
           plockstat pread pnlread pwrite pappend ptruncate

       The following specialized import lists also exist:

           List        Members
           ---------------------------------------------------------
           all         @defaults PIOBLKSIZE PIOMAXFSIZE PIOLOCKSTACK

SUBROUTINES/METHODS

   PIOBLKSIZE
           PIOBLKSIZE = 65536;

       This lvalue function is not exported by default.  It is used to determine the default block size to  read
       when  a  size  is not explicitly passed.  Depending on hardware and file system parameters there might be
       performance gains to be had when doing default-sized reads.

       The default is 4096, which is generally a safe size for most applications.

   PIOMAXFSIZE
           PIOMAXFSIZE = 131072;

       This lvalue function is not exported by default.  It is used to determine the maximum file size that will
       be read.  This is not used in this module, but provided for use in dependent modules  that  may  want  to
       impose file size limits, such as Paranoid::IO::Line and others.

       The default is 65536.

   PIOLOCKSTACK
           PIOLOCKSTACK = 1

       This lvalue function is not exported by default.  It is used to enable the flock lock stack functionality
       in order to support transactional I/O patterns.  It is disabled by default.

   popen
           $fh = popen($filename);
           $fh = popen($filename, $mode);
           $fh = popen($filename, $mode, $perms);
           $fh = popen($fh);

       Returns  a  file  handle  if  the  file could be opened.  If the mode is omitted the default is O_CREAT |
       O_RDWR.  File permissions (for newly created files) default to 0666 ^ UMASK.

       Failures to open a file will result in an undef return value, with a text description of the fault stored
       in Paranoid::ERROR.

       If a file handle is passed to popen it will attempt to  match  it  to  a  tracked  file  handle  and,  if
       identified,  take  the  appropriate  action.   If  it doesn't match any tracked file handles it will just
       return that file handle back to the caller.

   pclose
           $rv = pclose($filename);
           $rv = pclose($fh);

       Returns the value from close.  Attempts to close a file that's already closed is  considered  a  success,
       and  true  value  is  returned.   Handing it a stale file handle, however, will be handed to the internal
       close, with all the expected results.

   preopen
           $rv = preopen();
           $rv = preopen(@filenames);
           $rv = preopen(@filehandles);

       This checks each tracked file handle (i.e., file handles that were opened by popen) and reopens  them  if
       necessary.   This  is  typically only useful after a fork.  It is also not striclty necessary since every
       call to a function in this module does that with every invocation, but if you have several  file  handles
       that  you  may  not  access  immediately  you run the risk of the parent moving the current file position
       before the child gets back to those files.  You may or may not  care.   If  you  do,  use  this  function
       immediately after a fork.

       Called  with  a list of file names means that only those files are examined and reopened.  Any failure to
       reopen any single file handle will result in a false return value.  That  said,  any  failures  will  not
       interrupt the function from trying every file in the list.

   pcloseAll
           $rv = pcloseAll();
           $rv = pcloseAll(@filenames);
           $rv = pcloseAll(@filehandles);

       This  function  returns  a  boolean  value  denoting  any errors while trying to close every tracked file
       handle.  This function is also not strictly necessary for all the normal Perl I/O reasons, but it's  here
       for those that want to be explicit.

   ptell
           $pos = ptell($filename);
           $pos = ptell($fh);

       Returns  the  current  position of the file cursor.  Returns the results of sysseek, which means that any
       successful seek is true, even if the cursor is at the beginning of the file.  In that instance it returns
       "0 but true" which is boolean true while converting to an integer appropriately.

       Any failures are returned as false or undef.

   pseek
           $rv = pseek($filename, $pos, $whence);
           $rv = pseek($fh, $pos, $whence);

       This returns the return value from sysseek.  The appropriate whence values sould be  one  of  the  SEEK_*
       constants as exported by Fcntl.

   pflock
           $rv = pflock($filename, $locktype);
           $rv = pflock($fh, $locktype);

       This  returns  the return value from flock.  The appropriate lock type values should be one of the LOCK_*
       constants as exported by Fcntl.

       NOTE: This function essentially acts like a pass-through to the native flock function for any file handle
       not opened via this module's functions.

   plockstat
           $lock = plockstat($filename);

       This returns the last flock applied via pflock.

   pread
           $bytes = pread($filename, $text, $length);
           $bytes = pread($filename, $text, $length, $offset);
           $bytes = pread($fh, $text, $length);
           $bytes = pread($fh, $text, $length, $offset);

       This returns the number of bytes read, or undef on errors.  If this is called prior to an explicit  popen
       it will default to a mode of O_RDONLY.  Length defaults to PIOBLKSIZE.

   pnlread
           $bytes = pnlread($filename, $text, $length);
           $bytes = pnlread($filename, $text, $length, $offset);
           $bytes = pnlread($fh, $text, $length);
           $bytes = pnlread($fh, $text, $length, $offset);

       This  is  a  wrapper function for pread that calls it with inherent file locking disabled.  It is assumed
       that the dev is either managing flocks directly, or they're not needed for this application.

   pwrite
           $bytes = pwrite($filename, $text);
           $bytes = pwrite($filename, $text, $length);
           $bytes = pwrite($filename, $text, $length, $offset);
           $bytes = pwrite($filename, $text, $length, $nolock);
           $bytes = pwrite($fh, $text);
           $bytes = pwrite($fj, $text, $length);
           $bytes = pwrite($fh, $text, $length, $offset);
           $bytes = pwrite($fh, $text, $length, $offset, $nolock);

       This returns the number of bytes written, or undef for any critical failures.  If this is called prior to
       an explicit popen it uses a default mode of O_WRONLY | O_CREAT | O_TRUNC.

       The optional boolean fifth argument (nolock)  will  bypass  automatic  flocks  since  it  assumes  you're
       managing the lock directly.

   pnlwrite
           $bytes = pnlwrite($filename, $text);
           $bytes = pnlwrite($filename, $text, $length);
           $bytes = pnlwrite($filename, $text, $length, $offset);
           $bytes = pnlwrite($fh, $text);
           $bytes = pnlwrite($fj, $text, $length);
           $bytes = pnlwrite($fh, $text, $length, $offset);

       This  is  a wrapper function for pwrite that calls it with inherent file locking disabled.  It is assumed
       that the dev is either managing flocks directly, or they're not needed for this application.

   pappend
           $bytes = pappend($filename, $text);
           $bytes = pappend($filename, $text, $length);
           $bytes = pappend($filename, $text, $length, $offset);
           $bytes = pappend($filename, $text, $length, $offset, $nolock);
           $bytes = pappend($fh, $text);
           $bytes = pappend($fh, $text, $length);
           $bytes = pappend($fh, $text, $length, $offset, $nolock);

       This behaves identically to pwrite with the sole exception that this preserves the  file  position  after
       explicitly seeking and writing to the end of the file.  The default mode here, however, would be O_WRONLY
       | O_CREAT | O_APPEND for those files not explicitly opened.

       The  optional  boolean  fifth  argument  (nolock)  will  bypass  automatic flocks since it assumes you're
       managing the lock directly.

   pnlappend
           $bytes = pnlappend($filename, $text);
           $bytes = pnlappend($filename, $text, $length);
           $bytes = pnlappend($filename, $text, $length, $offset);
           $bytes = pnlappend($fh, $text);
           $bytes = pnlappend($fj, $text, $length);
           $bytes = pnlappend($fh, $text, $length, $offset);

       This is a wrapper function for pappend that calls it with inherent file locking disabled.  It is  assumed
       that the dev is either managing flocks directly, or they're not needed for this application.

   ptruncate
           $rv = ptruncate($filename);
           $rv = ptruncate($filename, $pos, $nolock);
           $rv = ptruncate($fh);
           $rv = ptruncate($fh, $pos, $nolock);

       This  returns the result of the internal truncate call.  If called without an explicit popen it will open
       the named file with the default mode of O_RDWR | O_CREAT.  Omitting the position to  truncate  from  will
       result in the file being truncated at the beginning of the file.

       The  optional  boolean  third  argument  (nolock)  will  bypass  automatic flocks since it assumes you're
       managing the lock directly.

   pnltruncate
           $rv = pnltruncate($filename);
           $rv = pnltruncate($fh);

       This is a wrapper function for pnltruncate that calls it with inherent  file  locking  disabled.   It  is
       assumed that the dev is either managing flocks directly, or they're not needed for this application.

DEPENDENCIES

       o   Cwd

       o   Fcntl

       o   IO::Handle

       o   Paranoid

       o   Paranoid::Debug

       o   Paranoid::Input

BUGS AND LIMITATIONS

       It  may  not always be benficial to cache file handles.  You must explicitly pclose file handles to avoid
       that.  That said, with straight Perl you'd have to either  explicitly  close  the  file  handles  or  use
       lexical  scoping, anyway.  From that perspective I don't find it onerous to do so, especially with all of
       the other code-saving features this module provides.

AUTHOR

       Arthur Corliss (corliss@digitalmages.com)

LICENSE AND COPYRIGHT

       This software is free software.  Similar to Perl, you can redistribute it  and/or  modify  it  under  the
       terms of either:

         a)     the GNU General Public License
                <https://www.gnu.org/licenses/gpl-1.0.html> as published by the
                Free Software Foundation <http://www.fsf.org/>; either version 1
                <https://www.gnu.org/licenses/gpl-1.0.html>, or any later version
                <https://www.gnu.org/licenses/license-list.html#GNUGPL>, or
         b)     the Artistic License 2.0
                <https://opensource.org/licenses/Artistic-2.0>,

       subject  to  the  following additional term:  No trademark rights to "Paranoid" have been or are conveyed
       under any of the above licenses.  However, "Paranoid" may be used  fairly  to  describe  this  unmodified
       software, in good faith, but not as a trademark.

       (c)   2005   -   2021,  Arthur  Corliss  (corliss@digitalmages.com)  (tm)  2008  -  2021,  Paranoid  Inc.
       (www.paranoid.com)

perl v5.34.0                                       2022-03-13                                  Paranoid::IO(3pm)