bionic (1) rdup.1.gz

Provided by: rdup_1.1.15-1_amd64 bug

NAME

       rdup - generate a file list suitable for making backups

SYNOPSIS

       rdup [-N timestamp] -[-P CMD]...  [OPTION]...  FILELIST [DIR/FILE]...

DESCRIPTION

       rdup  is  a  utility  inspired  by  rsync and the Plan9 way of doing backups. rdup itself does not backup
       anything. It only prints a list of files that are changed, or all files in case of a null dump.  It  also
       handles files that are removed, allowing for correct incremental backups. All paths printed are absolute.
       rdup uses the change time (ctime) to decide whether a file is altered.

       It works as follows, for a full dump

       1.     Crawl all directories, and print all the names found to standard output.  Also check for  ._rdup_.
              files to restore the original ownership (see the -u flag).

       2.     Write  a  filelist with all the names found when crawling.  Use this list to calculate the correct
              incremental dump.

       And for incremental dumps

       1.     Read in the filelist that was written when doing a full dump.

       2.     Touch the time stamp file.

       3.     Crawl all the directories again.

       4.     Diff 1. and 2. to get two lists; one of removed items and one of added/modified items.

       5.     Write the removed items to standard output

       6.     Write the modified/new items to standard output.

       7.     Write a new filelist.

       The FILELIST is a internal list rdup writes to, to keep track of which files are  in  a  backup.  If  you
       don't want this (i.e. make a full backup), use /dev/null here. The file /dev/null is handled specially by
       rdup: if detected no new file list is written.

       The DIRS/FILES can be specified multiple times. These are the directories and files you want  to  backup.
       If omitted it defaults to the current directory "." .

       If  the  -N  timestamp option is not given, all paths found are printed. Only when a -N timestamp file is
       given, times can be compared and an incremental output can be generated.

       rdup prints a filelist to standard output.  Subsequent programs in a pipe line can be  used  to  actually
       implement  to backup scheme.  After a run a new FILELIST is written. No warning is given when FILELIST is
       an existing file, it just gets overwritten by rdup. New runs will print out only those  files  that  have
       actually changed or are removed since the last run, thereby making incremental backups possible.

       Files  are  checked  for  changes  by  comparing the c-time (change time), if this time is NEWER than the
       c-time of timestamp file the pathname is printed to standard output. When files are removed they are also
       printed  to  standard output, but they are prefixed with a '-'. See FORMAT below. The default format rdup
       uses is: "%p%T %b %t %u %U %g %G %l %s\n%n%C"

       Note, that rdup also supports hashing of files, this makes it possible to check the local hash  with  the
       hash of the backed up file.

       All  errors are written to standard error.  If the directory or file does not exist, they are skipped and
       a warning is emitted.

       The general idea is to be very UNIX like and create a bunch of simple programs  which  each  do  a  their
       specific  thing  very well. With rdup and a small shell script (50 lines) one can implement encrypted and
       compressed backups.

       As rdup doesn't backup anything, the backup policy; what you backup, how you backup, how  often  and  how
       you  restore;  is  all left to the scripts and your imagination. To kick start your imagination see rdup-
       tr(1), rdup-up(1) and maybe rdup-backups.

OPTIONS

       -P command
              Filter all output through command, multiple -P's can be used.

              Due to the nature of pipes in Unix, this pipeline is recreated for every file processed. Also  see
              'Child Processes' below.

       -F format
              Specify a printf-style format to use. See FORMAT below.

       -N timestamp
              use  the  c_time  of  file timestamp as the timestamp to decide what to include in the incremental
              backup list. If timestamp does not exist a full dump is performed.

       -M timestamp
              As -N, but look at the m_time of timestamp.

       -R     Reverse the output of rdup. Tools accepting this output must create leading directory as they  see
              them. This option allows a script -- running as a normal user -- to put files in a directory which
              could have 0600 as its permission.

       -E file
              The file named 'file' contains a list of Perl-compatible regular expressions (PCRE), one per line,
              that  rdup  will  use  to  exclude  names.  A '#' at the start of the line can be used to signal a
              comment.  Empty lines are ignored.

              If a directory is excluded, rdup won't descend in that directory, so all files in  that  directory
              are also excluded.

              The  directories  leading  up  to  the directory to be backed up can not be excluded. If you use a
              command line like:

                      rdup /dev/null /home/miekg/bin

              The directories '/home', '/home/miekg', '/home/miekg/bin' are always printed.

              If you want to exclude the  file  '/home/miekg/blaat'  you  need  to  add  the  following  regular
              expression: '/home/miekg/blaat'.

              If   you   want   to   exclude   all   .mozilla/cache   directories  of  all  users  you  can  use
              '/home/.*/.mozilla/cache/.*'. This doesn't exclude the directory itself and I'm assuming that  the
              users' home directories are found under '/home'.

              Also note that rdup does not print directories with a trailing slash.

       -a     Restore the original access times on files and directories.

       -n     Don't  honor  .nobackup  files.  Normally  if  such  a  file  is found the directory and all files
              containing it, are not printed to standard output. Now they are.

       -u     Output files with a ._rdup_. prefix just as normal files. When this  option  is  not  given  these
              files are skipped by rdup.

       -r     Only print removed files; entries that start with a `-'. This option unsets -m.

       -m     Only print modified/new files; entries that start with a `+'. This option unsets -r.

       -v     Be more verbose. When used each path will also be printed to standard error.

       -s size
              Don't  output  files  larger  than size bytes.  This can be used to limit the amount of data to be
              transferred when doing a remote backup.  This option only applies to files.

       -x     Stay on the local filesystem.

       -V     Print rdup's version.

       -h     Give an overview of the options.

   Child Processes (-P flag)
       When creating output you might also want to 'pipe'  the  contents  of  each  file  through  a  number  of
       commands,  say  a  compression  and  encryption utility. Note that this is different than compressing the
       entire archive as GNU tar allows by using the -z option.  So this is where rdup comes in. It  allows  you
       to  create  a normal archive in which each file is encrypted (or compressed.  reversed or whatever). rdup
       does this by forking child processes which transform the content.

       If one of the forked children returns an exit  code  other  than  zero  (0),  it  is  assumed  the  whole
       conversion process failed. In that case rdup terminates.

       As  said  rdup  works  by  forking  off  a  number  of  child processes (those commands named with the -P
       option(s)), interconnecting these with pipes. The current file is connected  to  the  first  child.   The
       output  created  by these child processes is captured by the parent (rdup).  The contents is then written
       to standard output in an archive format.  As a picture says more than a thousand words here is  an  ASCII
       image of the process:

                          +--- ...   (stdout)    ... ----> archive
                         /
                     rdup  <--- ...   ... <----+
                                               |
                 loop #files                   |
                                               |
                file ---> cmd1 | cmd2 | ...| cmdN

BACKUPS

       With:
               rm -f timestamp && rdup -N timestamp LIST DIR

       A full-dump filelist is printed to standard output. And with:

               rdup -N timestamp LIST DIR

       An incremental dump filelist is printed. The file timestamp is used to save the exact time of rdup's run.
       The file LIST is used to calculate the correct incremental dump list, this is needed for files  that  are
       removed, or have a different type.

NON-ROOT BACKUPS

       If  backups  are  made  by  a non-root user or on a filesystem that does not implement/allow chown (think
       sshfs mounted by an ordinary user), rdup creates a separate file which stores the correct user and  group
       information.  If  the  file's  name  is  foobar  a  new file called ._rdup_.foobar is created in the same
       directory. This file contains one line, for instance:

                  root:0/root:0

       That tells that the actual ownership should be root:root. For directories  a  ._rdup_.  file  is  created
       inside the current directory.

FORMAT

       The default format rdup uses is: "%p%T %b %t %u %U %g %G %l %s\n%n%C"

       The following escape sequences are understood by rdup:

               'p': '+' if file is new/modified, '-' if removed
               'b': permission bits from lstat(2), octal in four digits
               'm': the file mode bits, st_mode from lstat(2), decimal digits
               'u': uid
               'U': username
               'g': gid
               'G': groupname
               'l': path name length
               's': original file size, but see CAVEATS
               'n': path name
               'N': path name, but in case of a soft- or hardlink only the link name
               't': time of modification (seconds from epoch)
               'H': the SHA1 hash of the file, all zeros ("0") for all other types
               'T': file type
                     - normal file, l symlink, h hardlink, d directory,
                     c character device, b block device, p named pipe
                     and s socket
               'C': the content of the file (none for all other types)

       To delimit the output of rdup with NULLs you can use '\0' in the format string.

       Any  file content is written in a block/chunk based manner. The last block is signaled with a null block.
       A block start entry is ASCII and is formatted as follows: VVBLOCKBBBBB\n .  Where 'VV'  is  the  version,
       currently at '01', then the literal string 'BLOCK' and then the amount of bytes (BBBBB), typical '08192'.
       And then a newline.  This look like this:

               01BLOCK08192
               <START OF THE FIRST 8192 BYTES>01BLOCK00015
               <ANOTHER 15 BYTES>01BLOCK00000

       A byte count of zero signals a stop block.

FILELIST

       rdup writes the (internal) FILELIST in the following format:

              MODE DEV INODE LINK UID GID PATH_SIZE FILE_SIZE PATH

       Where MODE is the st_mode from stat(2), DEV is the dev id as returned by the stat call and INODE  is  the
       inode  number  -  rdup  needs  this  info  to  decide if a directory is renamed. LINK is equal to 'h' for
       hardlinks, 'l' for symlinks and otherwise it is '*'.  UID and GID are the numeric user and  group  id  of
       the  file. PATH_SIZE is the length of PATH. FILE_SIZE the file size.  And finally PATH is the path of the
       file.

       A typical example is:

              16893 2050 32085 * 1000 1000 30 4096 /home/miekg/git/rdup/.git/logs

OUTPUT FORMAT

       The output generated by rdup is formatted like:

               +|-TYPE BITS MTIME UID USER GID GROUP PATH_SIZE FILE_SIZE\n
               PATH FILE_CONTENTS

       This makes it possible possible for a remote shell script to receive the actual file contetns and make  a
       backup.

       For directories: the FILE_SIZE is zero and no content is printed. Thus:

               +d 0755 1260243445 1000 miekg 1000 miekg 11 0\n
               /home/miekg

       For regular files the following is a sample output:

               +- 0644 1260243445 1000 miekg 1000 miekg 32 6\n
               /home/miekg/svn/rdup/trunk/aaa/a01BLOCK00006\n
               hello\n
               01BLOCK00000\n

       Where aaa/a is a regular file containing the word 'hello\n'

   CAVEATS
       Soft-  and  hardlinks are handled differently when using %n, if you don't like this behavior use %N.  The
       PATH name is generated from the link's name and its target. A symlink like

               /home/bin/blaat -> /home/bin/bliep

       is printed as '/home/bin/blaat -> /home/bin/bliep'. The PATH_SIZE is modified accordingly, where '  ->  '
       (4  characters)  is  also  counted.  The FILE_SIZE is not needed for soft- or hardlinks, so it is set the
       length of the link's name -- the part left of the ' ->', in this case the length of '/home/bin/blaat'.

       If rdup encounters a hardlink it is handled in the same way, but the output type is set to 'h' instead of
       'l'. A hardlink is only detected if rdup finds a file with the same inode and device number as a previous
       one, i.e. such hardlinks must be contained in your backup.

       Again note: with '%N' only the link's name is printed. The FILE_SIZE is still set to the  length  of  the
       link's name.

   Device Files
       For  devices  the  size field (%s) is changed to hold the major,minor number of the device. So if a major
       number is 8 and the minor number is 0 (under Linux this is /dev/sda), its size will be 8,0.  The  numbers
       are only separated with a comma `,'.

   Symlinks
       You  will  probably  think  rdup will descend into the directory the symbolic link points to. This is not
       what actually happens, rdup will print any directories leading up to the symlink  and  will  not  descend
       into the directory. GNU tar works the same.

EXIT CODE

       rdup  return  a zero exit code on success, otherwise 1 is returned.  rdup will abort if a file can not be
       concatenated, if a regular expression can not be compiled or if a signal is received.

EXAMPLES

       The next set of examples will all make a full dump -- because of the use of /dev/null. See rdup-tr(1) for
       more advanced examples.

   rdup (mirroring)
       Backup:
               rdup /dev/null ~/bin | rdup-up -t /shared/backup
       Restore:
              rdup /dev/null /shared/backup | rdup-up -t /tmp/restore
       or
              cp -rap /shared/backup /tmp/restore

   rdup (archiving)
       Backup:
              rdup /dev/null ~/bin > my-archive.rdup
       Restore:
              rdup-up -t /tmp/restore < my-archive.rdup

   cpio
       Backup:
              rdup -R -F '%N\n' /dev/null ~/bin | cpio -o -Hcrc > my-archive.cpio
       Restore:
              cpio -i -d -Hcrc < my-archive.cpio

   tar
       Backup:
              rdup -F '%N\n' /dev/null ~/bin | tar c -f my-archive.tar -T - --no-recursion
       Restore:
              tar x -f my-archive.tar

AUTHOR

       Written by Miek Gieben.

REPORTING BUGS

       Report bugs to <miek@miek.nl>.

SEE ALSO

       http:/www.miek.nl/projects/rdup  is  the  main  site  of  rdup. Also see rdup-tr(1), rdup-up(1) and rdup-
       backups(7).

       Copyright (C) 2005-2011 Miek Gieben.  This  is  free  software.  There  is  NO  warranty;  not  even  for
       MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

       Licensed under the GPL version 3. See the file LICENSE in the source distribution of rdup.