Provided by: manpages-posix_2.16-1_all bug

NAME

       od - dump files in various formats

SYNOPSIS

       od [-v][-A address_base][-j skip][-N count][-t type_string]...
              [file...]

       od [-bcdosx][file] [[+]offset[.][b]]

DESCRIPTION

       The  od  utility shall write the contents of its input files to standard output in a user-
       specified format.

OPTIONS

       The od utility shall conform to  the  Base  Definitions  volume  of  IEEE Std 1003.1-2001,
       Section  12.2,  Utility Syntax Guidelines, except that the order of presentation of the -t
       options    and the -bcdosx options is significant.

       The following options shall be supported:

       -A  address_base

              Specify  the  input  offset  base.  See  the  EXTENDED  DESCRIPTION  section.   The
              application  shall ensure that the address_base option-argument is a character. The
              characters 'd' , 'o' , and 'x' specify that the offset base  shall  be  written  in
              decimal,  octal, or hexadecimal, respectively. The character 'n' specifies that the
              offset shall not be written.

       -b     Interpret bytes in octal. This shall be equivalent to -t o1.

       -c     Interpret bytes as characters specified by the  current  setting  of  the  LC_CTYPE
              category.  Certain non-graphic characters appear as C escapes: "NUL=\0" , "BS=\b" ,
              "FF=\f" , "NL=\n" , "CR=\r" , "HT=\t" ; others appear as 3-digit octal numbers.

       -d     Interpret words (two-byte units) in unsigned decimal. This shall be  equivalent  to
              -t u2.

       -j  skip
              Jump  over skip bytes from the beginning of the input. The od utility shall read or
              seek past the first skip bytes in the concatenated input  files.  If  the  combined
              input  is  not  at  least  skip bytes long, the od utility shall write a diagnostic
              message to standard error and exit with a non-zero exit status.

       By default, the skip option-argument shall be interpreted as  a  decimal  number.  With  a
       leading 0x or 0X, the offset shall be interpreted as a hexadecimal number; otherwise, with
       a leading '0' , the offset  shall  be  interpreted  as  an  octal  number.  Appending  the
       character  'b'  , 'k' , or 'm' to offset shall cause it to be interpreted as a multiple of
       512, 1024, or 1048576 bytes, respectively. If the skip number is hexadecimal, any appended
       'b' shall be considered to be the final hexadecimal digit.

       -N  count
              Format no more than count bytes of input. By default, count shall be interpreted as
              a decimal number. With a leading  0x  or  0X,  count  shall  be  interpreted  as  a
              hexadecimal  number;  otherwise, with a leading '0' , it shall be interpreted as an
              octal number. If count bytes of input (after successfully skipping, if -j  skip  is
              specified)  are  not available, it shall not be considered an error; the od utility
              shall format the input that is available.

       -o     Interpret words (two-byte units) in octal. This shall be equivalent to -t o2.

       -s     Interpret words (two-byte units) in signed decimal. This  shall  be  equivalent  to
              -t d2.

       -t  type_string

              Specify  one  or  more  output  types.  See  the EXTENDED DESCRIPTION section.  The
              application  shall  ensure  that  the  type_string  option-argument  is  a   string
              specifying  the  types  to  be  used  when writing the input data. The string shall
              consist of the type specification characters a , c , d , f ,  o  ,  u  ,  and  x  ,
              specifying  named  character,  character,  signed  decimal,  floating point, octal,
              unsigned decimal, and hexadecimal, respectively. The type specification  characters
              d  , f , o , u , and x can be followed by an optional unsigned decimal integer that
              specifies the number of bytes to be transformed by  each  instance  of  the  output
              type.  The type specification character f can be followed by an optional F , D , or
              L indicating that the conversion should be  applied  to  an  item  of  type  float,
              double, or long double, respectively. The type specification characters d , o , u ,
              and x can be followed by an optional C  ,  S  ,  I  ,  or  L  indicating  that  the
              conversion  should  be  applied  to  an  item  of  type  char, short, int, or long,
              respectively. Multiple types can be concatenated within the  same  type_string  and
              multiple  -t  options can be specified. Output lines shall be written for each type
              specified in the order in which the type specification characters are specified.

       -v     Write all input data. Without the -v option, any number of groups of output  lines,
              which would be identical to the immediately preceding group of output lines (except
              for the byte offsets), shall be replaced with a line containing only an asterisk  (
              '*' ).

       -x     Interpret words (two-byte units) in hexadecimal. This shall be equivalent to -t x2.

       Multiple  types  can  be  specified  by using multiple -bcdostx options.  Output lines are
       written for each type specified in the order in which the types are specified.

OPERANDS

       The following operands shall be supported:

       file   A pathname of a file to be read. If no file operands are  specified,  the  standard
              input shall be used.

       If  there  are  no  more  than  two  operands,  none  of  the -A, -j, -N, or -t options is
       specified, and either of the following is true: the first character of the last operand is
       a plus sign ( '+' ), or there are two operands and the first character of the last operand
       is numeric;    the last operand  shall  be  interpreted  as  an  offset  operand  on  XSI-
       conformant  systems.   Under these conditions, the results are unspecified on systems that
       are not XSI-conformant systems.

       [+]offset[.][b]
              The offset operand specifies the offset in the file where dumping is  to  commence.
              This operand is normally interpreted as octal bytes. If '.' is appended, the offset
              shall be  interpreted  in  decimal.  If  'b'  is  appended,  the  offset  shall  be
              interpreted in units of 512 bytes.

STDIN

       The  standard  input  shall  be used only if no file operands are specified. See the INPUT
       FILES section.

INPUT FILES

       The input files can be any file type.

ENVIRONMENT VARIABLES

       The following environment variables shall affect the execution of od:

       LANG   Provide a default value for the internationalization variables that  are  unset  or
              null.  (See  the  Base  Definitions  volume  of  IEEE Std 1003.1-2001, Section 8.2,
              Internationalization Variables for the precedence of internationalization variables
              used to determine the values of locale categories.)

       LC_ALL If  set  to  a  non-empty  string  value,  override  the  values  of  all the other
              internationalization variables.

       LC_CTYPE
              Determine the locale for the interpretation of sequences of bytes of text  data  as
              characters  (for  example,  single-byte  as  opposed  to  multi-byte  characters in
              arguments and input files).

       LC_MESSAGES
              Determine the locale that should be used to  affect  the  format  and  contents  of
              diagnostic messages written to standard error.

       LC_NUMERIC

              Determine  the locale for selecting the radix character used when writing floating-
              point formatted output.

       NLSPATH
              Determine the location of message catalogs for the processing of LC_MESSAGES .

ASYNCHRONOUS EVENTS

       Default.

STDOUT

       See the EXTENDED DESCRIPTION section.

STDERR

       The standard error shall be used only for diagnostic messages.

OUTPUT FILES

       None.

EXTENDED DESCRIPTION

       The od utility shall copy sequentially each input file to  standard  output,  transforming
       the  input data according to the output types specified by the -t option    or the -bcdosx
       options.  If no output type is specified, the default output shall be as if -t oS had been
       specified.

       The  number  of bytes transformed by the output type specifier c may be variable depending
       on the LC_CTYPE category.

       The default number of bytes transformed by output type specifiers d , f , o , u  ,  and  x
       corresponds  to the various C-language types as follows. If the c99 compiler is present on
       the system, these specifiers shall correspond  to  the  sizes  used  by  default  in  that
       compiler.   Otherwise,   these   sizes   may   vary   among   systems   that   conform  to
       IEEE Std 1003.1-2001.

        * For the type specifier characters d , o , u , and x , the default number of bytes shall
          correspond to the size of the underlying implementation's basic integer type. For these
          specifier characters, the implementation shall support values of the optional number of
          bytes  to  be  converted  corresponding  to the number of bytes in the C-language types
          char, short, int, and long. These numbers can also be specified by  an  application  as
          the characters 'C' , 'S' , 'I' , and 'L' , respectively.  The implementation shall also
          support the values 1, 2, 4, and 8, even if it provides no  C-Language  types  of  those
          sizes.   The  implementation  shall  support  the decimal value corresponding to the C-
          language type long long. The byte  order  used  when  interpreting  numeric  values  is
          implementation-defined,  but  shall  correspond to the order in which a constant of the
          corresponding type is stored in memory on the system.

        * For the type specifier character f , the default number of bytes  shall  correspond  to
          the number of bytes in the underlying implementation's basic double precision floating-
          point data type. The implementation shall support values  of  the  optional  number  of
          bytes  to  be  converted  corresponding  to the number of bytes in the C-language types
          float, double, and long double. These numbers can also be specified by  an  application
          as the characters 'F' , 'D' , and 'L' , respectively.

       The  type  specifier  character  a  specifies  that  bytes  shall  be interpreted as named
       characters  from  the  International  Reference  Version  (IRV)  of  the  ISO/IEC 646:1991
       standard.  Only  the least significant seven bits of each byte shall be used for this type
       specification. Bytes with the values listed in the following table shall be written  using
       the corresponding names for those characters.

                                     Table: Named Characters in od

                         Value  Name  Value  Name  Value  Name      Value  Name
                         \000   nul   \001   soh   \002   stx       \003   etx
                         \004   eot   \005   enq   \006   ack       \007   bel
                         \010   bs    \011   ht    \012   lf or nl  \013   vt
                         \014   ff    \015   cr    \016   so        \017   si
                         \020   dle   \021   dc1   \022   dc2       \023   dc3
                         \024   dc4   \025   nak   \026   syn       \027   etb
                         \030   can   \031   em    \032   sub       \033   esc
                         \034   fs    \035   gs    \036   rs        \037   us
                         \040   sp    \177   del

       Note:  The "\012" value may be written either as lf or nl.

       The  type  specifier  character  c specifies that bytes shall be interpreted as characters
       specified by the current setting of the LC_CTYPE locale category. Characters listed in the
       table  in  the  Base  Definitions  volume  of IEEE Std 1003.1-2001, Chapter 5, File Format
       Notation ( '\\' , '\a' , '\b' , '\f' , '\n' , '\r' , '\t' , '\v' ) shall be written as the
       corresponding  escape  sequences,  except  that  backslash  shall  be  written as a single
       backslash and a NUL shall be written as '\0' . Other  non-printable  characters  shall  be
       written  as  one three-digit octal number for each byte in the character. If the size of a
       byte on the system is greater than nine bits, the format used for non-printable characters
       is  implementation-defined.   Printable multi-byte characters shall be written in the area
       corresponding to the first byte of the character; the two-character sequence "**" shall be
       written  in  the  area  corresponding  to  each  remaining  byte  in  the character, as an
       indication that the character is continued. When either the -j skip or -N count option  is
       specified  along  with  the  c  type specifier, and this results in an attempt to start or
       finish in the middle of a multi-byte character, the result is implementation-defined.

       The input data shall be manipulated in blocks, where a block is defined as a  multiple  of
       the  least  common  multiple  of  the  number of bytes transformed by the specified output
       types. If the least common multiple is greater than 16, the results are unspecified.  Each
       input  block shall be written as transformed by each output type, one per written line, in
       the order that the output types were specified. If the input block size is larger than the
       number  of  bytes  transformed  by  the  output  type,  the output type shall sequentially
       transform the parts of the input block, and the output from each  of  the  transformations
       shall be separated by one or more <blank>s.

       If,  as a result of the specification of the -N option or end-of-file being reached on the
       last input file, input data only partially satisfies an output type, the  input  shall  be
       extended sufficiently with null bytes to write the last byte of the input.

       Unless  -A n  is  specified,  the first output line produced for each input block shall be
       preceded by the input offset, cumulative across input  files,  of  the  next  byte  to  be
       written.  The format of the input offset is unspecified; however, it shall not contain any
       <blank>s, shall start at the first character of the output line, and shall be followed  by
       one  or more <blank>s. In addition, the offset of the byte following the last byte written
       shall be written after all the input data has been processed, but shall not be followed by
       any <blank>s.

       If no -A option is specified, the input offset base is unspecified.

EXIT STATUS

       The following exit values shall be returned:

        0     All input files were processed successfully.

       >0     An error occurred.

CONSEQUENCES OF ERRORS

       Default.

       The following sections are informative.

APPLICATION USAGE

       XSI-conformant  applications  are warned not to use filenames starting with '+' or a first
       operand starting with a numeric character so that the old functionality can be  maintained
       by  implementations,  unless  they  specify one of the -A, -j, or -N options. To guarantee
       that one of these filenames is always interpreted as  a  filename,  an  application  could
       always specify the address base format with the -A option.

EXAMPLES

       If  a  file  containing 128 bytes with decimal values zero to 127, in increasing order, is
       supplied as standard input to the command:

              od -A d -t a

       on an implementation using  an  input  block  size  of  16  bytes,  the  standard  output,
       independent of the current locale setting, would be similar to:

              0000000 nul soh stx etx eot enq ack bel  bs  ht  nl  vt  ff  cr  so  si
              0000016 dle dc1 dc2 dc3 dc4 nak syn etb can  em sub esc  fs  gs  rs  us
              0000032  sp   !   "   #   $   %   &   '   (   )   *   +   ,   -   .  /
              0000048   0   1   2   3   4   5   6   7   8   9   :   ;   <   =   >   ?
              0000064   @   A   B   C   D   E   F   G   H   I   J   K   L   M   N   O
              0000080   P   Q   R   S   T   U   V   W   X   Y   Z   [   \   ]   ^   _
              0000096   `   a   b   c   d   e   f   g   h   i   j   k   l   m   n   o
              0000112   p   q   r   s   t   u   v   w   x   y   z   {   |   }   ~ del
              0000128

       Note  that  this volume of IEEE Std 1003.1-2001 allows nl or lf to be used as the name for
       the ISO/IEC 646:1991 standard IRV character with decimal value  10.  The  IRV  names  this
       character  lf (line feed), but traditional implementations have referred to this character
       as newline ( nl) and the POSIX locale character set symbolic name  for  the  corresponding
       character is a <newline>.

       The command:

              od -A o -t o2x2x -N 18

       on  a system with 32-bit words and an implementation using an input block size of 16 bytes
       could write 18 bytes in approximately the following format:

              0000000 032056 031440 041123 042040 052516 044530 020043 031464
                        342e   3320   4253   4420   554e   4958   2023   3334
                           342e3320      42534420      554e4958      20233334
              0000020 032472
                        353a
                           353a0000
              0000022

       The command:

              od -A d -t f -t o4 -t x4 -N 24 -j 0x15

       on a system with 64-bit doubles (for example, IEEE Std 754-1985 double precision floating-
       point  format)  would skip 21 bytes of input data and then write 24 bytes in approximately
       the following format:

              0000000    1.00000000000000e+00    1.57350000000000e+01
                      07774000000 00000000000 10013674121 35341217270
                         3ff00000    00000000    402f3851    eb851eb8
              0000016    1.40668230000000e+02
                      10030312542 04370303230
                         40619562    23e18698
              0000024

RATIONALE

       The od utility went through several names in early proposals, including hd, xd,  and  most
       recently  hexdump.  There  were  several objections to all of these based on the following
       reasons:

        * The hd and xd names conflicted with historical utilities that behaved differently.

        * The hexdump description was much more complex than needed for a simple dump utility.

        * The od utility has been available on all historical implementations and  there  was  no
          need to create a new name for a utility so similar to the historical od utility.

       The  original  reasons  for  not  standardizing historical od were also fairly widespread.
       Those reasons are given below along with rationale explaining why the standard  developers
       believe that this version does not suffer from the indicated problem:

        * The  BSD  and  System  V versions of od have diverged, and the intersection of features
          provided by both does not meet the needs of the user community. In fact, the  System  V
          version  only  provides  a  mechanism  for  dumping  octal bytes and shorts, signed and
          unsigned decimal shorts, hexadecimal  shorts,  and  ASCII  characters.  BSD  added  the
          ability  to  dump  floats,  doubles, named ASCII characters, and octal, signed decimal,
          unsigned decimal, and hexadecimal longs.  The  version  presented  here  provides  more
          normalized  forms  for dumping bytes, shorts, ints, and longs in octal, signed decimal,
          unsigned decimal, and hexadecimal; float, double, and long double; and named  ASCII  as
          well as current locale characters.

        * It  would not be possible to come up with a compatible superset of the BSD and System V
          flags that met the requirements of the standard developers. The historical  default  od
          output  is  the  specified  default  output of this utility. None of the option letters
          chosen for this version of od conflict with any of the options to  historical  versions
          of od.

        * On  systems  with different sizes for short, int, and long, there was no way to ask for
          dumps of ints, even in the BSD version. Because of the way options are named, the  name
          space  could  not  be  extended  to solve these problems. This is why the -t option was
          added (with type specifiers more closely matched to the printf() formats  used  in  the
          rest of this volume of IEEE Std 1003.1-2001) and the optional field sizes were added to
          the d , f , o , u , and x type specifiers. It is  also  one  of  the  reasons  why  the
          historical  practice  was  not mandated as a required obsolescent form of od. (Although
          the old versions of od are not listed as an obsolescent form, implementations are urged
          to  continue  to recognize the older forms for several more years.) The a , c , f , o ,
          and x types match the meaning of the corresponding format characters in the  historical
          implementations  of  od  except  for  the  default sizes of the fields converted. The d
          format is signed in this volume of IEEE Std 1003.1-2001 to match the printf() notation.
          (Historical  versions  of  od  used  d as a synonym for u in this version. The System V
          implementation uses s for signed decimal; BSD uses i for signed decimal and s for null-
          terminated  strings.)  Other  than  d  and  u , all of the type specifiers match format
          characters in the historical BSD version of od.

       The sizes of the C-language types char, short, int, long, float, double, and  long  double
       are used even though it is recognized that there may be zero or more than one compiler for
       the C language on an implementation and that they may use  different  sizes  for  some  of
       these  types.  (For  example,  one  compiler might use 2 bytes shorts, 2 bytes ints, and 4
       bytes longs, while another compiler (or an option to  the  same  compiler)  uses  2  bytes
       shorts, 4 bytes ints, and 4 bytes longs.)  Nonetheless, there has to be a basic size known
       by the implementation for these types, corresponding to the values reported by invocations
       of  the  getconf  utility  when called with system_var operands {UCHAR_MAX}, {USHORT_MAX},
       {UINT_MAX}, and {ULONG_MAX} for the types char, short, int, and long, respectively.  There
       are  similar  constants  required  by  the  ISO C standard, but not required by the System
       Interfaces volume of IEEE Std 1003.1-2001 or this volume of IEEE Std 1003.1-2001. They are
       {FLT_MANT_DIG},  {DBL_MANT_DIG}, and {LDBL_MANT_DIG} for the types float, double, and long
       double, respectively. If the optional c99 utility is provided by  the  implementation  and
       used  as  specified by this volume of IEEE Std 1003.1-2001, these are the sizes that would
       be provided. If an option is used that specifies different sizes for these types, there is
       no guarantee that the od utility is able to interpret binary data output by such a program
       correctly.

       This volume of IEEE Std 1003.1-2001 requires that the numeric values of these  lengths  be
       recognized  by  the  od  utility  and  that  symbolic  forms  also  be recognized. Thus, a
       conforming application can always look at an array of unsigned long data elements using od
       -t uL.

        * The  method  of  specifying  the  format  for  the  address field based on specifying a
          starting offset in a file unnecessarily tied  the  two  together.  The  -A  option  now
          specifies the address base and the -S option specifies a starting offset.

        * It   would  be  difficult  to  break  the  dependence  on  U.S.  ASCII  to  achieve  an
          internationalized utility. It does not seem to be any harder for od to dump  characters
          in  the  current  locale  than it is for the ed or sed l commands. The c type specifier
          does  this  without  difficulty  and  is  completely  compatible  with  the  historical
          implementations  of  the  c format character when the current locale uses a superset of
          the ISO/IEC 646:1991 standard as a codeset. The a type specifier (from the BSD a format
          character)   was   left   as  a  portable  means  to  dump  ASCII  (or  more  correctly
          ISO/IEC 646:1991 standard (IRV)) so that headers produced by pax  could  be  deciphered
          even on systems that do not use the ISO/IEC 646:1991 standard as a subset of their base
          codeset.

       The use of "**" as an indication of continuation of a multi-byte character in c  specifier
       output  was  chosen  based  on  seeing  an  implementation  that  uses  this  method.  The
       continuation bytes have to be marked in a way that is not ambiguous with  another  single-
       byte or multi-byte character.

       An  early  proposal  used  -S  and  -n, respectively, for the -j and -N options eventually
       selected. These were changed to avoid conflicts with historical implementations.

       The original standard specified -t o2 as the default when no output type was  given.  This
       was changed to -t oS (the length of a short) to accommodate a supercomputer implementation
       that historically used 64 bits as its default (and that defined shorts as 64  bits).  This
       change should not affect conforming applications. The requirement to support lengths of 1,
       2, and 4 was added at the same time to address an historical implementation  that  had  no
       two-byte data types in its C compiler.

       The  use  of a basic integer data type is intended to allow the implementation to choose a
       word size commonly used by applications on that architecture.

FUTURE DIRECTIONS

       All option and operand interfaces marked as  extensions  may  be  withdrawn  in  a  future
       version.

SEE ALSO

       c99 , sed

COPYRIGHT

       Portions  of  this  text  are  reprinted  and  reproduced in electronic form from IEEE Std
       1003.1, 2003 Edition, Standard for Information Technology  --  Portable  Operating  System
       Interface  (POSIX), The Open Group Base Specifications Issue 6, Copyright (C) 2001-2003 by
       the Institute of Electrical and Electronics Engineers, Inc and  The  Open  Group.  In  the
       event  of  any  discrepancy  between this version and the original IEEE and The Open Group
       Standard, the original IEEE and The Open Group  Standard  is  the  referee  document.  The
       original Standard can be obtained online at http://www.opengroup.org/unix/online.html .