Provided by: manpages-posix_2.16-1_all bug

NAME

       od - dump files in various formats

SYNOPSIS

       od [-v][-A address_base][-j skip][-N count][-t type_string]...
              [file...]

       od [-bcdosx][file] [[+]offset[.][b]]

DESCRIPTION

       The od utility shall write the contents of its input files to standard output in a user-specified format.

OPTIONS

       The  od  utility  shall  conform  to  the  Base Definitions volume of IEEE Std 1003.1-2001, Section 12.2,
       Utility Syntax Guidelines, except that the order of presentation of the -t  options     and  the  -bcdosx
       options is significant.

       The following options shall be supported:

       -A  address_base

              Specify the input offset base. See the EXTENDED DESCRIPTION section.  The application shall ensure
              that  the  address_base option-argument is a character. The characters 'd' , 'o' , and 'x' specify
              that the offset base shall be  written  in  decimal,  octal,  or  hexadecimal,  respectively.  The
              character 'n' specifies that the offset shall not be written.

       -b     Interpret bytes in octal. This shall be equivalent to -t o1.

       -c     Interpret  bytes  as characters specified by the current setting of the LC_CTYPE category. Certain
              non-graphic characters appear as C escapes: "NUL=\0" , "BS=\b" , "FF=\f" ,  "NL=\n"  ,  "CR=\r"  ,
              "HT=\t" ; others appear as 3-digit octal numbers.

       -d     Interpret words (two-byte units) in unsigned decimal. This shall be equivalent to -t u2.

       -j  skip
              Jump  over  skip bytes from the beginning of the input. The od utility shall read or seek past the
              first skip bytes in the concatenated input files. If the combined input is not at least skip bytes
              long, the od utility shall write a diagnostic message to standard error and exit with  a  non-zero
              exit status.

       By  default,  the skip option-argument shall be interpreted as a decimal number. With a leading 0x or 0X,
       the offset shall be interpreted as a hexadecimal number; otherwise, with a leading '0' , the offset shall
       be interpreted as an octal number. Appending the character 'b' , 'k' , or 'm' to offset shall cause it to
       be interpreted as a multiple of 512, 1024,  or  1048576  bytes,  respectively.  If  the  skip  number  is
       hexadecimal, any appended 'b' shall be considered to be the final hexadecimal digit.

       -N  count
              Format  no  more  than  count  bytes of input. By default, count shall be interpreted as a decimal
              number. With a leading 0x or 0X, count shall be interpreted as a  hexadecimal  number;  otherwise,
              with  a  leading  '0' , it shall be interpreted as an octal number. If count bytes of input (after
              successfully skipping, if -j skip is specified) are not available, it shall not be  considered  an
              error; the od utility shall format the input that is available.

       -o     Interpret words (two-byte units) in octal. This shall be equivalent to -t o2.

       -s     Interpret words (two-byte units) in signed decimal. This shall be equivalent to -t d2.

       -t  type_string

              Specify  one  or  more  output types. See the EXTENDED DESCRIPTION section.  The application shall
              ensure that the type_string option-argument is a string specifying  the  types  to  be  used  when
              writing  the input data. The string shall consist of the type specification characters a , c , d ,
              f , o , u , and x , specifying named character, character, signed decimal, floating point,  octal,
              unsigned decimal, and hexadecimal, respectively. The type specification characters d , f , o , u ,
              and  x  can be followed by an optional unsigned decimal integer that specifies the number of bytes
              to be transformed by each instance of the output type. The type specification character f  can  be
              followed  by  an optional F , D , or L indicating that the conversion should be applied to an item
              of type float, double, or long double, respectively. The type specification characters d , o , u ,
              and x can be followed by an optional C , S , I , or L indicating that  the  conversion  should  be
              applied  to  an  item  of  type  char,  short,  int,  or long, respectively. Multiple types can be
              concatenated within the same type_string and multiple -t options can be  specified.  Output  lines
              shall  be  written for each type specified in the order in which the type specification characters
              are specified.

       -v     Write all input data. Without the -v option, any number of groups of output lines, which would  be
              identical  to the immediately preceding group of output lines (except for the byte offsets), shall
              be replaced with a line containing only an asterisk ( '*' ).

       -x     Interpret words (two-byte units) in hexadecimal. This shall be equivalent to -t x2.

       Multiple types can be specified by using multiple -bcdostx options.  Output lines are  written  for  each
       type specified in the order in which the types are specified.

OPERANDS

       The following operands shall be supported:

       file   A  pathname  of  a file to be read. If no file operands are specified, the standard input shall be
              used.

       If there are no more than two operands, none of the -A, -j, -N, or -t options is specified, and either of
       the following is true: the first character of the last operand is a plus sign ( '+' ), or there  are  two
       operands and the first character of the last operand is numeric;    the last operand shall be interpreted
       as  an  offset operand on XSI-conformant systems.  Under these conditions, the results are unspecified on
       systems that are not XSI-conformant systems.

       [+]offset[.][b]
              The offset operand specifies the offset in the file where dumping is to commence. This operand  is
              normally  interpreted  as  octal  bytes.  If  '.'  is appended, the offset shall be interpreted in
              decimal. If 'b' is appended, the offset shall be interpreted in units of 512 bytes.

STDIN

       The standard input shall be used only if no file operands are specified. See the INPUT FILES section.

INPUT FILES

       The input files can be any file type.

ENVIRONMENT VARIABLES

       The following environment variables shall affect the execution of od:

       LANG   Provide a default value for the internationalization variables that are unset or  null.  (See  the
              Base  Definitions  volume of IEEE Std 1003.1-2001, Section 8.2, Internationalization Variables for
              the  precedence  of  internationalization  variables  used  to  determine  the  values  of  locale
              categories.)

       LC_ALL If  set  to  a  non-empty  string value, override the values of all the other internationalization
              variables.

       LC_CTYPE
              Determine the locale for the interpretation of sequences of bytes of text data as characters  (for
              example, single-byte as opposed to multi-byte characters in arguments and input files).

       LC_MESSAGES
              Determine  the locale that should be used to affect the format and contents of diagnostic messages
              written to standard error.

       LC_NUMERIC

              Determine the locale for selecting the radix character used when writing floating-point  formatted
              output.

       NLSPATH
              Determine the location of message catalogs for the processing of LC_MESSAGES .

ASYNCHRONOUS EVENTS

       Default.

STDOUT

       See the EXTENDED DESCRIPTION section.

STDERR

       The standard error shall be used only for diagnostic messages.

OUTPUT FILES

       None.

EXTENDED DESCRIPTION

       The  od  utility  shall copy sequentially each input file to standard output, transforming the input data
       according to the output types specified by the -t option    or the -bcdosx options.  If no output type is
       specified, the default output shall be as if -t oS had been specified.

       The number of bytes transformed by the output type specifier c may be variable depending on the  LC_CTYPE
       category.

       The  default  number  of bytes transformed by output type specifiers d , f , o , u , and x corresponds to
       the various C-language types as follows. If the c99 compiler is present on the system,  these  specifiers
       shall  correspond  to  the  sizes used by default in that compiler. Otherwise, these sizes may vary among
       systems that conform to IEEE Std 1003.1-2001.

        * For the type specifier characters d , o , u , and x , the default number of bytes shall correspond  to
          the  size  of  the underlying implementation's basic integer type. For these specifier characters, the
          implementation shall support values of the optional number of bytes to be converted  corresponding  to
          the  number  of  bytes  in  the C-language types char, short, int, and long. These numbers can also be
          specified by an application as the characters 'C'  ,  'S'  ,  'I'  ,  and  'L'  ,  respectively.   The
          implementation  shall  also support the values 1, 2, 4, and 8, even if it provides no C-Language types
          of those sizes.  The implementation shall support the decimal value corresponding  to  the  C-language
          type  long  long.  The byte order used when interpreting numeric values is implementation-defined, but
          shall correspond to the order in which a constant of the corresponding type is stored in memory on the
          system.

        * For the type specifier character f , the default number of bytes shall correspond  to  the  number  of
          bytes  in  the  underlying  implementation's  basic  double  precision  floating-point  data type. The
          implementation shall support values of the optional number of bytes to be converted  corresponding  to
          the  number of bytes in the C-language types float, double, and long double. These numbers can also be
          specified by an application as the characters 'F' , 'D' , and 'L' , respectively.

       The type specifier character a specifies that bytes shall be interpreted as  named  characters  from  the
       International  Reference Version (IRV) of the ISO/IEC 646:1991 standard. Only the least significant seven
       bits of each byte shall be used for this  type  specification.  Bytes  with  the  values  listed  in  the
       following table shall be written using the corresponding names for those characters.

                                             Table: Named Characters in od
                                Value  Name  Value  Name  Value  Name      Value  Name
                                \000   nul   \001   soh   \002   stx       \003   etx
                                \004   eot   \005   enq   \006   ack       \007   bel
                                \010   bs    \011   ht    \012   lf or nl  \013   vt
                                \014   ff    \015   cr    \016   so        \017   si
                                \020   dle   \021   dc1   \022   dc2       \023   dc3
                                \024   dc4   \025   nak   \026   syn       \027   etb
                                \030   can   \031   em    \032   sub       \033   esc
                                \034   fs    \035   gs    \036   rs        \037   us
                                \040   sp    \177   del

       Note:  The "\012" value may be written either as lf or nl.

       The  type  specifier character c specifies that bytes shall be interpreted as characters specified by the
       current setting of the LC_CTYPE locale category. Characters listed in the table in the  Base  Definitions
       volume of IEEE Std 1003.1-2001, Chapter 5, File Format Notation ( '\\' , '\a' , '\b' , '\f' , '\n' , '\r'
       ,  '\t'  ,  '\v' ) shall be written as the corresponding escape sequences, except that backslash shall be
       written as a single backslash and a NUL shall be written as '\0' . Other non-printable  characters  shall
       be  written  as one three-digit octal number for each byte in the character. If the size of a byte on the
       system is greater than nine bits, the format used for non-printable characters is implementation-defined.
       Printable multi-byte characters shall be written in the area corresponding  to  the  first  byte  of  the
       character;  the  two-character sequence "**" shall be written in the area corresponding to each remaining
       byte in the character, as an indication that the character is continued. When either the -j  skip  or  -N
       count  option  is  specified  along with the c type specifier, and this results in an attempt to start or
       finish in the middle of a multi-byte character, the result is implementation-defined.

       The input data shall be manipulated in blocks, where a block is defined as a multiple of the least common
       multiple of the number of bytes transformed by the specified output types. If the least  common  multiple
       is  greater  than  16,  the results are unspecified.  Each input block shall be written as transformed by
       each output type, one per written line, in the order that the output types were specified. If  the  input
       block  size  is  larger  than  the  number of bytes transformed by the output type, the output type shall
       sequentially transform the parts of the input block, and the output  from  each  of  the  transformations
       shall be separated by one or more <blank>s.

       If,  as  a  result  of  the specification of the -N option or end-of-file being reached on the last input
       file, input data only partially satisfies an output type, the input shall be extended  sufficiently  with
       null bytes to write the last byte of the input.

       Unless  -A n  is  specified, the first output line produced for each input block shall be preceded by the
       input offset, cumulative across input files, of the next byte to be written.  The  format  of  the  input
       offset  is unspecified; however, it shall not contain any <blank>s, shall start at the first character of
       the output line, and shall be followed by one or more <blank>s. In  addition,  the  offset  of  the  byte
       following  the  last byte written shall be written after all the input data has been processed, but shall
       not be followed by any <blank>s.

       If no -A option is specified, the input offset base is unspecified.

EXIT STATUS

       The following exit values shall be returned:

        0     All input files were processed successfully.

       >0     An error occurred.

CONSEQUENCES OF ERRORS

       Default.

       The following sections are informative.

APPLICATION USAGE

       XSI-conformant applications are warned not to use filenames starting with '+' or a first operand starting
       with a numeric character so that the old functionality can be maintained by implementations, unless  they
       specify  one of the -A, -j, or -N options. To guarantee that one of these filenames is always interpreted
       as a filename, an application could always specify the address base format with the -A option.

EXAMPLES

       If a file containing 128 bytes with decimal values zero to 127,  in  increasing  order,  is  supplied  as
       standard input to the command:

              od -A d -t a

       on  an  implementation  using  an  input  block size of 16 bytes, the standard output, independent of the
       current locale setting, would be similar to:

              0000000 nul soh stx etx eot enq ack bel  bs  ht  nl  vt  ff  cr  so  si
              0000016 dle dc1 dc2 dc3 dc4 nak syn etb can  em sub esc  fs  gs  rs  us
              0000032  sp   !   "   #   $   %   &   '   (   )   *   +   ,   -   .  /
              0000048   0   1   2   3   4   5   6   7   8   9   :   ;   <   =   >   ?
              0000064   @   A   B   C   D   E   F   G   H   I   J   K   L   M   N   O
              0000080   P   Q   R   S   T   U   V   W   X   Y   Z   [   \   ]   ^   _
              0000096   `   a   b   c   d   e   f   g   h   i   j   k   l   m   n   o
              0000112   p   q   r   s   t   u   v   w   x   y   z   {   |   }   ~ del
              0000128

       Note that this volume of  IEEE Std 1003.1-2001  allows  nl  or  lf  to  be  used  as  the  name  for  the
       ISO/IEC 646:1991  standard  IRV  character  with  decimal value 10. The IRV names this character lf (line
       feed), but traditional implementations have referred to this character as newline (  nl)  and  the  POSIX
       locale character set symbolic name for the corresponding character is a <newline>.

       The command:

              od -A o -t o2x2x -N 18

       on  a system with 32-bit words and an implementation using an input block size of 16 bytes could write 18
       bytes in approximately the following format:

              0000000 032056 031440 041123 042040 052516 044530 020043 031464
                        342e   3320   4253   4420   554e   4958   2023   3334
                           342e3320      42534420      554e4958      20233334
              0000020 032472
                        353a
                           353a0000
              0000022

       The command:

              od -A d -t f -t o4 -t x4 -N 24 -j 0x15

       on a system with 64-bit doubles (for example, IEEE Std 754-1985 double precision  floating-point  format)
       would skip 21 bytes of input data and then write 24 bytes in approximately the following format:

              0000000    1.00000000000000e+00    1.57350000000000e+01
                      07774000000 00000000000 10013674121 35341217270
                         3ff00000    00000000    402f3851    eb851eb8
              0000016    1.40668230000000e+02
                      10030312542 04370303230
                         40619562    23e18698
              0000024

RATIONALE

       The  od  utility  went  through  several  names  in  early proposals, including hd, xd, and most recently
       hexdump. There were several objections to all of these based on the following reasons:

        * The hd and xd names conflicted with historical utilities that behaved differently.

        * The hexdump description was much more complex than needed for a simple dump utility.

        * The od utility has been available on all historical implementations and there was no need to create  a
          new name for a utility so similar to the historical od utility.

       The  original  reasons for not standardizing historical od were also fairly widespread. Those reasons are
       given below along with rationale explaining why the standard developers believe that  this  version  does
       not suffer from the indicated problem:

        * The  BSD  and System V versions of od have diverged, and the intersection of features provided by both
          does not meet the needs of the user community. In fact, the System V version only provides a mechanism
          for dumping octal bytes and shorts, signed and unsigned decimal shorts, hexadecimal shorts, and  ASCII
          characters.  BSD  added the ability to dump floats, doubles, named ASCII characters, and octal, signed
          decimal, unsigned decimal, and hexadecimal longs.  The version presented here provides more normalized
          forms for dumping bytes, shorts, ints, and longs in  octal,  signed  decimal,  unsigned  decimal,  and
          hexadecimal; float, double, and long double; and named ASCII as well as current locale characters.

        * It  would not be possible to come up with a compatible superset of the BSD and System V flags that met
          the requirements of the standard developers. The historical default od output is the specified default
          output of this utility. None of the option letters chosen for this version of od conflict with any  of
          the options to historical versions of od.

        * On  systems  with different sizes for short, int, and long, there was no way to ask for dumps of ints,
          even in the BSD version. Because of the way options are named, the name space could not be extended to
          solve these problems. This is why the -t option was added (with type specifiers more  closely  matched
          to  the  printf()  formats  used  in the rest of this volume of IEEE Std 1003.1-2001) and the optional
          field sizes were added to the d , f , o , u , and x type specifiers. It is also one of the reasons why
          the historical practice was not mandated as a required obsolescent  form  of  od.  (Although  the  old
          versions  of  od  are  not  listed  as  an  obsolescent form, implementations are urged to continue to
          recognize the older forms for several more years.) The a , c , f , o , and x types match  the  meaning
          of  the corresponding format characters in the historical implementations of od except for the default
          sizes of the fields converted. The d format is signed in this volume of IEEE Std 1003.1-2001 to  match
          the  printf()  notation.  (Historical  versions  of  od used d as a synonym for u in this version. The
          System V implementation uses s for signed decimal; BSD uses i for  signed  decimal  and  s  for  null-
          terminated  strings.)  Other  than d and u , all of the type specifiers match format characters in the
          historical BSD version of od.

       The sizes of the C-language types char, short, int, long, float, double, and long double  are  used  even
       though  it  is  recognized  that  there  may  be  zero or more than one compiler for the C language on an
       implementation and that they may use different sizes for some of these types. (For example, one  compiler
       might  use  2  bytes shorts, 2 bytes ints, and 4 bytes longs, while another compiler (or an option to the
       same compiler) uses 2 bytes shorts, 4 bytes ints, and 4 bytes longs.)  Nonetheless, there  has  to  be  a
       basic  size  known  by  the  implementation  for  these  types,  corresponding  to the values reported by
       invocations of the getconf utility  when  called  with  system_var  operands  {UCHAR_MAX},  {USHORT_MAX},
       {UINT_MAX},  and  {ULONG_MAX}  for  the types char, short, int, and long, respectively. There are similar
       constants required by  the  ISO C  standard,  but  not  required  by  the  System  Interfaces  volume  of
       IEEE Std 1003.1-2001 or this volume of IEEE Std 1003.1-2001. They are {FLT_MANT_DIG}, {DBL_MANT_DIG}, and
       {LDBL_MANT_DIG}  for  the types float, double, and long double, respectively. If the optional c99 utility
       is provided by the implementation and used as specified by this volume of IEEE Std 1003.1-2001, these are
       the sizes that would be provided. If an option is used that specifies different sizes  for  these  types,
       there  is  no  guarantee  that  the  od utility is able to interpret binary data output by such a program
       correctly.

       This volume of IEEE Std 1003.1-2001 requires that the numeric values of these lengths  be  recognized  by
       the od utility and that symbolic forms also be recognized. Thus, a conforming application can always look
       at an array of unsigned long data elements using od -t uL.

        * The  method  of specifying the format for the address field based on specifying a starting offset in a
          file unnecessarily tied the two together. The -A option now specifies the  address  base  and  the  -S
          option specifies a starting offset.

        * It  would  be difficult to break the dependence on U.S. ASCII to achieve an internationalized utility.
          It does not seem to be any harder for od to dump characters in the current locale than it is  for  the
          ed  or  sed l commands. The c type specifier does this without difficulty and is completely compatible
          with the historical implementations of the c format character when the current locale uses a  superset
          of  the ISO/IEC 646:1991 standard as a codeset. The a type specifier (from the BSD a format character)
          was left as a portable means to dump ASCII (or more correctly ISO/IEC 646:1991 standard (IRV)) so that
          headers produced by pax could be deciphered even on systems  that  do  not  use  the  ISO/IEC 646:1991
          standard as a subset of their base codeset.

       The  use  of  "**"  as  an indication of continuation of a multi-byte character in c specifier output was
       chosen based on seeing an implementation that uses this method. The continuation bytes have to be  marked
       in a way that is not ambiguous with another single-byte or multi-byte character.

       An early proposal used -S and -n, respectively, for the -j and -N options eventually selected. These were
       changed to avoid conflicts with historical implementations.

       The  original  standard specified -t o2 as the default when no output type was given. This was changed to
       -t oS (the length of a short) to accommodate a supercomputer implementation  that  historically  used  64
       bits  as  its  default  (and  that  defined  shorts as 64 bits). This change should not affect conforming
       applications. The requirement to support lengths of 1, 2, and 4 was added at the same time to address  an
       historical implementation that had no two-byte data types in its C compiler.

       The  use  of  a  basic  integer  data  type is intended to allow the implementation to choose a word size
       commonly used by applications on that architecture.

FUTURE DIRECTIONS

       All option and operand interfaces marked as extensions may be withdrawn in a future version.

SEE ALSO

       c99 , sed

COPYRIGHT

       Portions of this text are reprinted and reproduced in electronic form from IEEE Std 1003.1, 2003 Edition,
       Standard for Information Technology -- Portable Operating System Interface (POSIX), The Open  Group  Base
       Specifications Issue 6, Copyright (C) 2001-2003 by the Institute of Electrical and Electronics Engineers,
       Inc  and  The  Open Group. In the event of any discrepancy between this version and the original IEEE and
       The Open Group Standard, the original IEEE and The Open Group  Standard  is  the  referee  document.  The
       original Standard can be obtained online at http://www.opengroup.org/unix/online.html .

IEEE/The Open Group                                   2003                                                 OD(P)