Ubuntu Manpage: Image::MetaData::JPEG::Structures - This document describes the structure of a JPEG file; it is an

Provided by: libimage-metadata-jpeg-perl_0.153-1_all

NAME

       Image::MetaData::JPEG::Structures - This document describes the structure of a JPEG file; it is an
       appendix to the main manual page of the Image::MetaData::JPEG module, which the reader should refer to
       for further details and the general scope.

THE STRUCTURE OF JPEG PICTURES

       The JPEG (Joint Photographic Experts Group) ISO standard is a lossy image compression mechanism developed
       before 1990, designed for compressing colour or gray-scale photo-realistic images (it performs poorly on
       line-art drawings). The JPEG mechanism is designed to exploit known limitations of the human eye, in
       particular the fact that colour changes are perceived less accurately than changes in brightness. Note
       that although the "baseline" JPEG format is patent-free, there are patents associated with some optional
       features, namely arithmetic coding and hierarchical storage. For this reason, these optional features are
       never used on the Web (good job, patent fans ...). JPEG refers only to a class of compression algorithms,
       not to a specific file format. In order to produce files with embedded JPEG streams, a number of file
       format (semi-)standards have been adapted or devised, some of which are listed in the format section. The
       structure of a well formed JPEG file can be described by the following pseudo production rules (for sake
       of simplicity, some additional constraints between tables and SOF segments are neglected).

               JPEG        --> (SOI)(misc)*(image)?(EOI)
               (image)     --> (hierarch.)|(non-hier.)
               (hierarch.) --> (DHP)(frame)+
               (frame)     --> (misc)*(EXP)?(non-hier.)
               (non-hier.) --> (SOF)(scan)+
               (scan)      --> (misc)*(SOS)(data)*(ECS)(DNL)?
               (data)      --> (ECS)(RST)
               (misc)      --> (DQT)|(DHT)|(DAC)|(DRI)|(COM)|(APP)

               (SOI) = Start Of Image
               (EOI) = End Of Image
               (SOF) = Start Of Frame header (10 types)
               (SOS) = Start Of Scan header
               (ECS) = Entropy Coded Segment (row data, not a real segment)
               (DNL) = Define Number of Lines segment
               (DHP) = Define Hierarchical P??? segment
               (EXP) = EXPansion segment
               (RST) = ReSTart segment (8 types)
               (DQT) = Define Quantisation Table
               (DHT) = Define Huffman coding Table
               (DAC) = Define Arithmetic coding Table
               (DRI) = Define Restart Interval
               (COM) = COMment segment
               (APP) = APPlication segment

       This package does not check that a JPEG file is really correct; it accepts a looser syntax, were segments
       and ECS blocks are just contiguous (basically, because it does not need to display the image!). All meta-
       data information is concentrated in the (COM*) and (APP) Segments, exception made for some records in the
       (SOF*) segment (e.g. image dimensions).

         Reference: B<"Digital compression and coding of continuous-tone still
                    images: requirements and guidelines", CCITT recommend. T.81, 1992,
                    The International Telegraph and Telephone Consultative Committee,
                    standard ISO/IEC IS 10918-1 or ITU-T Recommendation T.81>.
              Also: B<"The JPEG still picture compression standard", G.K.Wallace,
                    1991, IEEE Transactions on Consumer Electronics>.

   A SHORT HISTORY OF JPEG FILE FORMATS
       TIFF [not supported]

       The TIFF (Tagged Image File Format) is one of the most popular and flexible raster file formats for
       digital images, and is the de-facto standard graphics format for high colour depths. It was initially
       developed by Aldus (in 1986) and is now maintained (or neglected) by Adobe. Multiple images (e.g., fax
       pages) can be stored in a single TIFF file, each page in a separate IFD (Image File Directory), all IFD's
       being linked in a linear structure. The designers of TIFF wanted to be able to include all sorts of image
       information in the file (tags), like geometry, size, data arrangement and various compression options.
       However, lack of full TIFF support by the first publicly available readers resulted in a "reduced
       standard", so that even today most TIFF files contain only 32-bit uncompressed images. Support for JPEG
       streams was included in version 6.0 (in 1992), but it was badly designed, so that JPEG/TIFF files are
       nowadays very uncommon (some changes were proposed in various technical notes in 1995 and 2002, but they
       never made it to version 7.0).

         Reference: B<"TIFF (TM)", revision 6.0, June 1992, Adobe Developers
                    Association, Adobe Systems Incorporated, Mountain View, CA, USA>.
              Also: B<"Adobe PageMaker 6.0: TIFF Technical Notes", September 1995>,
                    B<"Adobe Photoshop: TIFF Technical Notes", March 2002>.

       JFIF

       The JFIF (JPEG File Interchange Format) was created by Eric Hamilton at C-Cube Microsystems in 1991 for
       storing JPEG-encoded data, and put into the public domain together with example reference software. JFIF
       is designed to allow files containing JPEG-encoded data streams to be exchanged between otherwise
       incompatible systems and applications. A JFIF file basically consists of a JPEG file with an APP0 segment
       of JFIF or JFXX type at the beginning, providing information missing from the JPEG stream: version
       number, horizontal and vertical pixel density, pixel aspect ratio and an optional thumbnail (extended
       JFIF files can contain also compressed thumbnails). JFIF files conform to the general file interchange
       specifications in ISO/IEC 10918-1.

         Reference: B<"JPEG File Interchange Format", version 1.02, September 1992,
                    Eric Hamilton, C-Cube Microsystems, Milpitas, CA, USA>.

       SPIFF [not supported yet]

       The SPIFF (Still Picture Interchange File Format) is an official file format released in 1996 by the
       Joint Photographic Experts Group. It can contain image data streams encoded with a variety of compression
       mechanism, including JPEG and JBIG. When JPEG was standardised, disagreements among ISO committees
       prevented a standard JPEG file format from being created.  The de-facto format that appeared was JFIF,
       with which a JPEG/SPIFF file is backward compatible. In addition to the image data, SPIFF includes
       information necessary to render it on common output devices, within the constraints imposed by that
       device. SPIFF appears not to be widely adopted; most producers prefer simple JFIF files.

         Reference: B<ISO/IEC IS 10918-1, part 3> (extensions to the standard).

       CIFF [not supported yet]

       The CIFF (Camera Image File Format) is a JPEG-based image file format developed by some camera makers in
       1997, and constitutes an evolution of the JFIF. A CIFF file has a standard JFIF APP0 segment, immediately
       followed by another APP0 segment holding information specific to a particular digital camera in a
       particular format. CIFF was not at all successful and was rapidly replaced by DCF/Exif.

         Reference: B<"CIFF: Specification on Image Data File",
                    version 1.0 rev.4, December 1997, Canon corporation>.

       Exif and DCT

       The Exif (Exchangeable image file format), developed by the JEITA (1996 v.1.0, 1997 v.1.1, 1998 v.2.1,
       2002 v.2.2) standard was aimed at realizing a common format for the image files used with digital still
       cameras and other related equipment. Exif is paired with a set of naming conventions and directory layout
       for files in a camera memory (DCF or Design rule for Camera File system, 1999). An Exif file is a valid
       JPEG file with application marker segments (APP1 and APP2) inserted. Uncompressed files are recorded in
       TIFF-6 format. Related attribute information is stored in the tag information format defined in TIFF-6
       (i.e., in a chain of IFD's in the APP1 segment). Information specific to the camera system and not
       defined in TIFF is stored in private tags registered for Exif. The Exif image file specification also
       specifies the method for recording thumbnails. The APP2 segment is used when recording Flashpix
       extensions. Exif is the most common JPEG file format used by digital cameras nowadays.

         Reference: B<JEITA CP-3451, "Exchangeable image file format for digital
                    still cameras: Exif Version 2.2", April 2002, Japan Electronics
                    and Information Technology Industries Association>.

   Structure of a JFIF APP0 segment
       APP0 segments are used in the old JFIF standard to store information about the picture dimensions and an
       optional thumbnail. The format of a JFIF APP0 segment is as follows (note that the size of thumbnail data
       is 3n, where n = Xthumbnail * Ythumbnail, and it is present only if n is not zero; only the first 8
       records are mandatory):

           [Record name]    [size]   [description]
           ---------------------------------------
           Identifier       5 bytes  ("JFIF\000" = 0x4a46494600)
           MajorVersion     1 byte   major version (e.g. 0x01)
           MinorVersion     1 byte   minor version (e.g. 0x01 or 0x02)
           Units            1 byte   units (0: densities give aspect ratio
                                            1: density values are dots per inch
                                            2: density values are dots per cm)
           Xdensity         2 bytes  horizontal pixel density
           Ydensity         2 bytes  vertical pixel density
           Xthumbnail       1 byte   thumbnail horizontal pixel count
           Ythumbnail       1 byte   thumbnail vertical pixel count
           ThumbnailData   3n bytes  thumbnail image

       There is also an extended JFIF (only possible for JFIF versions 1.02 and above). In this case the
       identifier is not JFIF but JFXX. This extension allows for the inclusion of differently encoded
       thumbnails. The syntax in this case is modified as follows:

           [Record name]    [size]   [description]
           ---------------------------------------
           Identifier       5 bytes  ("JFXX\000" = 0x4a46585800)
           ExtensionCode    1 byte   (0x10 Thumbnail coded using JPEG
                                      0x11 Thumbnail using 1 byte/pixel
                                      0x13 Thumbnail using 3 bytes/pixel)

       Then, depending on the extension code, there are other records to define the thumbnail. If the thumbnail
       is coded using a JPEG stream, a binary JPEG stream immediately follows the extension code (the byte count
       of this file is included in the byte count of the APP0 Segment). This stream conforms to the syntax for a
       JPEG file (SOI .... SOF ... EOI); however, no 'JFIF' or 'JFXX' marker Segments should be present:

           [Record name]    [size]   [description]
           ---------------------------------------
           JPEGThumbnail  ... bytes  a variable length JPEG picture

       If the thumbnail is stored using one byte per pixel, after the extension code one should find a palette
       and an indexed RGB. The records are as follows (remember that n = Xthumbnail * Ythumbnail):

           [Record name]    [size]   [description]
           ---------------------------------------
           Xthumbnail       1 byte    thumbnail horizontal pixel count
           YThumbnail       1 byte    thumbnail vertical pixel count
           ColorPalette   768 bytes   24-bit RGB values for the colour palette
                                      (defining the colours represented by each
                                       value of an 8-bit binary encoding)
           1ByteThumbnail   n bytes   8-bit indexed values for the thumbnail

       If the thumbnail is stored using three bytes per pixel, there is no colour palette, so the previous
       fields simplify into:

           [Record name]    [size]   [description]
           ---------------------------------------
           Xthumbnail       1 byte    thumbnail horizontal pixel count
           YThumbnail       1 byte    thumbnail vertical pixel count
           3BytesThumbnail 3n bytes 24-bit RGB values for the thumbnail

   Structure of an Exif APP1 segment
       Exif (Exchangeable Image File format) JPEG files use APP1 segments in order not to conflict with JFIF
       files (which use APP0). Exif APP1 segments store a great amount of information on photographic parameters
       for digital cameras and are the preferred way to store thumbnail images nowadays. They can also host an
       additional section with GPS data. The reference document for Exif 2.2 and the Interoperability standards
       are respectively:

          B<"Exchangeable image file format for digital still cameras:
           Exif Version 2.2", JEITA CP-3451, Apr 2002
          Japan Electronic Industry Development Association (JEIDA)>

          B<"Design rule for Camera File system", (DCF), v1.0
           English Version 1999.1.7, Adopted December 1998
          Japan Electronic Industry Development Association (JEIDA)>

       The TIFF (Tagged Image File format) standard documents, as well as some updates and corrections, are also
       useful:

          B<- "TIFF(TM) Revision 6.0, Final", June 3, 1992, Adobe Devel. Association
          - ISO 12639, "Graphic technology -- Prepress digital data exchange
                -- Tag image file format for image  technology (TIFF/IT)"
          - ISO 12234-2, "Electronic still-picture imaging -- Removable memory
                -- Part 2: TIFF/EP image data format"
          - DRAFT - TIFF CLASS F, October 1, 1991
          - DRAFT - TIFF Technical Note #2, 17-Mar-95 (updates for JPEG-in-TIFF)
          - "Adobe Pagemaker 6.0 TIFF Technical Notes", (1,2,3 and OPI), 14-Sep-1995>

       Exif APP1 segments are made up by an identifier, a TIFF header and a sequence of IFDs (Image File
       Directories) and subIFDs. The high level IFDs are only two (IFD0, for photographic parameters, and IFD1
       for thumbnail parameters); they can be followed by thumbnail data. The structure is as follows:

           [Record name]    [size]   [description]
           ---------------------------------------
           Identifier       6 bytes   ("Exif\000\000" = 0x457869660000), not stored
           Endianness       2 bytes   'II' (little-endian) or 'MM' (big-endian)
           Signature        2 bytes   a fixed value = 42
           IFD0_Pointer     4 bytes   offset of 0th IFD (usually 8), not stored
           IFD0                ...    main image IFD
           IFD0@SubIFD         ...    Exif private tags (optional, linked by IFD0)
           IFD0@SubIFD@Interop ...    Interoperability IFD (optional,linked by SubIFD)
           IFD0@GPS            ...    GPS IFD (optional, linked by IFD0)
           APP1@IFD1           ...    thumbnail IFD (optional, pointed to by IFD0)
           ThumbnailData       ...    Thumbnail image (optional, 0xffd8.....ffd9)

       So, each Exif APP1 segment starts with the identifier string "Exif\000\000"; this avoids a conflict with
       other applications using APP1, for instance XMP data. The three following fields (Endianness, Signature
       and IFD0_Pointer) constitute the so called TIFF header. The offset of the 0th IFD in the TIFF header, as
       well as IFD links in the following IFDs, is given with respect to the beginning of the TIFF header (i.e.
       the address of the 'MM' or 'II' pair). This means that if the 0th IFD begins (as usual) immediately after
       the end of the TIFF header, the offset value is 8. An Exif segment is the only part of a JPEG file whose
       endianness is not fixed to big-endian.

       If the thumbnail is present it is located after the 1st IFD. There are 3 possible formats: JPEG (only
       this is compressed), RGB TIFF, and YCbCr TIFF.  It seems that JPEG and 160x120 pixels are recommended for
       Exif ver. 2.1 or higher (mandatory for DCF files). Since the segment size for a segment is recorded in 2
       bytes, thumbnails are limited to slightly less than 64KB.

       Each IFD block is a structured sequence of records, called, in the Exif jargon, Interoperability arrays.
       The beginning of the 0th IFD is given by the 'IFD0_Pointer' value. The structure of an IFD is the
       following:

           [Record name]    [size]   [description]
           ---------------------------------------
                            2 bytes  number n of Interoperability arrays
                          12n bytes  the n arrays (12 bytes each)
                            4 bytes  link to next IFD (can be zero)
                              ...    additional data area

       The next_link field of the 0th IFD, if non-null, points to the beginning of the 1st IFD. The 1st IFD as
       well as all other sub-IFDs must have next_link set to zero. The thumbnail location and size is given by
       some interoperability arrays in the 1st IFD. The structure of an Interoperability array is:

           [Record name]    [size]   [description]
           ---------------------------------------
                            2 bytes  Tag (a unique 2-byte number)
                            2 bytes  Type (one out of 12 types)
                            4 bytes  Count (the number of values)
                            4 bytes  Value Offset (value or offset)

       The possible types are the same as for the Record class, exception made for nibbles and references (see
       "Managing a JPEG Record object" in Image::MetaData::JPEG). Indeed, the Record class is modelled after
       interoperability arrays, and each interoperability array gets stored as a Record with given tag, type,
       count and values. The "value offset" field gives the offset from the TIFF header base where the value is
       recorded. It contains the actual value if it is not larger than 4 bytes (32 bits). If the value is
       shorter than 4 bytes, it is recorded in the lower end of the 4-byte area (smaller offsets). For further
       details see the section "Valid tags for Exif APP1 data" in Image::MetaData::JPEG::TagLists.

   Structure of an XMP APP1 segment
       XMP (eXtensible Metadata Platform) JPEG files use APP1 segments in order to store metadata information;
       the storage format (serialisation) is RDF (Resource Description Framework) implemented as an application
       of XML. XMP Exif APP1 segments can store all information stored in Exif and IPTC segments, as well as
       custom and future schemas. The reference document for XMP 3.2 can be requested from Adobe Systems
       Incorporated:

          B<"XMP Specification", version 3.2, June 2005, Adobe Systems Inc.,
          345 Park Avenue, San Jose, CA 95110-2704, L<http://www.adobe.com>>

          B<See also: L<http://xml.coverpages.org/xmp.html>>
          B<See also: L<http://www.w3.org/TR/REC-rdf-syntax/>>

       XMP APP1 segments are made up by an identifier and a Unicode XMP packet (the encoding is usually UTF-8,
       but it can also be UTF-16 or UTF-32, both big-endian or little-endian). The packet cannot be split in
       multiple segments, so there is a maximum size of approximately 64KB. The structure is very simple: a
       fixed XMP namespace URI (null terminated, and without quotation marks) followed by the XMP packet:

           [Record name]    [size]   [description]
           ---------------------------------------
           Identifier      29 bytes   http://ns.adobe.com/xap/1.0/\000
           <XMP packet>       ...     the actual Unicode XMP packet

       The packet content is sandwiched between a header and a trailer, and may contain padding whitespaces at
       the end. The xpacket header has two mandatory attributes, begin and id (order is important), separated by
       exactly one space (U+0020). Attribute values, here and in the following, are enclosed by single quotes
       (U+0027) or double quotes (U+0022). The value of begin must be the Unicode zero-width non-breaking space
       (U+FEFF), to be used as a byte-order marker during blind scans; an empty value is also acceptable (for
       backward compatibility), and stands for UTF-8. The value of id is fixed. Other attributes, like
       (deprecated) bytes or encoding, may be ignored. A padding of 2KB or 4KB, with a newline every 100 spaces,
       is recommended. The end attribute of the trailer may have a value of "r" (read-only) or "w" (modifiable).

           Header      <?xpacket begin="..." id="W5M0MpCehiHzreSzNTczkc9d" ...?>
           Content       ... serialised XMP data (see later) ...
           Padding       ... padding with XML whitespaces ...
           Trailer     <?xpacket end="w"?>

       The structure of the packet content is as follows. There is an optional x:xmpmeta (or x:xapmeta for older
       files) element, with a mandatory xmlns:x attribute set to "adobe:ns:meta/" and other optional attributes,
       which can be ignored (currently, Adobe's toolkit stores its version in x:xmptk). Inside it (or at top
       level, if it is absent), there is exactly one rdf:RDF element with an attribute specifying the xmlns:rdf
       namespace (other namespaces, whose role is not clear to me, can be listed here as additional attributes).
       Inside the rdf:RDF element then, all XMP properties are stored inside one or more rdf:Description
       elements:

           <x:xmpmeta xmlns:x='adobe:ns:meta/' ..opt.attributes..>
              <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
                 <rdf:Description ...> .... </rdf:Description>
                 <rdf:Description ...> .... </rdf:Description>
                 .........
                 <rdf:Description ...> .... </rdf:Description>
              </rdf:RDF>
           </x:xmpmeta>

       rdf:Description elements and schemas are usually in one-to-one correspondence, although this is just a
       convention and not a rule. Each element has two mandatory attributes, rdf:about and xmlns:NAME. The
       rdf:about attribute is usually empty (it can however contain an application specific URI), and its value
       must be shared among all rdf:Description elements. The xmlns:NAME attribute specifies the local namespace
       prefix (NAME stands for the actual prefix). Additional namespaces can be specified via xmlns attributes.

           <rdf:Description rdf:about="" xmlns:NAME="...">
              <NAME:propname1 ...> .... </NAME:propname1>
              <NAME:propname2 ...> .... </NAME:propname2>
                 .........
              <NAME:propnameN ...> .... </NAME:propnameN>
           </rdf:Description>

       Properties can come in various flavours. A simple property is just some literal value between opening and
       closing tags carrying the property name; there exists also an abbreviated form where properties are
       listed as attributes of the rdf:Description tag (in this case there is no closing rdf:Description tag,
       and the opening tags ends with the '/' character).  Simple properties can have qualifiers (attributes).
       This is an example of a description element with only simple properties:

           <rdf:Description rdf:about="" xmlns:xap="http://ns.adobe.com/xap/1.0/">
              <xap:CreatorTool>Adobe Photoshop Elements 3.0</xap:CreatorTool>
              <xap:ModifyDate>2005-05-12T44:39:11+03:00</xap:ModifyDate>
           </rdf:Description>
                                         or

           <rdf:Description rdf:about="" xmlns:xap="http://ns.adobe.com/xap/1.0/"
              xap:CreatorTool="Adobe Photoshop Elements 3.0"
              xap:ModifyDate="2005-06-13T23:38:11+02:00"/>

       A structured property is characterised by a rdf:Description block instead of a simple content. The inner
       description block, which does not allow for an rdf:about attribute, contains one or more named properties
       (which can, of course, contain further substructures). This is an example of a description element with
       one structured property:

           <rdf:Description rdf:about="" xmlns:xmpTPg="..an URI..">
              <xmpTPg:MaxPageSize>
                 <rdf:Description xmlns:stDim="..another URI.."> ---.   Content
                    <stDim:w>21</stDim:w>                           |   of  the
                    <stDim:h>27</stDim:h>                           | MaxPageSize
                    <stDim:unit>centimetres</stDim:unit>            |  structured
                 </rdf:Description>                -----------------'   property
              </xmpTPg:MaxPageSize>
           </rdf:Description>

       An array property is, on the other hand, just an array of same-type properties. It can come in three
       varieties: as a Bag (an unordered set), as a Sequence (an ordered list) and as an Alternative list (see
       later). As for structured properties, each item in the array is a property and can contain further
       substructures. The item list is delimited by an rdf:Bag, an rdf:Seq, or an rdf:Alt tag pair,
       respectively. Each item is delimited by a pair of rdf:li tags. The following example illustrates an
       unordered array property with three items:

           <rdf:Description rdf:about="" xmlns:dc="..an URI..">
              <dc:subject>
                 <rdf:Bag>                         ----------------.   Content
                    <rdf:li>metadata</rdf:li>                      |   of  the
                    <rdf:li>schema</rdf:li>                        |  unordered
                    <rdf:li>XMP</rdf:li>                           |    array
                 </rdf:Bag>                        ----------------'  dc:subject
              </dc:subject>
           </rdf:Description>

       A simple property (but, currently, not a structured or array property) can have qualifiers. For instance,
       items in an array property can be qualified; in this case, the content of rdf:li is not text, but an
       rdf:Description (without attributes) containing the actual value (as rdf:value) and the qualifiers as
       additional properties; a qualifier name belongs to a secondary namespace, specified as an attribute of
       the outer rdf:Description. The following example illustrates the use of property qualifiers:

           <rdf:Description rdf:about="" xmlns:dc="..1stURI.." xmlns:ns="..2ndURI..">
              <dc:creator>
                 <rdf:Seq>
                    <rdf:li>                       ----------------.  An element
                       <rdf:Description>                           |    of the
                          <rdf:value>William Gilbert</rdf:value>   | ordered array
                          <ns:role>lyricist</ns:role>              |  dc:creator
                       </rdf:Description>                          |    with a
                    </rdf:li>                      ----------------'   qualifier
                    <rdf:li>
                       <rdf:Description>
                          <rdf:value>Arthur Sullivan</rdf:value>
                          <ns:role>composer</ns:role>
                       </rdf:Description>
                    </rdf:li>
                 </rdf:Seq>
              </dc:creator>
           </rdf:Description>

       Text properties in an alternative array property may have a special qualifier (the xml:lang property)
       specifying the language of the text. In this case, the serialisation is different: xml:lang for each item
       of the array becomes an attribute of the rdf:li tag. The default value is indicated by a special value of
       the qualifier, or is taken from the first item in the list. The following example illustrates an array of
       language alternatives:

           <xmp:Title>
              <rdf:Alt>                            ----------------.     List of
                 <rdf:li xml:lang="x-default">Title</rdf:li>       |  alternatives
                 <rdf:li xml:lang="en-us">Title</rdf:li>           |  corresponding
                 <rdf:li xml:lang="fr-fr">Titre</rdf:li>           |  to xmp:Title;
                 <rdf:li xml:lang="it-it">Titolo</rdf:li>          |  the  language
              </rdf:Alt>                           ----------------'   is xml:lang
           </xmp:Title>

       The interpretation of the values of properties conforming to the previously described syntax is done with
       the aid of the schema corresponding to the property namespace. The most common schemas in JPEG files are
       the following ones (note, however, that XMP is intrinsically extensible, so this list will never be
       exhaustive):

       Dublin Core schema =item XMP basic schema =item XMP rights management schema =item XMP media management
       schema =item XMP basic job ticket schema =item XMP paged-text schema =item XMP dynamic media schema =item
       Adobe Photoshop schema =item Camera row schema =item Exif schemas =item IPTC schema

   Structure of a Photoshop-style APP13 segment
       The  Adobe's  Photoshop  program,  a de-facto standard for image manipulation, uses the APP13 segment for
       storing non-graphic information, such as layers, paths, IPTC data and more. The unit  for  this  kind  of
       information  is called a "resource data block" (because they hold data that was stored in the Macintosh's
       resource fork in early versions of Photoshop). The content of an APP13 segment is formed by an identifier
       string (usually "Photoshop 3.0\000",  but  also  'Adobe_Photoshop2.5:',  used  by  earlier  versions,  is
       accepted; in this case some additional undocumented bytes are read (resolution info?) and saved in a root
       'Resolution'  record)  followed by a sequence of resource data blocks; a resource block has the following
       structure:

           [Record name]    [size]   [description]
           ---------------------------------------
           (Type)           4 bytes  Photoshop uses '8BIM' from ver 4.0 on
           (ID)             2 bytes  a unique identifier, e.g., "\004\004" for IPTC
           (Name)             ...    a Pascal string (padded to make size even)
           (Size)           4 bytes  actual size of resource data
           (Data)             ...    resource data, padded to make size even

       (a Pascal string is made up of a single byte, giving the string length, followed by  the  string  itself,
       padded to make size even including the length byte; since the string length is explicit, there is no need
       of a terminating null character). The signature (type) is usually '8BIM', but Photoshop used '8BPS' up to
       version  3.0,  and  some rogue program (Adobe PhotoDeluxe?) is using 'PHUT' ("PHotoshop User Tags" ?) for
       path information (ID=7d0-bb7). Valid Image Resource IDs are listed  in  the  Photoshop-style  tags'  list
       section. In general a resource block contains only a few bytes, but there is an important block, the IPTC
       block,  which can be quite large; the structure of this block is analysed in more detail in the IPTC data
       block section.

       The reference document for the Photoshop file format is:

          B<"Adobe Photoshop 6.0: File Formats Specifications",
          Adobe System Inc., ver.6.0, rel.2, November 2000>.

       Another interesting source of information is:

          B<"\"Solo\" Image File Format. RichTIFF and its
           replacement by \"Solo\" JFIF", version 2.0a,
          Coatsworth Comm. Inc., Brampton, Ontario, Canada>

       Structure of an IPTC data block

       An IPTC/NAA resource data block of a Photoshop-style APP13 segment embeds an IPTC  stream  conforming  to
       the  standard  defined by the International Press and Telecommunications Council (IPTC) and the Newspaper
       Association of America (NAA) for exchanging interoperability information related to various news objects.
       The data part of a resource block, an IPTC stream, is simply a sequence  of  units  called  datasets;  no
       preamble nor count is present. Each dataset consists of a unique tag header and a data field (the list of
       valid tags [dataset numbers] can be found in section about IPTC data). A standard tag header is used when
       the  data field size is less than 32768 bytes; otherwise, an extended tag header is used. The datasets do
       not need to show up in numerical order according to their tag. The structure of a dataset is:

           [Record name]    [size]   [description]
           ---------------------------------------
           (Tag marker)     1 byte   this must be 0x1c
           (Record number)  1 byte   always 2 for 2:xx datasets
           (Dataset number) 1 byte   this is what we call a "tag"
           (Size specifier) 2 bytes  data length (< 32768 bytes) or length of ...
           (Size specifier)  ...     data length (> 32767 bytes only)
           (Data)            ...     (its length is specified before)

       So, standard datasets have a 5 bytes tag header; the last two bytes in the header contain the data  field
       length,  the  most significant bit being always 0. For extended datasets instead, these two bytes contain
       the length of the (following) data field length, the most significant bit being always 1.  The  value  of
       the  most  significant  bit  thus  distinguishes  "standard" from "extended"; in digital photographies, I
       assume that the datasets which are actually  used  (a  subset  of  the  standard)  are  always  standard;
       therefore,  we  likely  do not have the IPTC block spanning more than one APP13 segment. The record types
       defined by the IPTC-NAA standard are the following (but the "pseudo"-standard by  Adobe  for  APP13  IPTC
       data  is  restricted to the first application record, 2:xx, and sometimes to the envelope record, 1:xx, I
       believe, because everything else can be accomodated more simply by other JPEG Segments):

           [Record name]                [dataset record number]
           ----------------------------------------------------
           Object Envelop Record                 1:xx
           Application Records:             2:xx through 6:xx
           Pre-ObjectData Descriptor Record:     7:xx
           ObjectData Record:                    8:xx
           Post-ObjectData Descriptor Record:    9:xx

       The reference document for the IPTC standard is:

          B<"IPTC-NAA: Information Interchange Model", version 4, 1-Jul-1999,
          ComitE<eacute> International des TE<eacute>lE<eacute>communications de Presse>

AUTHOR

       Stefano Bettelli, bettelli@cpan.org

COPYRIGHT AND LICENSE

       Copyright (C) 2004,2005,2006 by Stefano Bettelli

       This library is free software; you can redistribute it and/or modify  it  under  the  terms  of  the  GNU
       General Public License. See the COPYING and LICENSE file for the license terms.

NAME

THE STRUCTURE OF JPEG PICTURES

AUTHOR

COPYRIGHT AND LICENSE

SEE ALSO