Provided by: libopenslide-dev_3.4.0-1build1_amd64
NAME
openslide-formats - Reference supported formats
DESCRIPTION
LIST OF KNOWN PROPERTIES
Properties generated by OpenSlide openslide.background-color The background color of the slide, given as an RGB hex triplet. This property is not always present. openslide.bounds-height The height of the rectangle bounding the non-empty region of the slide. This property is not always present. openslide.bounds-width The width of the rectangle bounding the non-empty region of the slide. This property is not always present. openslide.bounds-x The X coordinate of the rectangle bounding the non-empty region of the slide. This property is not always present. openslide.bounds-y The Y coordinate of the rectangle bounding the non-empty region of the slide. This property is not always present. openslide.comment A free-form text comment. openslide.mpp-x Microns per pixel in the X dimension of level 0. May not be present or accurate. openslide.mpp-y Microns per pixel in the Y dimension of level 0. May not be present or accurate. openslide.objective-power Magnification power of the objective. Often inaccurate; sometimes missing. openslide.quickhash-1 A non-cryptographic hash of a subset of the slide data. It can be used to uniquely identify a particular virtual slide, but cannot be used to detect file corruption or modification. openslide.vendor The name of the vendor backend. Properties for TIFF-based formats tiff.Artist The contents of the TIFF Artist tag. tiff.Copyright The contents of the TIFF Copyright tag. tiff.DateTime The contents of the TIFF DateTime tag. tiff.DocumentName The contents of the TIFF DocumentName tag. tiff.HostComputer The contents of the TIFF HostComputer tag. tiff.ImageDescription The contents of the TIFF ImageDescripton tag. tiff.Make The contents of the TIFF Make tag. tiff.Model The contents of the TIFF Model tag. tiff.ResolutionUnit The contents of the TIFF ResolutionUnit tag. tiff.Software The contents of the TIFF Software tag. tiff.XPosition The contents of the TIFF XPosition tag. tiff.XResolution The contents of the TIFF XResolution tag. tiff.YPosition The contents of the TIFF YPosition tag. tiff.YResolution The contents of the TIFF YResolution tag. Vendor-specific properties A list of vendor-specific properties can be found on the pages for each vendor format, linked from Supported Virtual Slide Formats[1].
TRESTLE FORMAT
Format single-file pyramidal tiled TIFF, with non-standard metadata and overlaps; additional files contain more metadata and detailed overlap info File extensions .tif OpenSlide vendor backend trestle Detection Trestle slides are stored in single-file TIFF format. OpenSlide will detect a file as Trestle if: 1. The file is TIFF. 2. The TIFF Software tag starts with MedScan. 3. The ImageDescription tag is present. 4. All images are tiled. Relevant TIFF tags ┌─────────────────────────┬──────────────────────────────────┐ │Tag │ Description │ ├─────────────────────────┼──────────────────────────────────┤ │ImageDescription │ Stores some important key-value │ │ │ pairs, see below │ ├─────────────────────────┼──────────────────────────────────┤ │Software │ Starts with ”MedScan” │ ├─────────────────────────┼──────────────────────────────────┤ │XResolution, YResolution │ Seems to store microns-per-pixel │ │ │ (MPP), which may or may not take │ │ │ into account the correct │ │ │ objective power. Note that this │ │ │ is inverted from standard TIFF, │ │ │ which stores pixels-per-unit, │ │ │ not units-per-pixel. │ └─────────────────────────┴──────────────────────────────────┘ Extra data stored in ImageDescription The ImageDescription tag contains semicolon-delimited key-value pairs. A key-value pair is equals-delimited. We use the OverlapsXY and Background Color keys from the ImageDescription, and ignore the rest. All of these values are stored as properties starting with ”trestle.”. ┌─────────────────┬─────────────────────────────────┐ │Key │ Description │ ├─────────────────┼─────────────────────────────────┤ │Background Color │ Hex-encoded background color │ │ │ info, assumed to be in the │ │ │ format RRGGBB. │ ├─────────────────┼─────────────────────────────────┤ │White Balance │ Hex-encoded white balance │ ├─────────────────┼─────────────────────────────────┤ │Objective Power │ Reported objective power, often │ │ │ incorrect. │ ├─────────────────┼─────────────────────────────────┤ │JPEG Quality │ The JPEG quality value. │ ├─────────────────┼─────────────────────────────────┤ │OverlapsXY │ Overlaps, see below. │ └─────────────────┴─────────────────────────────────┘ TIFF Image Directory Organization The first image in the TIFF file is the full-resolution image. The subsequent images are assumed to be decreasingly sized reduced-resolution images. Overlaps The OverlapsXY pseudo-field encodes a list of tile overlap values as ASCII. Example: ”64 64 32 32 16 16” (note the initial space). These values represent the standard overlaps between adjacent tiles in X and Y, in pixels. This example encodes 3 levels worth of overlaps. Further overlaps are assumed to have the value 0. Individual tile overlaps may differ from the standard overlaps. These individual overlaps are recorded in .tif-Nb files adjacent to the .tif file, where N is the level number. OpenSlide does not read these files, though they have been partially decoded; see issue 21[2] for details. Associated Images macro the image with a filename extension of ”.Full” (optional) Known Properties All data encoded in the ImageDescription TIFF field is represented as properties prefixed with ”trestle.”. openslide.mpp-x copy of tiff.XResolution (note that this is a totally non-standard use of this TIFF tag) openslide.mpp-y copy of tiff.YResolution (note that this is a totally non-standard use of this TIFF tag) openslide.objective-power normalized trestle.Objective Power Test Data http://openslide.cs.cmu.edu/download/openslide-testdata/Trestle/
HAMAMATSU FORMAT
Format multi-file JPEG/NGR with proprietary metadata and index file formats, and single-file TIFF-like format with proprietary metadata File extensions .vms, .vmu, .ndpi OpenSlide vendor backend hamamatsu Detection OpenSlide will detect a file as Hamamatsu if: 1. The file given is a INI-style text file. 2. It has a [Virtual Microscope Specimen] (VMS) or [Uncompressed Virtual Microscope Specimen] (VMU) group. 3. If VMS, there are at least 1 row and 1 column of JPEG images (NoJpegColumns and NoJpegRows). or if: 1. The file has a TIFF directory structure. 2. The Software tag starts with NDP.scan. Overview The Hamamatsu format has three variants. VMS and VMU consist of an index file, 2 or more image files, and (in the case of VMS) an “optimisation” file. NDPI consists of a single TIFF-like file with some custom TIFF tags. VMS and NDPI contain JPEG images; VMU contains NGR images (a custom uncompressed format). Multiple focal planes are ignored, only focal plane 0 is read. JPEG does not allow for files larger than 65535 pixels on a side. In VMS, multiple JPEG files are used to encode large images. To avoid having many files, VMS uses close to maximum size (65K by 65K) JPEG files. NDPI, instead, stuffs large levels into a single JPEG and sets the overflowed width/height fields to 0. Unfortunately, JPEG provides very poor support for random-access decoding of parts of a file. To get around this, JPEG restart markers are placed at regular intervals, and these offsets are specified in the optimisation file (in VMS) or in a TIFF tag (in NDPI). With restart markers identified, OpenSlide can treat JPEG as a tiled format, where the height is the height of an MCU row, and the width is the number of MCUs per row divided by the restart marker interval times the width of an MCU. (This often leads to oddly-shaped and inefficient tiles of 4096x8, for example.) Unfortunately, the VMS optimisation file does not give the location of every restart marker, only the ones found at the beginning of an MCU row. It also seems that the file ends early, and does not give the location of the restart marker at the last MCU row of the last image file. Thus, the optimisation file can only be taken as a hint, and cannot be trusted. The entire set of JPEG files must be scanned for restart markers in order to facilitate random access. OpenSlide does this lazily as needed, and also in a background thread that runs only when OpenSlide is otherwise idle. The VMS map file is a lower-resolution version of the other images, and can be used to make a 2-level JPEG pyramid. JPEG also allows for lower-resolution decoding, so further pyramid levels are synthesized from each JPEG file. VMS File The .vms file is the main index file for the VMS format. It is a Windows INI-style key-value pair file, with sections. Only keys in the Virtual Microscope Specimen group are read by OpenSlide. Here are known keys from the file: ┌───────────────────────┬──────────────────────────────────┐ │Key │ Description │ ├───────────────────────┼──────────────────────────────────┤ │NoLayers │ Number of layers, currently must │ │ │ be 1 to be accepted │ ├───────────────────────┼──────────────────────────────────┤ │NoJpegColumns │ Number of JPEG files across, │ │ │ given in ImageFile attributes │ ├───────────────────────┼──────────────────────────────────┤ │NoJpegRows │ Number of JPEG files down, given │ │ │ in ImageFile attributes │ ├───────────────────────┼──────────────────────────────────┤ │ImageFile │ Semantically equivalent to │ │ │ ImageFile(0,0,0), though not │ │ │ specified that way. The image in │ │ │ position (0,0,0) of the set of │ │ │ images │ ├───────────────────────┼──────────────────────────────────┤ │ImageFile(x,y) │ Semantically equivalent to │ │ │ ImageFile(0,x,y), though not │ │ │ specified that way. The image in │ │ │ position (0,x,y) of the set of │ │ │ images │ ├───────────────────────┼──────────────────────────────────┤ │ImageFile(z,x,y) │ Where x and y are non-negative │ │ │ integers. Both x and y cannot be │ │ │ 0. z is a positive integer. │ │ │ These are the images that make │ │ │ up the virtual slide, as a │ │ │ concatenation of JPEG images. x │ │ │ and y specify the location of │ │ │ each JPEG, z specifies the focal │ │ │ plane │ ├───────────────────────┼──────────────────────────────────┤ │MapFile │ A lower-resolution version of │ │ │ all the ImageFiles │ ├───────────────────────┼──────────────────────────────────┤ │OptimisationFile │ File specifying some of the │ │ │ restart marker offsets in each │ │ │ ImageFile │ ├───────────────────────┼──────────────────────────────────┤ │AuthCode │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │SourceLens │ Apparently the magnification │ ├───────────────────────┼──────────────────────────────────┤ │PhysicalWidth │ Width of the slide in some unit? │ ├───────────────────────┼──────────────────────────────────┤ │PhysicalHeight │ Height of the slide in some │ │ │ unit? │ ├───────────────────────┼──────────────────────────────────┤ │LayerSpacing │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │MacroImage │ Image file for the “macro” │ │ │ associated image │ ├───────────────────────┼──────────────────────────────────┤ │PhysicalMacroWidth │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │PhysicalMacroHeight │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │XOffsetFromSlideCentre │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │YOffsetFromSlideCentre │ Unknown │ └───────────────────────┴──────────────────────────────────┘ VMU File The .vmu file is the main index file for the VMU format. Only keys in the Uncompressed Virtual Microscope Specimen group are read by OpenSlide. Here are known keys from the file: ┌───────────────────────┬──────────────────────────────────┐ │Key │ Description │ ├───────────────────────┼──────────────────────────────────┤ │NoLayers │ (see VMS above) │ ├───────────────────────┼──────────────────────────────────┤ │ImageFile │ (see VMS above) │ ├───────────────────────┼──────────────────────────────────┤ │ImageFile(x,y) │ (see VMS above) │ ├───────────────────────┼──────────────────────────────────┤ │ImageFile(z,x,y) │ (see VMS above) │ ├───────────────────────┼──────────────────────────────────┤ │MapFile │ (see VMS above) │ ├───────────────────────┼──────────────────────────────────┤ │MapScale │ Seems to be the downsample │ │ │ factor of the map │ ├───────────────────────┼──────────────────────────────────┤ │AuthCode │ (see VMS above) │ ├───────────────────────┼──────────────────────────────────┤ │SourceLens │ (see VMS above) │ ├───────────────────────┼──────────────────────────────────┤ │PixelWidth │ Width of the image in pixels │ ├───────────────────────┼──────────────────────────────────┤ │PixelHeight │ Height of the image in pixels │ ├───────────────────────┼──────────────────────────────────┤ │PhysicalWidth │ (see VMS above) │ ├───────────────────────┼──────────────────────────────────┤ │PhysicalHeight │ (see VMS above) │ ├───────────────────────┼──────────────────────────────────┤ │LayerSpacing │ (see VMS above) │ ├───────────────────────┼──────────────────────────────────┤ │LayerOffset │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │MacroImage │ (see VMS above) │ ├───────────────────────┼──────────────────────────────────┤ │PhysicalMacroWidth │ (see VMS above) │ ├───────────────────────┼──────────────────────────────────┤ │PhysicalMacroHeight │ (see VMS above) │ ├───────────────────────┼──────────────────────────────────┤ │XOffsetFromSlideCentre │ (see VMS above) │ ├───────────────────────┼──────────────────────────────────┤ │YOffsetFromSlideCentre │ (see VMS above) │ ├───────────────────────┼──────────────────────────────────┤ │Reference │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │BitsPerPixel │ Bits per pixel, currently │ │ │ expected to be 36 │ ├───────────────────────┼──────────────────────────────────┤ │PixelOrder │ Currently expected to be RGB │ ├───────────────────────┼──────────────────────────────────┤ │Creator │ String describing the software │ │ │ creating this image │ ├───────────────────────┼──────────────────────────────────┤ │IlluminationMode │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │ExposureMultiplier │ Unknown, possibly the multiplier │ │ │ used to scale to 15 bits? │ ├───────────────────────┼──────────────────────────────────┤ │GainRed │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │GainGreen │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │GainBlue │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │FocalPlaneTolerance │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │NMP │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │MacroIllumination │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │FocusOffset │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │RefocusInterval │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │CubeName │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │HardwareModel │ Name of the hardware │ ├───────────────────────┼──────────────────────────────────┤ │HardwareSerial │ Serial number of the hardware │ ├───────────────────────┼──────────────────────────────────┤ │NoFocusPoints │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │FocusPoint0X │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │FocusPoint0Y │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │FocusPoint0Z │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │FocusPoint1X │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │FocusPoint1Y │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │FocusPoint1Z │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │FocusPoint2X │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │FocusPoint2Y │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │FocusPoint2Z │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │FocusPoint3X │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │FocusPoint3Y │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │FocusPoint3Z │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │NoBlobPoints │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │BlobPoint0Blob │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │BlobPoint0FocusPoint │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │BlobPoint1Blob │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │BlobPoint1FocusPoint │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │BlobPoint2Blob │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │BlobPoint2FocusPoint │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │BlobPoint3Blob │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │BlobPoint3FocusPoint │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │BlobMapWidth │ Unknown │ ├───────────────────────┼──────────────────────────────────┤ │BlobMapHeight │ Unknown │ └───────────────────────┴──────────────────────────────────┘ NDPI File NDPI uses a TIFF-like structure, but libtiff cannot read the headers of an NDPI file. This is because NDPI specifies the RowsPerStrip as the height of the file, and after doing out the multiplication, this typically overflows libtiff and it refuses to open the file. Also, the TIFF tags are not stored in sorted order. NDPI stores an image pyramid in TIFF directory entries. In some files, the lower-resolution pyramid levels contain no restart markers. The macro image, and sometimes an active-region map, seems to come last. JPEG files in NDPI are not necessarily valid. If ImageWidth or ImageHeight exceeds the JPEG limit of 65535, then the width or height as stored in the JPEG file is 0. libjpeg will refuse to read the header of such a file, so the JPEG data stream must be altered when fed into libjpeg. Here are the observed TIFF tags: ┌────────────────┬──────────────────────────────────┐ │Tag │ Description │ ├────────────────┼──────────────────────────────────┤ │ImageWidth │ Width of the image │ ├────────────────┼──────────────────────────────────┤ │ImageHeight │ Height of the image │ ├────────────────┼──────────────────────────────────┤ │Make │ “Hamamatsu” │ ├────────────────┼──────────────────────────────────┤ │Model │ “NanoZoomer” or “C9600-12”, etc │ ├────────────────┼──────────────────────────────────┤ │XResolution │ Seemingly correct X resolution, │ │ │ when interpreted with │ │ │ ResolutionUnit │ ├────────────────┼──────────────────────────────────┤ │YResolution │ Seemingly correct Y resolution, │ │ │ when interpreted with │ │ │ ResolutionUnit │ ├────────────────┼──────────────────────────────────┤ │ResolutionUnit │ Seemingly correct resolution │ │ │ unit │ ├────────────────┼──────────────────────────────────┤ │Software │ “NDP.scan”, sometimes with a │ │ │ version number │ ├────────────────┼──────────────────────────────────┤ │StripOffsets │ The offset of the JPEG file for │ │ │ this layer │ ├────────────────┼──────────────────────────────────┤ │StripByteCounts │ The length of the JPEG file for │ │ │ this layer │ ├────────────────┼──────────────────────────────────┤ │65420 │ Unknown, always 1? │ ├────────────────┼──────────────────────────────────┤ │65421 │ SourceLens, correctly │ │ │ downsampled for each entry. -1 │ │ │ for macro image, -2 for a map of │ │ │ non-empty regions. │ ├────────────────┼──────────────────────────────────┤ │65422 │ XOffsetFromSlideCentre │ ├────────────────┼──────────────────────────────────┤ │65423 │ YOffsetFromSlideCentre │ ├────────────────┼──────────────────────────────────┤ │65424 │ Seemingly the Z offset from the │ │ │ center focal plane, in some unit │ ├────────────────┼──────────────────────────────────┤ │65425 │ Unknown, always 0? │ ├────────────────┼──────────────────────────────────┤ │65426 │ Optimisation entries, as above │ ├────────────────┼──────────────────────────────────┤ │65427 │ Reference │ ├────────────────┼──────────────────────────────────┤ │65428 │ Unknown, AuthCode? │ ├────────────────┼──────────────────────────────────┤ │65433 │ Unknown, I have seen 1500 in │ │ │ this tag │ ├────────────────┼──────────────────────────────────┤ │65439 │ Unknown, perhaps some polygon │ │ │ ROI? │ ├────────────────┼──────────────────────────────────┤ │65440 │ Unknown, I have seen this: <0 0 │ │ │ 0 1 0 2 0 3 0 4 0 5 0 6 0 7 0 8 │ │ │ 1 9 1 10 1 11 1 12 1 13 1 14 1 │ │ │ 15 1 16 1 17> │ ├────────────────┼──────────────────────────────────┤ │65441 │ Unknown, always 0? │ ├────────────────┼──────────────────────────────────┤ │65442 │ Scanner serial number │ ├────────────────┼──────────────────────────────────┤ │65443 │ Unknown, have seen 0 or 16 │ ├────────────────┼──────────────────────────────────┤ │65444 │ Unknown, always 80? │ ├────────────────┼──────────────────────────────────┤ │65445 │ Unknown, have seen 0, 2, 10 │ ├────────────────┼──────────────────────────────────┤ │65446 │ Unknown, always 0? │ ├────────────────┼──────────────────────────────────┤ │65449 │ ASCII metadata block, key=value │ │ │ pairs, not always present │ ├────────────────┼──────────────────────────────────┤ │65455 │ Unknown, have seen 13 │ ├────────────────┼──────────────────────────────────┤ │65456 │ Unknown, have seen 101 │ ├────────────────┼──────────────────────────────────┤ │65457 │ Unknown, always 0? │ ├────────────────┼──────────────────────────────────┤ │65458 │ Unknown, always 0? │ └────────────────┴──────────────────────────────────┘ Optimisation File (only for VMS) The optimisation file contains a list of 32- (or 64- or 320- ?) bit little endian values, giving the file offset into an MCU row, each offset starts at a 40-byte alignment, and the last row (of the entire file, not each image) seems to be missing. The offsets are all packed into 1 file, even with multiple images. The order of images is left-to-right, top-to-bottom. Map File (only for VMS/VMU) The VMS map file is a standard JPEG file. Its restart markers (if any) are not included in the optimisation file. The VMU map file is in NGR format. This file can be used to provide a lower-resolution view of the slide. Image Files (only for VMS/VMU) These files are given by the VMS/VMU ImageFile keys. They are assumed to have a height which is a multiple of the MCU height. They are assumed to have a width which is a multiple of MCUs per row divided by the restart interval. For VMS, these files are in JPEG, for VMU they are in NGR format. NGR Format The NGR file contains uncompressed 16-bit RGB data, with a small header. The files we have encountered start with GN, two more bytes, and then width, height, and column width in little endian 32-bit format. The column width must divide evenly into the width. Column width is important, since NGR files are generated in columns, where the first column comes first in the file, followed by subsequent files. Columns are painted left-to-right. At offset 24 is another 32-bit integer which gives the offset in the file to the start of the image data. The image data we have encountered is in 16-bit little endian format. Associated Images macro the image file given by the MacroImage value in the VMS/VMU file, or SourceLens of -1 in NDPI Known Properties All key-value data stored in the VMS/VMU file, and known tags from the NDPI file, are encoded as properties prefixed with ”hamamatsu.”. openslide.mpp-x for NDPI, calculated as 10000/tiff.XResolution, if tiff.ResolutionUnit is centimeter openslide.mpp-y for NDPI, calculated as 10000/tiff.YResolution, if tiff.ResolutionUnit is centimeter openslide.objective-power normalized hamamatsu.SourceLens Test Data NDPI format http://openslide.cs.cmu.edu/download/openslide-testdata/Hamamatsu/ VMS format http://openslide.cs.cmu.edu/download/openslide-testdata/Hamamatsu-vms/
APERIO FORMAT
Format single-file pyramidal tiled TIFF, with non-standard metadata and compression File extensions .svs, .tif OpenSlide vendor backend aperio Vendor Documentation http://www.aperio.com/documents/api/Aperio_Digital_Slides_and_Third-party_data_interchange.pdf Detection Aperio slides are stored in single-file TIFF format. OpenSlide will detect a file as Aperio if: 1. The file is TIFF. 2. The initial image is tiled. 3. The ImageDescription tag starts with Aperio. Relevant TIFF tags ┌─────────────────┬──────────────────────────────────┐ │Tag │ Description │ ├─────────────────┼──────────────────────────────────┤ │ImageDescription │ Stores some important key-value │ │ │ pairs and other information, see │ │ │ below │ ├─────────────────┼──────────────────────────────────┤ │Compression │ May be 33003 or 33005, which │ │ │ represent specific kinds of JPEG │ │ │ 2000 compression, see below │ └─────────────────┴──────────────────────────────────┘ Extra data stored in ImageDescription For tiled images, the ImageDescription tag contains some dimensional downsample information as well as what look like offsets. Additionally, vertical line-delimited key-value pairs are stored, in at least the full-resolution image. A key-value pair is equals-delimited. These key-values are stored as properties starting with ”aperio.”. Currently, OpenSlide does not use any of the information present in these key-value fields. For stripped images, the ImageDescription tag may contain a name, followed by a carriage return. This is used for naming the associated images. The second image in the file does not have a name, though it is an associated image. TIFF Image Directory Organization http://www.aperio.com/documents/api/Aperio_Digital_Slides_and_Third-party_data_interchange.pdf page 14: The first image in an SVS file is always the baseline image (full resolution). This image is always tiled, usually with a tile size of 240x240 pixels. The second image is always a thumbnail, typically with dimensions of about 1024x768 pixels. Unlike the other slide images, the thumbnail image is always stripped. Following the thumbnail there may be one or more intermediate “pyramid” images. These are always compressed with the same type of compression as the baseline image, and have a tiled organization with the same tile size. Optionally at the end of an SVS file there may be a slide label image, which is a low resolution picture taken of the slide’s label, and/or a macro camera image, which is a low resolution picture taken of the entire slide. The label and macro images are always stripped. JPEG 2000 (compression types 33003 or 33005) Some Aperio files use compression type 33003 or 33005. Images using this compression need to be decoded as a JPEG 2000 codestream. For 33003: YCbCr format, possibly with a chroma subsampling of 4:2:2. For 33005: RGB format. Note that the TIFF file may not encode the colorspace or subsampling parameters in the PhotometricInterpretation field, nor the YCbCrSubsampling field, even though the TIFF standard seems to require this. The correct subsampling can be found in the JPEG 2000 codestream. Associated Images thumbnail the second image in the file label optional, the name “label” is given in ImageDescription macro optional, the name “macro” is given in ImageDescription Known Properties All key-value data encoded in the ImageDescription TIFF field is represented as properties prefixed with ”aperio.”. openslide.mpp-x normalized aperio.MPP openslide.mpp-y normalized aperio.MPP openslide.objective-power normalized aperio.AppMag Test Data http://openslide.cs.cmu.edu/download/openslide-testdata/Aperio/
MIRAX FORMAT
Format multi-file with very complicated proprietary metadata and indexes File extensions .mrxs OpenSlide vendor backend mirax Detection OpenSlide will detect a file as MIRAX if: 1. The file is not a TIFF. 2. The filename ends with .mrxs. 3. A directory exists in the same location as the file, with the same name as the file minus the extension. 4. A file named Slidedat.ini exists in the directory. Overview MIRAX can store slides in JPEG, PNG, or BMP formats. Because JPEG does not allow for large images, and JPEG and PNG provide very poor support for random-access decoding of part of an image, multiple images are needed to encode a slide. To avoid having many individual files, MIRAX packs these images into a small number of data files. The index file provides offsets into the data files for each required piece of data. The camera on MIRAX scanners takes overlapping photos and records the position of each one. Each photo is then split into multiple images which do not overlap. Overlaps only occur between images that come from different photos. To generate level n + 1, each image from level n is downsampled by 2 and then concatenated into a new image, 4 old images per new image (2 x 2). This process is repeated for each level, irrespective of image overlaps. Therefore, at sufficiently high levels, a single image can contain one or more embedded overlaps of non-integral width. Index File The index file starts with a five-character ASCII version string, followed by the SLIDE_ID from the slidedat file. The rest of the file consists of 32-bit little-endian integers (unaligned), which can be data values or pointers to byte offsets within the index file. The first two integers point to offset tables for the hierarchical and nonhierarchical roots, respectively. These tables contain one record for each VAL in the HIERARCHICAL slidedat section. For example, the record for NONHIER_1_VAL_2 would be stored at nonhier_root + 4 * (NONHIER_0_COUNT + 2). Each record is a pointer to a linked list of data pages. The first two values in a data page are the number of data items in the page and a pointer to the next page. The first page always has 0 data items, and the last page has a 0 next pointer. There is one hierarchical record for each zoom level. The record contains data items consisting of an image index, offset and length within a file, and a file number. The file number can be converted to a data file name via the DATAFILE slidedat section. The image index is equal to image_y * GENERAL.IMAGENUMBER_X + image_x. Image coordinates which are not multiples of the zoom level’s downsample factor are omitted. Nonhierarchical records refer to associated images and additional metadata. Nonhierarchical data items consist of three zero values followed by an offset, length, and file number as in hierarchical records. Data Files A data file begins with a header containing a five-character ASCII version string, the SLIDE_ID from the slidedat file, the file number encoded into three ASCII characters, and 256 bytes of padding. The remainder of the file contains packed data referenced by the index file. Slide Position File The slide position file is referenced by the VIMSLIDE_POSITION_BUFFER.default nonhierarchical section. It contains one entry for each camera position (not each image position) in row-major order. Each entry is nine bytes: a flag byte, the X pixel coordinate of the photo (4 bytes, little-endian, may be negative), and the Y coordinate (4 bytes, little-endian, may be negative). In slides with CURRENT_SLIDE_VERSION ≥ 1.9, the flag byte is 1 if the slide file contains images for this camera position, 0 otherwise. In older slides, the flag byte is always 0. In slides with CURRENT_SLIDE_VERSION ≥ 2.2, the slide position file is compressed with DEFLATE and referenced by the StitchingIntensityLayer.StitchingIntensityLevel nonhierarchical section. Associated Images thumbnail the image named ”ScanDataLayer_SlidePreview” in Slidedat.ini (optional) label the image named ”ScanDataLayer_SlideBarcode” in Slidedat.ini (optional) macro the image named ”ScanDataLayer_SlideThumbnail” in Slidedat.ini (optional) Known Properties All key-value data stored in the Slidedat.ini file are encoded as properties prefixed with ”mirax.”. openslide.mpp-x normalized MICROMETER_PER_PIXEL_X from the Slidedat section corresponding to level 0 (typically mirax.LAYER_0_LEVEL_0_SECTION.MICROMETER_PER_PIXEL_X) openslide.mpp-y normalized MICROMETER_PER_PIXEL_Y from the Slidedat section corresponding to level 0 (typically mirax.LAYER_0_LEVEL_0_SECTION.MICROMETER_PER_PIXEL_Y) openslide.objective-power normalized mirax.GENERAL.OBJECTIVE_MAGNIFICATION See Also Introduction to MIRAX/MRXS[3]. Note that our terminology has changed since that document was written; where it says “tile”, substitute “image”, and where it says “subtile”, substitute “tile”. Test Data http://openslide.cs.cmu.edu/download/openslide-testdata/Mirax/
SAKURA FORMAT
Format SQLite database containing pyramid tiles and metadata File extensions .svslide OpenSlide vendor backend sakura Detection OpenSlide will detect a file as Sakura if: 1. The file is a SQLite database. 2. The DataManagerSQLiteConfigXPO table contains exactly one row, and its TableName field refers to a unique table. 3. The unique table contains a row with id = "++MagicBytes" and data = "SVGigaPixelImage". File Organization Sakura slides are SQLite 3 database files written by the eXpress Persistent Objects ORM. Tables contain slide metadata, associated images, and JPEG tiles. Tiles are addressed as (downsample, level-0 X coordinate, level-0 Y coordinate, color channel), with separate grayscale JPEGs for each color channel. Despite the generality of the address format, tiles appear to be organized in a regular grid, with power-of-two level downsamples and without overlapping tiles. The structure of the file allows scans to be sparse, but it is not clear if this is actually done. SQL Tables Some irrelevant tables and columns have been omitted from the summary below. DataManagerSQLiteConfigXPO.PP Useful only to get a reference to the unique table. OpenSlide requires this table to contain exactly one row. ┌──────────┬──────┬────────────────────────┐ │Column │ Type │ Description │ ├──────────┼──────┼────────────────────────┤ │TableName │ text │ Name of the unique │ │ │ │ table, described below │ └──────────┴──────┴────────────────────────┘ SVSlideDataXPO.PP High-level metadata about a slide. OpenSlide assumes this table will contain exactly one row. ┌───────────────────┬─────────┬──────────────────────────┐ │Column │ Type │ Description │ ├───────────────────┼─────────┼──────────────────────────┤ │OID │ integer │ Primary key │ ├───────────────────┼─────────┼──────────────────────────┤ │m_labelScan │ integer │ Foreign key to label │ │ │ │ associated image in │ │ │ │ SVScannedImageDataXPO │ ├───────────────────┼─────────┼──────────────────────────┤ │m_overviewScan │ integer │ Foreign key to macro │ │ │ │ associated image in │ │ │ │ SVScannedImageDataXPO │ ├───────────────────┼─────────┼──────────────────────────┤ │SlideId │ text │ UUID │ ├───────────────────┼─────────┼──────────────────────────┤ │Date │ text │ File creation date? │ ├───────────────────┼─────────┼──────────────────────────┤ │Description │ text │ Descriptive text? │ ├───────────────────┼─────────┼──────────────────────────┤ │Creator │ text │ Author? │ ├───────────────────┼─────────┼──────────────────────────┤ │DiagnosisCode │ text │ Unknown, have seen “0” │ ├───────────────────┼─────────┼──────────────────────────┤ │HRScanCount │ integer │ Presumably the number of │ │ │ │ corresponding rows in │ │ │ │ SVHRScanDataXPO │ ├───────────────────┼─────────┼──────────────────────────┤ │Keywords │ text │ Descriptive text? │ ├───────────────────┼─────────┼──────────────────────────┤ │TotalDataSizeBytes │ integer │ Presumably the sum of │ │ │ │ TotalDataSizeBytes in │ │ │ │ corresponding │ │ │ │ SVHRScanDataXPO rows │ └───────────────────┴─────────┴──────────────────────────┘ SVHRScanDataXPO.PP A single high-resolution scan of a slide from SVSlideDataXPO. OpenSlide assumes this table will contain exactly one row. ┌─────────────────────────┬─────────┬─────────────────────────┐ │Column │ Type │ Description │ ├─────────────────────────┼─────────┼─────────────────────────┤ │OID │ integer │ Primary key │ ├─────────────────────────┼─────────┼─────────────────────────┤ │ParentSlide │ integer │ Foreign key to │ │ │ │ SVSlideDataXPO │ ├─────────────────────────┼─────────┼─────────────────────────┤ │ScanId │ text │ UUID │ ├─────────────────────────┼─────────┼─────────────────────────┤ │Date │ text │ Scan date? │ ├─────────────────────────┼─────────┼─────────────────────────┤ │Description │ text │ Descriptive text? │ ├─────────────────────────┼─────────┼─────────────────────────┤ │Name │ text │ Scan name? │ ├─────────────────────────┼─────────┼─────────────────────────┤ │PosOnSlideMm │ blob │ 16 bytes of binary │ ├─────────────────────────┼─────────┼─────────────────────────┤ │ResolutionMmPerPix │ real │ Millimeters per pixel │ ├─────────────────────────┼─────────┼─────────────────────────┤ │NominalLensMagnification │ real │ Objective power │ ├─────────────────────────┼─────────┼─────────────────────────┤ │ThumbnailImage │ blob │ thumbnail associated │ │ │ │ image data │ ├─────────────────────────┼─────────┼─────────────────────────┤ │TotalDataSizeBytes │ integer │ Same as TOTAL_SIZE blob │ │ │ │ in unique table │ ├─────────────────────────┼─────────┼─────────────────────────┤ │FocussingMethod │ integer │ Unknown; have seen “1” │ ├─────────────────────────┼─────────┼─────────────────────────┤ │FocusStack │ blob │ 8 bytes; have seen all │ │ │ │ zeros │ └─────────────────────────┴─────────┴─────────────────────────┘ SVScannedImageDataXPO.PP Contains associated images other than the thumbnail. ┌───────────────────┬─────────┬───────────────────────┐ │Column │ Type │ Description │ ├───────────────────┼─────────┼───────────────────────┤ │OID │ integer │ Primary key │ ├───────────────────┼─────────┼───────────────────────┤ │Id │ text │ UUID │ ├───────────────────┼─────────┼───────────────────────┤ │PosOnSlideMm │ blob │ 16 bytes of binary │ ├───────────────────┼─────────┼───────────────────────┤ │ScanCenterPosMm │ blob │ 16 bytes of binary │ ├───────────────────┼─────────┼───────────────────────┤ │ResolutionMmPerPix │ real │ Millimeters per pixel │ ├───────────────────┼─────────┼───────────────────────┤ │Image │ blob │ JPEG image data │ ├───────────────────┼─────────┼───────────────────────┤ │ThumbnailImage │ blob │ Low-resolution JPEG │ │ │ │ thumbnail │ └───────────────────┴─────────┴───────────────────────┘ tile.PP This table is most naturally used to map tile coordinates to tile IDs, but is not suitable for individual lookups because it has no useful indexes. ┌─────────────┬─────────┬─────────────────────────┐ │Column │ Type │ Description │ ├─────────────┼─────────┼─────────────────────────┤ │TILEID │ text │ Foreign key to unique │ │ │ │ table │ ├─────────────┼─────────┼─────────────────────────┤ │PYRAMIDLEVEL │ integer │ Downsample of the │ │ │ │ pyramid level │ ├─────────────┼─────────┼─────────────────────────┤ │COLUMNINDEX │ integer │ Level-0 X coordinate of │ │ │ │ the top-left corner of │ │ │ │ the tile │ ├─────────────┼─────────┼─────────────────────────┤ │ROWINDEX │ integer │ Level-0 Y coordinate of │ │ │ │ the top-left corner of │ │ │ │ the tile │ ├─────────────┼─────────┼─────────────────────────┤ │COLORINDEX │ integer │ 0 for red, 1 for green, │ │ │ │ 2 for blue │ └─────────────┴─────────┴─────────────────────────┘ Unique table This is the table named by DataManagerSQLiteConfigXPO.TableName. It contains named blobs including the JPEG tile data. ┌───────┬─────────┬──────────────────────┐ │Column │ Type │ Description │ ├───────┼─────────┼──────────────────────┤ │id │ text │ Primary key │ ├───────┼─────────┼──────────────────────┤ │size │ integer │ Length of data field │ ├───────┼─────────┼──────────────────────┤ │data │ blob │ Data item │ └───────┴─────────┴──────────────────────┘ This table stores a variety of blob types. IDs for image tiles appear to be mechanically generated from the tile coordinates, but OpenSlide does not construct them directly. Instead, it uses the tile table to obtain IDs for each tile. ┌───────────────────┬──────────────────────────────────┐ │id │ Description │ ├───────────────────┼──────────────────────────────────┤ │++MagicBytes │ SVGigaPixelImage │ ├───────────────────┼──────────────────────────────────┤ │++VersionBytes │ Format version, e.g. 1.0.0 │ ├───────────────────┼──────────────────────────────────┤ │Header │ A small binary structure. The │ │ │ first 12 bytes are little-endian │ │ │ 32-bit integers: tile size in │ │ │ pixels, image width in pixels, │ │ │ image height in pixels. │ ├───────────────────┼──────────────────────────────────┤ │TOTAL_SIZE │ The data field is empty. The │ │ │ size field is the sum of all │ │ │ other size fields except │ │ │ ++MagicBytes and ++VersionBytes. │ ├───────────────────┼──────────────────────────────────┤ │T;2048|4096;4;2;0 │ Image tile with downsample 4, X │ │ │ coordinate 2048, Y coordinate │ │ │ 4096, channel 2 (blue) │ ├───────────────────┼──────────────────────────────────┤ │T;2048|4096;4;2;0# │ MD5 hash of the │ │ │ T;2048|4096;4;2;0 image tile │ └───────────────────┴──────────────────────────────────┘ Associated Images label SVScannedImageDataXPO.Image corresponding to SVSlideDataXPO.m_labelScan macro SVScannedImageDataXPO.Image corresponding to SVSlideDataXPO.m_overviewScan thumbnail SVHRScanDataXPO.ThumbnailImage Known Properties sakura.Creator SVSlideDataXPO.Creator sakura.Date SVSlideDataXPO.Date sakura.Description SVSlideDataXPO.Description sakura.DiagnosisCode SVSlideDataXPO.DiagnosisCode sakura.FocussingMethod SVHRScanDataXPO.FocussingMethod sakura.Keywords SVSlideDataXPO.Keywords sakura.NominalLensMagnification SVHRScanDataXPO.NominalLensMagnification sakura.ResolutionMmPerPix SVHRScanDataXPO.ResolutionMmPerPix sakura.ScanId SVHRScanDataXPO.ScanId sakura.SlideId SVSlideDataXPO.SlideId openslide.mpp-x calculated as 1000 * sakura.ResolutionMmPerPix openslide.mpp-y calculated as 1000 * sakura.ResolutionMmPerPix openslide.objective-power normalized sakura.NominalLensMagnification Test Data No public data available. Contact the mailing list[4] if you have some.
GENERIC TILED TIFF FORMAT
Format single-file pyramidal tiled TIFF File extensions .tif OpenSlide vendor backend generic-tiff Detection OpenSlide will detect a file as generic TIFF if: 1. No other detections succeed. 2. The file is TIFF. 3. The initial image is tiled. TIFF Image Directory Organization The first image in the TIFF file is the full-resolution image. Any other tiled images in the file with the “reduced resolution” bit set are assumed to be reduced-resolution versions of the original. Associated Images None. Known Properties Many TIFF tags are encoded as properties starting with ”tiff.”.
AUTHORS
The Carnegie Mellon School of Computer Science. This manual page was written by Mathieu Malaterre <malat@debian.org> for the Debian GNU/Linux system (but may be used by others).
NOTES
1. Supported Virtual Slide Formats http://openslide.org/formats/ 2. issue 21 http://openslide.orghttps://github.com/openslide/openslide/issues/21#issuecomment-23615583 3. Introduction to MIRAX/MRXS http://lists.andrew.cmu.edu/pipermail/openslide-users/2012-July/000373.html 4. mailing list http://lists.andrew.cmu.edu/mailman/listinfo/openslide-users/