Provided by: samtools_1.10-3_amd64 bug

NAME

       samtools view - views and converts SAM/BAM/CRAM files

SYNOPSIS

       view samtools view [options] in.sam|in.bam|in.cram [region...]

DESCRIPTION

       With  no  options  or  regions  specified,  prints  all  alignments in the specified input
       alignment file (in SAM, BAM, or CRAM format) to standard output in  SAM  format  (with  no
       header).

       You may specify one or more space-separated region specifications after the input filename
       to restrict output to only those alignments which overlap the specified region(s). Use  of
       region  specifications requires a coordinate-sorted and indexed input file (in BAM or CRAM
       format).

       The -b, -C, -1, -u, -h, -H, and -c options change the output format from  the  default  of
       headerless SAM, and the -o and -U options set the output file name(s).

       The  -t  and  -T  options  provide  additional reference data. One of these two options is
       required when SAM input does not contain @SQ  headers,  and  the  -T  option  is  required
       whenever writing CRAM output.

       The  -L,  -M, -r, -R, -d, -D, -s, -q, -l, -m, -f, -F, and -G options filter the alignments
       that will be included in the output to only those alignments that match certain criteria.

       The -x and -B options modify the data which is contained in each alignment.

       The -X option can be used to allow user to specify customized index  file  location(s)  if
       the  data  folder  does  not  contain  any  index file. See EXAMPLES section for sample of
       useage.

       Finally, the -@ option can  be  used  to  allocate  additional  threads  to  be  used  for
       compression, and the -?  option requests a long help message.

       REGIONS:
              Regions can be specified as: RNAME[:STARTPOS[-ENDPOS]] and all position coordinates
              are 1-based.

              Important note: when multiple regions are given,  some  alignments  may  be  output
              multiple times if they overlap more than one of the specified regions.

              Examples of region specifications:

              chr1      Output all alignments mapped to the reference sequence named `chr1' (i.e.
                        @SQ SN:chr1).

              chr2:1000000
                        The region on chr2 beginning at base position 1,000,000 and ending at the
                        end of the chromosome.

              chr3:1000-2000
                        The  1001bp region on chr3 beginning at base position 1,000 and ending at
                        base position 2,000 (including both end positions).

              '*'       Output the unmapped reads at the end of the file.  (This does not include
                        any  unmapped reads placed on a reference sequence alongside their mapped
                        mates.)

              .         Output all alignments.  (Mostly unnecessary as not specifying a region at
                        all has the same effect.)

OPTIONS

       -b        Output in the BAM format.

       -C        Output in the CRAM format (requires -T).

       -1        Enable fast BAM compression (implies -b).

       -u        Output    uncompressed    BAM.    This    option    saves    time    spent    on
                 compression/decompression and is thus preferred when  the  output  is  piped  to
                 another samtools command.

       -h        Include the header in the output.

       -H        Output the header only.

       -c        Instead  of printing the alignments, only count them and print the total number.
                 All filter options, such as -f, -F, and -q, are taken into account.

       -?        Output long help and exit immediately.

       -o FILE   Output to FILE [stdout].

       -U FILE   Write alignments that are not selected by the various filter  options  to  FILE.
                 When  this  option  is  used, all alignments (or all alignments intersecting the
                 regions specified) are written to either the output file or this file, but never
                 both.

       -t FILE   A  tab-delimited  FILE.   Each line must contain the reference name in the first
                 column and the length of the reference in the second column, with one  line  for
                 each  distinct  reference.   Any  additional fields beyond the second column are
                 ignored. This file also defines the order of the reference sequences in sorting.
                 If you run: `samtools faidx <ref.fa>', the resulting index file <ref.fa>.fai can
                 be used as this FILE.

       -T FILE   A FASTA format reference  FILE,  optionally  compressed  by  bgzip  and  ideally
                 indexed  by  samtools  faidx.  If an index is not present, one will be generated
                 for you.

       -L FILE   Only output alignments overlapping the input BED FILE [null].

       -M        Use the multi-region iterator on the union of a BED file and command-line region
                 arguments.  This avoids re-reading the same regions of files so can sometimes be
                 much faster.  Note this  also  removes  duplicate  sequences.   Without  this  a
                 sequence  that  overlaps  multiple regions specified on the command line will be
                 reported multiple times.  The usage of a BED file is optional and its  path  has
                 to be preceded by -L option.

       -r STR    Output  alignments  in  read group STR [null].  Note that records with no RG tag
                 will also be output when using this option.  This  behaviour  may  change  in  a
                 future release.

       -R FILE   Output  alignments in read groups listed in FILE [null].  Note that records with
                 no RG tag will also be output when using this option.  This behaviour may change
                 in a future release.

       -d STR:STR
                 Only output alignments with tag STR and associated value STR [null].

       -D STR:FILE
                 Only output alignments with tag STR and associated values listed in FILE [null].

       -q INT    Skip alignments with MAPQ smaller than INT [0].

       -l STR    Only output alignments in library STR [null].

       -m INT    Only output alignments with number of CIGAR bases consuming query sequence ≥ INT
                 [0]

       -f INT    Only output alignments with all bits set in INT present in the FLAG field.   INT
                 can be specified in hex by beginning with `0x' (i.e. /^0x[0-9A-F]+/) or in octal
                 by beginning with `0' (i.e. /^0[0-7]+/) [0].

       -F INT    Do not output alignments with any bits set in INT present  in  the  FLAG  field.
                 INT  can  be specified in hex by beginning with `0x' (i.e. /^0x[0-9A-F]+/) or in
                 octal by beginning with `0' (i.e. /^0[0-7]+/) [0].

       -G INT    Do not output alignments with all bits set in INT present  in  the  FLAG  field.
                 This  is  the  opposite of -f such that -f12 -G12 is the same as no filtering at
                 all.  INT can be specified in hex by beginning with `0x'  (i.e.  /^0x[0-9A-F]+/)
                 or in octal by beginning with `0' (i.e. /^0[0-7]+/) [0].

       -x STR    Read tag to exclude from output (repeatable) [null]

       -B        Collapse the backward CIGAR operation.

       -s FLOAT  Output  only a proportion of the input alignments.  This subsampling acts in the
                 same way on all of the alignment records in the same template or read  pair,  so
                 it never keeps a read but not its mate.

                 The  integer and fractional parts of the -s INT.FRAC option are used separately:
                 the part after the decimal point sets the  fraction  of  templates/pairs  to  be
                 kept,  while  the integer part is used as a seed that influences which subset of
                 reads is kept.

                 When subsampling data that has previously been subsampled,  be  sure  to  use  a
                 different  seed  value  from those used previously; otherwise more reads will be
                 retained than expected.

       -@ INT    Number of BAM compression threads to use in addition to main thread [0].

       -S        Ignored for compatibility with  previous  samtools  versions.   Previously  this
                 option  was  required  if input was in SAM format, but now the correct format is
                 automatically detected by examining the first few characters of input.

       -X        Include customized index file as a part of arugments. See EXAMPLES  section  for
                 sample of useage.

       --no-PG   Do not add a @PG line to the header of the output file.

EXAMPLES

       o Import SAM to BAM when @SQ lines are present in the header:

           samtools view -bS aln.sam > aln.bam

         If @SQ lines are absent:

           samtools faidx ref.fa
           samtools view -bt ref.fa.fai aln.sam > aln.bam

         where ref.fa.fai is generated automatically by the faidx command.

       o Convert a BAM file to a CRAM file using a local reference sequence.

           samtools view -C -T ref.fa aln.bam > aln.cram

       o Convert a BAM file to a CRAM with NM and MD tags stored verbatim rather than calculating
         on the fly during CRAM decode, so that mixed data sets with MD/NM only on some  records,
         or NM calculated using different definitions of mismatch, can be decoded without change.
         The second command demonstrates how to decode such a file.  The request to not decode MD
         here is turning off auto-generation of both MD and NM; it will still emit the MD/NM tags
         on records that had these stored verbatim.

           samtools view -C --output-fmt-option store_md=1 --output-fmt-option store_nm=1 -o aln.cram aln.bam
           samtools view --input-fmt-option decode_md=0 -o aln.new.bam aln.cram

       o An alternative way of  achieving  the  above  is  listing  multiple  options  after  the
         --output-fmt or -O option.  The commands below are equivalent to the two above.

           samtools view -O cram,store_md=1,store_nm=1 -o aln.cram aln.bam
           samtools view --input-fmt cram,decode_md=0 -o aln.new.bam aln.cram

       o Include customized index file as a part of arugments.

           samtools view [options] -X /data_folder/data.bam /index_folder/data.bai chrM:1-10

       o Output  alignments  in  read  group  grp2  (records  with  no RG tag will also be in the
         output).

           samtools view -r grp2 -o /data_folder/data.rg2.bam /data_folder/data.bam

       o Only keep reads with tag BC and were the barcode matches  the  barcodes  listed  in  the
         barcode file.

           samtools view -D BC:barcodes.txt -o /data_folder/data.barcodes.bam /data_folder/data.bam

       o Only keep reads with tag RG and read group grp2.  This does almost the same than -r grp2
         but will not keep records without the RG tag.

           samtools view -d RG:grp2 -o /data_folder/data.rg2_only.bam /data_folder/data.bam

AUTHOR

       Written by Heng Li from the Sanger Institute.

SEE ALSO

       samtools(1), samtools-tview(1), sam(5)

       Samtools website: <http://www.htslib.org/>