Provided by: genometools_1.5.1-2ubuntu1_amd64
NAME
gt-csa - Transform spliced alignments from GFF3 file into consensus spliced alignments.
SYNOPSIS
gt csa [option ...] [GFF3_file]
OPTIONS
-join-length set join length for the spliced alignment clustering (default: 300) -v be verbose (default: no) -o redirect output to specified file (default: undefined) -gzip write gzip compressed output file (default: no) -bzip2 write bzip2 compressed output file (default: no) -force force writing to output file (default: no) -help display help and exit -version display version information and exit
DESCRIPTION
Let’s assume we have a GFF3 file csa_example_spliced_alignments.gff3 containing the following four overlapping spliced alignments (represented as genes with exons as children): ##gff-version 3 ##sequence-region seq 1 290 seq . gene 1 209 . + . ID=gene1 seq . exon 1 90 . + . Parent=gene1 seq . exon 110 190 . + . Parent=gene1 seq . exon 201 209 . + . Parent=gene1 ### seq . gene 1 290 . + . ID=gene2 seq . exon 1 90 . + . Parent=gene2 seq . exon 101 190 . + . Parent=gene2 seq . exon 201 290 . + . Parent=gene2 ### seq . gene 10 290 . + . ID=gene3 seq . exon 10 90 . + . Parent=gene3 seq . exon 110 190 . + . Parent=gene3 seq . exon 201 290 . + . Parent=gene3 ### seq . gene 181 290 . + . ID=gene4 seq . exon 181 190 . + . Parent=gene4 seq . exon 201 290 . + . Parent=gene4 ### To compute the consensus spliced alignments we call: $ gt csa csa_example_spliced_alignments.gff3 Which returns: ##gff-version 3 ##sequence-region seq 1 290 seq gt csa gene 1 290 . + . ID=gene1 seq gt csa mRNA 1 290 . + . ID=mRNA1;Parent=gene1 seq gt csa exon 1 90 . + . Parent=mRNA1 seq gt csa exon 110 190 . + . Parent=mRNA1 seq gt csa exon 201 290 . + . Parent=mRNA1 seq gt csa mRNA 1 290 . + . ID=mRNA2;Parent=gene1 seq gt csa exon 1 90 . + . Parent=mRNA2 seq gt csa exon 101 190 . + . Parent=mRNA2 seq gt csa exon 201 290 . + . Parent=mRNA2 ### As one can see, they have been combined into a consensus spliced alignment (represented as genes with mRNAs as children which in turn have exons as children) with two alternative splice forms. The first and the third spliced alignment have been combined into the first alternative splice form (mRNA1) and the the second and the fourth spliced alignment into the second alternative splice form (mRNA2). As one can see, the second exon from the first alternative splice form ist shorter than the corresponding exon from the second alternative splice form.
AUTHOR
Report bugs to <gt-users@genometools.org>. 10/10/2012 GT-CSA(1)