Skip to content
Martin Asser Hansen edited this page Oct 2, 2015 · 9 revisions

Biopiece: write_kiss

Description

KISS .

The KISS format (Keep it Simple Stupid) is a text based data format for describing generic feature information in a simple format with one feature per line in 12 tab-separated columns:

  1. S_ID: Subject ID - e.g. chr12.
  2. S_BEG: Begin position of a feature relating to the subject sequence. 0-based.
  3. S_END: End position of a feature relating to the subject sequence.
  4. Q_ID: Query ID - e.g. a Solexa read ID e.g. a3_2VCOjxwXsN1
  5. SCORE: A float that can describe e.g. a BLAT score.
  6. STRAND: Denotes which strand a feature relates to. + or -.
  7. HITS: Number of times a feature is found in the subject sequence.
  8. ALIGN: Comma-separated list of alignment descriptors for mismatches, insertions, and deletions *).
  9. BLOCK_COUNT: Number of blocks in a feature (i.e. exons).
  10. BLOCK_BEGS: Comma-separated list of block begin positions. Offset is S_BEG.
  11. BLOCK_LENS: Comma-separated list of block lengths.
  12. BLOCK_TYPE: Comma-separated list of block types (0=Gap,1=Non-gap,2=CDS,3=5'UTR,4=3'UTR).

Values in fields 4-12 are optional and empty fields must contain a '.'.

*) Alignment descriptors:

  • mismatch: (offset:S-base>Q-base) - e.g. 0:C>T,13:G>C
  • insertion: (offset:->Q-base) - e.g. 8:->G,18:->A
  • deletions: (offset:S-base>-) - e.g. 5:A>-,16:T>-

The offset position is based on S_BEG and do not change with insertions or deletions. Alignment descriptors are based on the + strand.

Descriptors should be sorted by offset postion.

Read more about the KISS format here:

http://code.google.com/p/biopieces/wiki/KissFormat

Usage

... | write_kiss [options]

Options

[-?         | --help]               #  Print full usage description.
[-x         | --no_stream]          #  Do not emit records.
[-o <file>  | --data_out=<file>]    #  Write result to file.
[-I <file!> | --stream_in=<file!>]  #  Read input from stream file  -  Default=STDIN
[-O <file>  | --stream_out=<file>]  #  Write output to stream file  -  Default=STDOUT
[-Z         | --compress]           #  Compress output using gzip.
[-v         | --verbose]            #  Verbose output.

Examples

... | write_kiss -xo test.kiss

See also

read_kiss

Author

Martin Asser Hansen - Copyright (C) - All rights reserved.

[email protected]

October 2009

License

GNU General Public License version 2

http://www.gnu.org/copyleft/gpl.html

Help

write_kiss is part of the Biopieces framework.

http://www.biopieces.org

Clone this wiki locally