find_gaps

Biopiece: find_gaps

Description

find_gaps locates stretches of N's (case insensitive) in sequences in the stream. For each stretch of N's a record is output like this:

S_ID: chr1
S_BEG: 123
S_END: 12345
S_LEN: 12222
---

Usage

... | find_gaps [options]

Options

[-?          | --help]               #  Print full usage description.
[-m <uint>   | --min_len=<uint>]     #  Minimum gap length             -  Default=5
[-I <file!>  | --stream_in=<file!>]  #  Read input from stream file    -  Default=STDIN
[-O <file>   | --stream_out=<file>]  #  Write output to stream file    -  Default=STDOUT
[-v          | --verbose]            #  Verbose output.

Examples

Here is how you find gaps in a genome read in with read_fasta and the result is written to KISS format with write_kiss:

read_fasta -i genome.fna | find_gaps | write_kiss -x

Author

[email protected]

March 2010

License

GNU General Public License version 2

http://www.gnu.org/copyleft/gpl.html

Help

find_gaps is part of the Biopieces framework.

http://www.biopieces.org

Provide feedback

Saved searches

Use saved searches to filter your results more quickly