SbtPlayData

Very small examples for testing Sequence Bloom Trees and similar data structures

The example1 directory contains five fastq files representing sequencing experiments. These non-paired reads were simulated by sampling at 7X coverage from five fake genomes. Simulated sequencing error was about 1% mismatches with no indels.

The five genomes have lengths ranging from about 11K bp to 21K bp. These contain a mix of random sequence data and some transcript-like sequences. Some "transcripts" are present in more than one genome, albeit with a 2% mutation rate.

The example2 directory is similar to example1 but was generated with different random seeding from fewer fake transcripts. The transcripts file is also included.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

SbtPlayData

Files

README.md

Latest commit

History

README.md

File metadata and controls

SbtPlayData