-
Notifications
You must be signed in to change notification settings - Fork 23
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge branch 'master' of github.com:maasha/biopieces
- Loading branch information
Showing
2 changed files
with
8 additions
and
6 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1 +1 @@ | ||
{"name":"www.biopieces.org","tagline":"Biopieces is a bioinformatic framework of tools easily used and easily created.","body":"![Logo](https://raw.githubusercontent.com/wiki/maasha/biopieces/bp_logo_50.png)\r\n\r\nThe Biopieces are a collection of bioinformatics tools that can be pieced together in a very easy and flexible manner to perform both simple and complex tasks. The Biopieces work on a data stream in such a way that the data stream can be passed through several different Biopieces, each performing one specific task: modifying or adding records to the data stream, creating plots, or uploading data to databases and web services. The Biopieces are executed in a command line environment where the data stream is initialized by specific Biopieces which read data from files, databases, or web services, and output records to the data stream that is passed to downstream Biopieces until the data stream is terminated at the end of the analysis as outlined below:\r\n\r\n```\r\nread_data | calculate_something | write_results\r\n```\r\n\r\nThe following example demonstrates how a next generation sequencing experiment can be cleaned and analyzed – including plotting of scores and length distribution, removal of adaptor sequence, trimming and filtering using quality scores, mapping to a specified genome, and uploading the data to the UCSC genome browser for further analysis:\r\n\r\n```\r\nread_fastq -i data.fq | # Initialize data stream from a FASTQ file.\r\nplot_scores -t png -o scores_unclean.png | # Plot scores before cleaning.\r\nfind_adaptor -c 24 -a TCGTATGCCGTCTTC -p | # Locate adaptor - including partial adaptor.\r\nclip_adaptor | # Clip any located adaptor.\r\ntrim_seq | # End trim sequences according to quality scores.\r\ngrab -e 'SEQ_LEN > 18' # Filter short sequences.\r\nmean_scores -l | # Locate local quality score minima.\r\ngrab -e 'SCORES_MEAN_LOCAL >= 15' | # Filter low local quality score minima.\r\nwrite_fastq -o data_clean.fq | # Write the cleaned data to a FASTQ file.\r\nplot_scores -t png -o scores_clean.png | # Plot scores after cleaning.\r\nplot_distribution -k SEQ_LEN -t png -o lengths.png | # Plot sequence length distribution.\r\nbowtie_seq -c 24 -g hg19 -m 2 | # Map sequences to the human genome with Bowtie.\r\nupload_to_ucsc –d hg19 –t my_data –x # Upload the results to the UCSC Genome Browser.\r\n```\r\n\r\nThe advantage of the Biopieces is that a user can easily solve simple and complex tasks without having any programming experience. Moreover, since the data format used to pass data between Biopieces is text based, different developers can quickly create new Biopieces in their favorite programming language - and all the Biopieces will maintain compatibility. Finally, templates exist for creating new Biopieces in Perl and Ruby.\r\n\r\nThere are currently ~190 Biopieces.\r\n\r\n* To learn more about Biopieces have a look at the [Biopieces Introduction](Introduction).\r\n* To browse the available Biopieces see the top right side bar.\r\n* If you want to install the Biopieces go to the Biopieces [Installation Instructions](Installation).\r\n* If you want to contribute Biopieces go to the [Biopieces HowTo](HowTo).\r\n* Browse publications using Biopieces [here](http://scholar.google.dk/scholar?hl=en&q=biopieces).\r\n\r\nFor important messages, questions, discussion, and suggestions join the [Biopieecs Google Group](http://groups.google.com/group/biopieces).\r\n\r\nBiopieces was developed with support from the Danish Agency for Science, Technology and Innovation (grant no 272-06-0325).\r\n","google":"UA-390268-5","note":"Don't delete this file! It's used internally to help with page regeneration."} | ||
{"name":"www.biopieces.org","tagline":"Biopieces is a bioinformatic framework of tools easily used and easily created.","body":"![Logo](https://raw.githubusercontent.com/wiki/maasha/biopieces/bp_logo_50.png)\r\n\r\nThe Biopieces are a collection of bioinformatics tools that can be pieced together in a very easy and flexible manner to perform both simple and complex tasks. The Biopieces work on a data stream in such a way that the data stream can be passed through several different Biopieces, each performing one specific task: modifying or adding records to the data stream, creating plots, or uploading data to databases and web services. The Biopieces are executed in a command line environment where the data stream is initialized by specific Biopieces which read data from files, databases, or web services, and output records to the data stream that is passed to downstream Biopieces until the data stream is terminated at the end of the analysis as outlined below:\r\n\r\n```\r\nread_data | calculate_something | write_results\r\n```\r\n\r\nThe following example demonstrates how a next generation sequencing experiment can be cleaned and analyzed – including plotting of scores and length distribution, removal of adaptor sequence, trimming and filtering using quality scores, mapping to a specified genome, and uploading the data to the UCSC genome browser for further analysis:\r\n\r\n```\r\nread_fastq -i data.fq | # Initialize data stream from a FASTQ file.\r\nplot_scores -t png -o scores_unclean.png | # Plot scores before cleaning.\r\nfind_adaptor -c 24 -a TCGTATGCCGTCTTC -p | # Locate adaptor - including partial adaptor.\r\nclip_adaptor | # Clip any located adaptor.\r\ntrim_seq | # End trim sequences according to quality scores.\r\ngrab -e 'SEQ_LEN > 18' # Filter short sequences.\r\nmean_scores -l | # Locate local quality score minima.\r\ngrab -e 'SCORES_MEAN_LOCAL >= 15' | # Filter low local quality score minima.\r\nwrite_fastq -o data_clean.fq | # Write the cleaned data to a FASTQ file.\r\nplot_scores -t png -o scores_clean.png | # Plot scores after cleaning.\r\nplot_distribution -k SEQ_LEN -t png -o lengths.png | # Plot sequence length distribution.\r\nbowtie_seq -c 24 -g hg19 -m 2 | # Map sequences to the human genome with Bowtie.\r\nupload_to_ucsc –d hg19 –t my_data –x # Upload the results to the UCSC Genome Browser.\r\n```\r\n\r\nThe advantage of the Biopieces is that a user can easily solve simple and complex tasks without having any programming experience. Moreover, since the data format used to pass data between Biopieces is text based, different developers can quickly create new Biopieces in their favorite programming language - and all the Biopieces will maintain compatibility. Finally, templates exist for creating new Biopieces in Perl and Ruby.\r\n\r\nThere are currently ~190 Biopieces.\r\n\r\n* To learn more about Biopieces have a look at the [Biopieces Introduction](http://github.com/maasha/biopieces/wiki/Introduction).\r\n* To browse the available Biopieces see the [Biopieces Wiki](http://github.com/maasha/biopieces/wiki).\r\n* If you want to install the Biopieces go to the [Biopieces Installation Instructions](http://github.com/maasha/biopieces/wiki/Installation).\r\n* If you want to contribute Biopieces go to the [Biopieces HowTo](http://github.com/maasha/biopieces/wiki/HowTo).\r\n* Browse publications using Biopieces [here](http://scholar.google.dk/scholar?hl=en&q=biopieces).\r\n\r\nFor important messages, questions, discussion, and suggestions join the [Biopieces Google Group](http://groups.google.com/group/biopieces).\r\n\r\nAlso checkout [BioDSL](http://maasha.github.io/BioDSL/) - a powerful way to create Biopieces like workflows.\r\n\r\nBiopieces was developed with support from the Danish Agency for Science, Technology and Innovation (grant no 272-06-0325).\r\n","google":"UA-390268-5","note":"Don't delete this file! It's used internally to help with page regeneration."} |