This repository has been archived by the owner on Nov 14, 2023. It is now read-only.
Syoncloud edited this page Aug 8, 2014 · 12 revisions

How to run SeqPig on HortonWorks 2.1.

Since Pig 0.12 includes performance optimizations, I got SeqPig up and running on this platform.

1. SW versions

SeqPig code as of August 7, 2014; Java JDK 1.7.0_45; HortonWorks 2.1 (Pig 0.12.1, Hadoop 2.4.0); Hadoop BAM 6.2; Seal 0.4.0_rc2; picard-1.107.jar, samtools-1.107.jar, tribble-1.107.jar, variant-1.107.jar

2. How To

  1. Get the code: git clone https://github.com/HadoopGenomics/SeqPig
  2. Copy libraries to /hadoop/seqpig/lib/
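
The copy step can be sketched as a small shell function. This is only an illustration: the function name copy_jars and the JAR_SOURCE directory are hypothetical and do not come from the original page. Only the destination /hadoop/seqpig/lib/ is taken from the step above.

```shell
#!/bin/sh
# Sketch: copy every .jar file from a source directory into a lib directory.
# Usage: copy_jars SRC_DIR DEST_DIR
copy_jars() {
    src_dir=$1
    dest_dir=$2
    # Create the destination if it does not exist yet.
    mkdir -p "$dest_dir" || return 1
    copied=0
    for jar in "$src_dir"/*.jar; do
        # Skip the unexpanded glob when the source holds no jars.
        [ -f "$jar" ] || continue
        cp "$jar" "$dest_dir"/ || return 1
        copied=$((copied + 1))
    done
    echo "copied $copied jar(s) to $dest_dir"
}

# Example call (JAR_SOURCE is an assumed download directory):
# copy_jars "$JAR_SOURCE" /hadoop/seqpig/lib
```

Only files ending in .jar are copied, so stray notes or archives in the source directory stay out of the lib directory.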

3. Test

  REGISTER /hadoop/seqpig/build/jar/SeqPig.jar;
  REGISTER /hadoop/seqpig/lib/hadoop-bam-6.3-SNAPSHOT.jar;
  REGISTER /hadoop/seqpig/lib/picard-1.107.jar;
  REGISTER /hadoop/seqpig/lib/seal.jar;
  A = LOAD '/user/guest/input.bam' USING fi.aalto.seqpig.io.BamLoader('yes');
  DUMP A;
