This repository contains a few problems solved with MapReduce, Pig, and Spark.
Dataset: Yelp Dataset (https://www.yelp.com/academic_dataset)
Analyzed the Yelp dataset to derive useful statistics about the "user", "business", and "review" entities. The dataset was stored in HDFS. The problems were solved with MapReduce programs written in Java and with Pig Latin scripts.