Sparker is a Data cleansing and Data transformation library built on top of Apache Spark. It is designed to support basic data cleansing and transformation operations especially on Big Data in near real time.
It is aimed to be used as the transformation framework for DataGraft.
It is developed using Scala 2.10 and Apache Spark 1.6.0 version.
This is still an experimental project and actively being developed add more features and standard ETL operations.