Skip to content

Rumble 1.1 Arbutus Oak

Pre-release
Pre-release
Compare
Choose a tag to compare
@ghislainfourny ghislainfourny released this 08 Aug 11:08
· 5752 commits to master since this release
30e9103

This is the second beta release of Rumble, a JSONiq engine to query large-scale JSON datasets stored on HDFS. Spark under the hood.

New:

  • bugfixes.
  • more functions
  • FLWOR expressions are now internally mapped to DataFrames and Spark SQL, which brings a 2x performance improvement for grouping and sorting queries.

The jar file was built with ANTLR 4.7 and is compatible with all tested distributions of Spark 2.3+. It is meant to be used with the spark-submit script either as an interactive shell, or to execute a single query from a JSONiq file (local or HDFS) and output the result either on stdin or back to the disk (local or HDFS). This works both locally and with a deployed cluster.

The jar file was compiled with Java 8 and is forward compatible with later Java versions (e.g., Java 11).

The jar file for older versions of Spark (2.0+) with ANTLR 4.5.3 is available on request (if you receive a warning on the command line).

Documentation: http://rumble.readthedocs.io/en/latest/