Skip to content

Sparksoniq 0.9.7 Mahogany

Pre-release
Pre-release
Compare
Choose a tag to compare
@ghislainfourny ghislainfourny released this 20 May 13:00
· 5971 commits to master since this release
b0a0a69

New alpha release for Sparksoniq, a JSONiq engine to query large-scale JSON datasets stored on HDFS. Spark under the hood.

New:

  • Bugfixes.
  • It is now possible to read a query locally (--query-path), and output the results on stdin rather than to the local filesystem.
  • Fix error on non-existing JSONObject keySet() method due to a backward incompatibility of org.json in some environments.

The jar file was built with ANTLR 4.7 and is compatible with all tested distributions of Spark 2.3+. It is meant to be used with the spark-submit script either as an interactive shell, or to execute a single query from a JSONiq file (local or HDFS) and output the result either on stdin or back to the disk (local or HDFS). This works both locally and with a deployed cluster.

The jar file for older versions of Spark (2.0+) with ANTLR 4.5.3 is available on request (if you receive a warning on the command line).

Documentation: http://sparksoniq.readthedocs.io/en/latest/