Sparksoniq 0.9.7 Mahogany
Pre-release
Pre-release
New alpha release for Sparksoniq, a JSONiq engine to query large-scale JSON datasets stored on HDFS. Spark under the hood.
New:
- Bugfixes.
- It is now possible to read a query locally (--query-path), and output the results on stdin rather than to the local filesystem.
- Fix error on non-existing JSONObject keySet() method due to a backward incompatibility of org.json in some environments.
The jar file was built with ANTLR 4.7 and is compatible with all tested distributions of Spark 2.3+. It is meant to be used with the spark-submit script either as an interactive shell, or to execute a single query from a JSONiq file (local or HDFS) and output the result either on stdin or back to the disk (local or HDFS). This works both locally and with a deployed cluster.
The jar file for older versions of Spark (2.0+) with ANTLR 4.5.3 is available on request (if you receive a warning on the command line).
Documentation: http://sparksoniq.readthedocs.io/en/latest/