Schema-Harvester

Schema-Harvester is a tool that parses existing JSON documents and tries to derive a JSON schema from them.

It comes with different "frontends" to consume JSON documents from different sources, currently via CLI or from Kafka.

Kafka-Service usage

You need a Kafka topic to which the service publishes schemas. Each schema is published with the name of its source topic as the message key.

Enabling log compaction (cleanup.policy=compact) on the schema topic is recommended but optional. With compaction, Kafka retains the latest schema for each key, i.e. for each source topic.
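A compacted schema topic could be created with Kafka's own `kafka-topics.sh` tool. This is a minimal sketch under assumptions: the topic name `schemas` and the broker address `localhost:9092` are taken from the validation example in this README, the partition and replication counts are placeholders for a local single-broker setup, and the helper function `create_schema_topic` is a hypothetical name, not part of Schema-Harvester.

```shell
# Assumed values; adjust broker address and topic name to your setup.
BROKER="${KAFKA_BROKER_ADDRESS_LIST:-localhost:9092}"
SCHEMA_TOPIC="schemas"

# Hypothetical helper: creates the schema topic with log compaction enabled.
create_schema_topic() {
  # cleanup.policy=compact makes Kafka keep the newest record per key,
  # which here means the latest schema for each source topic.
  kafka-topics.sh --bootstrap-server "$BROKER" \
    --create --topic "$SCHEMA_TOPIC" \
    --partitions 1 --replication-factor 1 \
    --config cleanup.policy=compact
}
```

Calling `create_schema_topic` once before starting the service should be enough. The function only wraps the command so that the settings stay in one place.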

Create a config.toml (e.g. copy config.sample.toml; see `config.default.toml` for all options) and start the service:

harvesterd

By default, it consumes all topics it has access to.

CLI Usage

Consume a file with line-separated JSON documents:

$ cat line_separated.json | schema-harvester
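For reference, "line-separated" means that every line of the input holds one complete JSON document. The sketch below writes such a file. The field names and values are made-up sample data, chosen only to show documents whose shapes differ slightly.

```shell
# Write three JSON documents, one per line (hypothetical sample data).
cat > line_separated.json <<'EOF'
{"id": 1, "name": "sensor-a"}
{"id": 2, "name": "sensor-b", "tags": ["indoor"]}
{"id": 3, "name": null}
EOF

# Count the documents: one line per document.
wc -l < line_separated.json
```

The resulting file can be piped into schema-harvester exactly as in the command above.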

Consume via MQTT (using Eclipse Mosquitto):

$ mosquitto_sub -t homeassistant/event | schema-harvester

Consume from Kafka (using kcat):

$ kcat -b $KAFKA_BROKER_ADDRESS_LIST -t your_topic | schema-harvester

Verify schemas

To verify that a generated schema is itself a valid JSON Schema, the repository provides an example executable that wraps the schema validation of the jsonschema crate.

cargo run --example validate schema.json
# or, e.g., directly from Kafka
kcat -b localhost:9092 -t schemas -o-1 -C -e | cargo run --example validate

