What happens if first record does not have headers but further records in the same batch do? #227

jeqo · 2024-02-26T16:44:42Z

Output schema (avro/parquet) is based on the first record. This works fine for key/value, etc. but headers may differ between records. Different record header elements types can cause issues but this can be solved by casting header types by using StringConverter or similar for headers.
Though, when first record does not have headers the schema type is null. We should figure out what happens with next records in the same batch if they do have header (are the headers missing? does conversion fail?) and provide an approach on how to process infer the schema for headers properly.

See Aiven-Open/gcs-connector-for-apache-kafka#347 (comment)

This was referenced Feb 26, 2024

refactor: improve error message when header types differ #226

Merged

GCS Files Overwritten every minute by kafka sink connector and some messages missing Aiven-Open/gcs-connector-for-apache-kafka#347

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What happens if first record does not have headers but further records in the same batch do? #227

What happens if first record does not have headers but further records in the same batch do? #227

jeqo commented Feb 26, 2024

What happens if first record does not have headers but further records in the same batch do? #227

What happens if first record does not have headers but further records in the same batch do? #227

Comments

jeqo commented Feb 26, 2024