Skip to content

Latest commit

 

History

History
161 lines (123 loc) · 5.48 KB

README.md

File metadata and controls

161 lines (123 loc) · 5.48 KB

Build Status Sonarcloud status Code coverage Maven

Generic Avro Reflect

Generate an Avro schema from any Java object at runtime.

You can find a blog post on medium with some examples and detailed explanations of how Generic Avro Reflect works.

Getting Started

You can add Generic Avro Reflect via Maven Central.

Gradle

compile group: 'com.bakdata', name: 'generic-avro-reflect', version: '1.0.0'

Maven

<dependency>
    <groupId>com.bakdata</groupId>
    <artifactId>generic-avro-reflect</artifactId>
    <version>1.0.0</version>
</dependency>

For other build tools or versions, refer to the latest version in MvnRepository.

Using it

Here is a quick example that shows you how to use Generic Avro Reflect.

Let's say you have this class and want to generate an Avro schema from an instance.

public class MyClass<T> {
    T myValue;
    
    public MyClass(T value) {
        myValue = value;
    }
}

Then to generate the Avro schema, you simply have to call the getSchema() method on the Reflect2Data.

MyClass<String> myObject = new MyClass<>("foo");
Schema mySchema = Reflect2Data.get().getSchema(myObject);

This will result in the following schema:

{
  "type": "record",
  "name": "MyClass",
  "namespace": "org.my.namespace",
  "fields": [
    {
      "name": "myValue",
      "type": "string",
    }
  ]
}

Reading and Writing Schemas

You probably want to use this to write and read your data. Avro uses DatumWriters and DatumReaders for this. You simply specify the schema that should be written or read and the rest will be done for you.

// Get Schema for myObject.
Reflect2Data reflectData = Reflect2Data.get();
MyClass<String> myObject = new MyClass<>("foo");
Schema schema = reflectData.getSchema(myObject);

// Write myObject to a byte array.
DatumWriter datumWriter = reflectData.createDatumWriter(schema);
OutputStream out = new ByteArrayOutputStream();
// We can ignore the `null` for this example.
BinaryEncoder encoder = EncoderFactory.get().binaryEncoder(out, null);
datumWriter.write(x, encoder);

// Read serialized myObject back to a Java object.
// We can ignore the `null`s for this example
BinaryDecoder decoder = DecoderFactory.get().binaryDecoder(out.toByteArray(), null);
DatumReader datumReader = reflectData.createDatumReader(schema);
MyClass<String> myNewObject = datumReader.read(null, decoder);

// We can expect the object before and the one after serializing to be equal.
assertEquals(myObject, myNewObject);

More Examples

You can find many more tests in this repository's test code.

Known Limitations

As the type resolution is done at runtime and depends on actual values being present, there is a limitation regarding null values. If a value is set to null, Reflect2Data cannot resolve its type and will default to Object.

MyClass<String> myObject = new MyClass<>(null);
Schema mySchema = Reflect2Data.get().getSchema(myObject);

Doing this will result in the following schema:

{
  "type": "record",
  "name": "MyClass",
  "namespace": "org.apache.avro.reflect.data",
  "fields": [
    {
      "name": "myValue",
      "type": {
        "type": "record",
        "name": "Object",
        "namespace": "java.lang",
        "fields": [],
      },
    }
  ],
}

This also applies to empty Lists or Maps. In general, you can assume if there is a value on which you can call .getClass(), the type can be resolved.

Development

If you want to contribute to this project, you can simply clone the repository and build it via Gradle. All dependencies should be included in the Gradle files, there are no external prerequisites.

> git clone [email protected]:bakdata/generic-avro-reflect.git
> cd generic-avro-reflect && ./gradlew build

Please note, that we have code styles for Java. They are basically the Google style guide, with some small modifications.

Contributing

We are happy if you want to contribute to this project. If you find any bugs or have suggestions for improvements, please open an issue. We are also happy to accept your PRs. Just open an issue beforehand and let us know what you want to do and why.

License

This project is licensed under the MIT license. Have a look at the LICENSE for more details.