Reliability

Dataflow has to run on Kubernetes, which means that pods can be deleted and processes killed at anytime. It avoids using its own storage, and relies on the source or sink for storage.

Dataflow aims for at-least once message delivery semantics.

The following disruptions are tolerated:

Loss of network connection to source or sink.
Pod deletion.
Pipeline deletion (metrics will be lost, but no messages).

Under disruption, no messages should be lost and up to 20 messages maybe duplicated.

NATS Jet Stream

No message lost or duplicated is seen under following disruption:

Loss of network connection to source or sink.
Pod deletion.
Pipeline deletion.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RELIABILITY.md

RELIABILITY.md

Reliability

NATS Jet Stream

Files

RELIABILITY.md

Latest commit

History

RELIABILITY.md

File metadata and controls

Reliability

NATS Jet Stream