feat: Data partitions for snapshot and durable state #515

patriknw · 2024-02-02T14:13:55Z

Builds on top of #508; not complete yet.

patriknw · 2024-02-05T14:33:58Z

Ready for review. I think it should cover everything for snapshot and durable state.

johanandren

LGTM, left a few nitpicks.

johanandren · 2024-02-05T14:50:19Z

core/src/main/scala/akka/persistence/r2dbc/R2dbcSettings.scala

+   * the table name is without suffix.
+   */
+  def snapshotTableWithSchema(slice: Int): String =
+    resolveTableName(snapshotsTableWithSchema, slice)


This is accidentally used in PostgresSnapshotDao. Can we make the name factory private and expose a public method that uses the lookup table, to avoid such mistakes?

What do you mean? This is the new method, which should always be used. The old snapshotsTableWithSchema without the slice parameter could become private. I didn't want to break binary compatibility (@InternalStableApi), but maybe it is worth doing that?

I deprecated the old tableWithSchema methods.

Sorry, maybe I was unclear. What I meant is that the consuming code calls this method, which generates a new string on every call instead of using the allSnapshotTablesWithSchema lookup table defined below. Probably not a big thing, but it seems wasteful when we have the lookup table anyway.

Ah, I see now: the lookup table is the reverse, from name to slice.

I see what you mean, but allSnapshotTablesWithSchema isn't a lookup for each of the 1024 slices. It contains one entry per data partition, where the slice is the lower slice for that data partition.
E.g. snapshotTableWithSchema(slice = 17) can't be found easily in allSnapshotTablesWithSchema.
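To illustrate why slice 17 cannot be found directly, here is a minimal sketch of the relationship between slices, data partitions, and the name-to-lower-slice map. It is not the code from this PR: `DataPartitionSketch`, `Settings`, `snapshotTable`, `allSnapshotTables`, the `_<partition>` suffix format, and the assumption that partitions are equally sized contiguous slice ranges are all hypothetical. Only the 1024 slices, the missing suffix for a single partition, and the "lower slice per data partition" shape come from the discussion above.

```scala
// Hypothetical sketch, not the library's implementation.
object DataPartitionSketch {
  val NumberOfSlices = 1024 // number of slices mentioned in the thread

  final case class Settings(snapshotsTable: String, numberOfDataPartitions: Int) {
    require(
      numberOfDataPartitions > 0 && NumberOfSlices % numberOfDataPartitions == 0,
      "number of data partitions must evenly divide 1024")

    // Assumption: each data partition covers an equally sized, contiguous slice range.
    private val slicesPerPartition = NumberOfSlices / numberOfDataPartitions

    def dataPartition(slice: Int): Int = slice / slicesPerPartition

    // Builds a new string on every call unless there is only one partition.
    // The "_<partition>" suffix format is an assumption.
    def snapshotTable(slice: Int): String =
      if (numberOfDataPartitions == 1) snapshotsTable
      else s"${snapshotsTable}_${dataPartition(slice)}"

    // One entry per data partition: table name -> lower slice of that partition.
    val allSnapshotTables: Map[String, Int] =
      (0 until numberOfDataPartitions).map { partition =>
        val lowerSlice = partition * slicesPerPartition
        snapshotTable(lowerSlice) -> lowerSlice
      }.toMap
  }
}
```

Under these assumptions, `Settings("snapshot", 4)` has four map entries with lower slices 0, 256, 512, and 768. Slice 17 resolves to `snapshot_0` by arithmetic, but 17 itself never appears in the map, so the map cannot serve as a per-slice lookup.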

To make it a full lookup table it would have to contain 1024 entries, with the slice as key. It might be possible with some other structure, such as binary search, but I'm not sure that is worth it. I'm more worried about all the string construction of the full SQL statements that now happens on each use. We could introduce a cache for them, and that would also solve this, because the table names are always used within such SQL.

allSnapshotTablesWithSchema was mostly added because in tests we delete all tables before running.
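The SQL cache suggested in this comment could look roughly like the sketch below. It is only an illustration of the idea: `SqlCache`, its `get` method, the statement key, and the choice to key entries by data partition are hypothetical and not taken from the PR. It again assumes equally sized contiguous slice ranges over 1024 slices.

```scala
import scala.collection.concurrent.TrieMap

// Hypothetical sketch of a per-data-partition SQL cache, not the library's implementation.
final class SqlCache(numberOfDataPartitions: Int) {
  require(
    numberOfDataPartitions > 0 && 1024 % numberOfDataPartitions == 0,
    "number of data partitions must evenly divide 1024")

  // Assumption: each data partition covers an equally sized, contiguous slice range.
  private val slicesPerPartition = 1024 / numberOfDataPartitions

  // Key: (statement key, data partition). Value: the fully built SQL string.
  private val cache = TrieMap.empty[(String, Int), String]

  // `build` is evaluated only when no entry exists yet for the slice's data partition.
  // Under contention it may run more than once, but only one result is stored.
  def get(statementKey: String, slice: Int)(build: => String): String =
    cache.getOrElseUpdate((statementKey, slice / slicesPerPartition), build)
}
```

A DAO could then wrap each statement, for example `cache.get("selectSnapshot", slice) { s"SELECT ... FROM ${tableName(slice)} ..." }`, where `tableName` stands for whatever resolves the table name. Both the table name and the full statement are then built once per data partition rather than on each use.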

johanandren · 2024-02-05T14:54:47Z

core/src/main/scala/akka/persistence/r2dbc/R2dbcSettings.scala

+   * The durable state table and schema name with data partition suffix for the given slice. When number-of-partitions
+   * is 1 the table name is without suffix.
+   */
+  def durableStateTableWithSchema(slice: Int): String =


Also used all over the place instead of the prepared lookup table.

Similar comment here. By lookup table, do you mean _durableStateTableByEntityType? For Durable State it was already possible to configure a separate table per entity type, and the accessor for that is

def getDurableStateTableWithSchema(entityType: String, slice: Int)

but I think we also need this one for the default durable_state table
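As a rough illustration of how the two accessors could relate, the sketch below resolves a table per entity type and falls back to the default table. The names `DurableStateTables`, `tableByEntityType`, and `durableStateTable` are hypothetical, and it is an assumption that custom per-entity-type tables get the same data partition suffix as the default table.

```scala
// Hypothetical sketch, not the library's implementation.
final case class DurableStateTables(
    defaultTable: String,
    tableByEntityType: Map[String, String],
    numberOfDataPartitions: Int) {
  require(
    numberOfDataPartitions > 0 && 1024 % numberOfDataPartitions == 0,
    "number of data partitions must evenly divide 1024")

  // Assumption: each data partition covers an equally sized, contiguous slice range.
  private val slicesPerPartition = 1024 / numberOfDataPartitions

  // No suffix when there is a single data partition; the suffix format is an assumption.
  private def withPartitionSuffix(table: String, slice: Int): String =
    if (numberOfDataPartitions == 1) table
    else s"${table}_${slice / slicesPerPartition}"

  // Accessor for the default table, for callers that have no entity type.
  def durableStateTable(slice: Int): String =
    withPartitionSuffix(defaultTable, slice)

  // Accessor for a configured per-entity-type table, falling back to the default table.
  def durableStateTable(entityType: String, slice: Int): String =
    withPartitionSuffix(tableByEntityType.getOrElse(entityType, defaultTable), slice)
}
```

With `DurableStateTables("durable_state", Map("Cart" -> "cart_state"), 2)`, slice 600 resolves to `cart_state_1` for entity type `Cart` and to `durable_state_1` for any other entity type or for the slice-only accessor.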

core/src/main/scala/akka/persistence/r2dbc/internal/postgres/PostgresDurableStateDao.scala

Base automatically changed from wip-data-partitions-patriknw to main February 5, 2024 09:33

patriknw force-pushed the wip-more-data-partitions-patriknw branch from 73af6fe to fdfd25a Compare February 5, 2024 14:33

patriknw marked this pull request as ready for review February 5, 2024 14:33

patriknw force-pushed the wip-more-data-partitions-patriknw branch from fdfd25a to 9f379a7 Compare February 5, 2024 14:37

johanandren approved these changes Feb 5, 2024

patriknw force-pushed the wip-more-data-partitions-patriknw branch 2 times, most recently from f6b13dd to 3ec6548 Compare February 8, 2024 12:54

patriknw mentioned this pull request Feb 8, 2024

doc: Data partitions #519

Merged

patriknw added 11 commits February 9, 2024 10:14

feat: Data partitions for snapshot and durable state

61b46bd

ddl scripts

0127530

create H2 tables

7b75e09

PayloadSpec

17d3e77

durable state currentPersistenceIds and friends

95455a1

DurableStateStoreAdditionalColumnSpec

b0b51ea

ChangeHandler not supported for > 1 database

1726a12

DurableStateUpdateWithChangeEventStoreSpec

1d91177

deprecate old tableWithSchema methods

83d4957

remove durableStateTable in dao

c8e67a6

oops, init order

14c2d67

patriknw force-pushed the wip-more-data-partitions-patriknw branch from b8342d3 to 14c2d67 Compare February 9, 2024 09:15

patriknw added 2 commits February 9, 2024 10:43

another init order

bb3659d

remove fixmes

acd3587

patriknw merged commit 28b95bc into main Feb 9, 2024
9 checks passed

patriknw deleted the wip-more-data-partitions-patriknw branch February 9, 2024 10:11

patriknw added this to the 1.2.2 milestone Feb 9, 2024

johanandren Feb 5, 2024

patriknw Feb 7, 2024

patriknw Feb 8, 2024

johanandren Feb 9, 2024

johanandren Feb 9, 2024

patriknw Feb 9, 2024

johanandren Feb 5, 2024

patriknw Feb 7, 2024
