ADO.NET Grain Directory #9263

JorgeCandeias · 2024-12-07T20:01:05Z

This PR adds support for an ADO.NET Grain Directory.

A similar implementation to the one proposed here has been battle tested in our main application for a few months now using SQL Server 2019.

There are two main demands that drive the design decisions taken, especially with the SQL artefacts:

We must support arbitrarily long grain keys.
We must support high insert/delete churn without critical performance degradation.

Given the above, the SQL Server implementation uses a non-unique clustered index based on the GrainId hash where uniqueness is guaranteed via careful locking hints in the stored procedures.

Important artefacts:

OrleansGrainDirectory

This is the directory table proper.

For feature completeness sake, this table supports multiple providers within the same cluster.
The GrainIdHash holds the StableHash of the GrainId. This is considered an (unfortunate) low-level implementation detail, and is therefore kept hidden behind the repository-like RelationalOrleansQueries class.
CreatedOn is purely for troubleshooting and not exposed outside the database at all.

CREATE TABLE OrleansGrainDirectory
(
    ClusterId NVARCHAR(150) NOT NULL,
    ProviderId NVARCHAR(150) NOT NULL,
    GrainIdHash INT NOT NULL,
    GrainId NVARCHAR(MAX) NOT NULL,
    SiloAddress NVARCHAR(100) NOT NULL,
    ActivationId NVARCHAR(100) NOT NULL,
    CreatedOn DATETIMEOFFSET(3) NOT NULL
)
GO

CI_OrleansGrainDirectory

This index turns the table into a clustered index that allows duplicates, namely on the GrainIdHash. This allows for individual changes without requiring full table locking, even if indexing GrainId directly is not possible.

Uniqueness of the GrainId proper is ensured via careful page locking hints in the stored procedures.

Loss of perf due to concurrency will happen whenever the GrainIdHash collides or whenever the individual rows are stored on the same page. High fragmentation is inevitable due to the non-ordered nature of the hash key. However, the ordered nature of the index itself is what permits the acquisition of page locks in a consistent order, thereby preventing deadlocks from manifesting.

This index must be maintained on a regular basis.

CREATE CLUSTERED INDEX CI_OrleansGrainDirectory
ON OrleansGrainDirectory
(
    ClusterId ASC,
    ProviderId ASC,
    GrainIdHash ASC
)
GO

PostgreSQL & MySQL/MariaDB:

I was unable to figure out any granular way to prevent deadlocks in both PostgreSQL and MariaDB in this context. Both RDBMSs show lack of support for both non-unique clustered tables and explicit page locking. Alternates approaches utilizing what they do support always ended up failing the chaos tests with duplicates, deadlocks or both. Therefore both of the implementations rely on full table locks to prevent both duplicates and deadlocks.

Microsoft Reviewers: Open in CodeFlow

JorgeCandeias · 2024-12-09T00:33:33Z

I've now added support for PostgreSQL. Unfortunately, I'm unable to add support for MariaDB due to its lack of support for table locking. The usual alternatives such as FOR UPDATE and TRANSACTION ISOLATION LEVEL SERIALIZABLE are ineffective at protecting from deadlocks in this context. Given this stalls the PR from proceeding, please advise on how to move forward.

src/AdoNet/Orleans.GrainDirectory.AdoNet/AdoNetGrainDirectory.cs

JorgeCandeias · 2024-12-09T13:36:22Z

I've refactored the SQL Server artefacts to use an alternate implementation of what we have. This new implementation is now based on a non-unique clustered index. Surprisingly, this approach does not appear to show the churn related performance issues we observed with our first naive go at it (which used a unique clustered index with a surrogate key).

This alternate approach may also be viable for PostgreSQL and MariaDB, will look into these again.

JorgeCandeias · 2024-12-09T21:24:18Z

PostgreSQL & MySQL support is now added.

veikkoeeva · 2025-01-11T08:36:16Z

@JorgeCandeias I have not yet taken a deeper look into this, but if nothing else, take a look out of curiosity. Looks good.

As for those locking things, in the persistence provider I had similar issues and in those cases Orleans has some quarantees of uniques, I just made the DB code branch based on null in version (i.e. it's a new entry, not already held by Orleans) and only take the heavier locks on the null branch and otherwise avoid them since there surely is a row in the DB already. Then other things like using heap index or reverse index to avoid fragmentation maintenance, and prevent all sorts of sniffing problems. Postgres traditionally have had some issues with tuples but it's getting better. MySQL with locking stuff. And there's probably a lot of DBs in use I don't know that much about (Cockroach, Scylla and so on).

I suppose the important issue is that if someone wants to try, or implement, the DB side in some other way or perhaps use in-memory tables or whatever, the interface between Orleans and DB would work with no modifications or with little modifications. It's a tricky thing to run things long periods reliably with performance. But I suppose ultimately that's why we're here. :)

JorgeCandeias · 2025-01-14T18:39:36Z

@veikkoeeva Thank you very much for taking a look at this one. Ultimately this is one of those it's what we can do with what we have occasions.

Old school relational structures just aren't a very good match for this feature. We end up with odd hacks no matter what approach we take. Yet they are better than nothing and something is what we need at the moment. Inmemory structures are often a better match for this but as this package targets a broad audience, this is something less safe to assume availability of.

For example, LocalDB, SSDT and MOTs in SQL Server don't play well together at the moment, and that's a problem for my own team.

I also could not discover any appropriate surgical tools to deal with deadlocks in Postgres and MariaDB in this context, so I had to bring in the hammer. If you know a better approach I'm happy to implement it.

JorgeCandeias added 14 commits December 6, 2024 19:46

Added new project

f7aea92

Added SQL Server artefacts

dcdb36f

Added query artefacts

920a0ac

Added AdoNetGrainDirectory

0edc9fd

Added hosting artefacts

aa78856

Added discovery artefacts

b4455d5

Refactored

65abf77

Added tests

5df7efd

Added tests

3b7426f

Refactored

bdc5cc5

Added tests

7076a74

Cleanup

19c5e8c

Typo

b96e9dd

Added PostgreSQL support

b6b2159

WIP MySQL

348ba95

gfoidl reviewed Dec 9, 2024

View reviewed changes

src/AdoNet/Orleans.GrainDirectory.AdoNet/AdoNetGrainDirectory.cs Outdated Show resolved Hide resolved

JorgeCandeias added 2 commits December 9, 2024 10:36

Review

e138c51

Refactored SQL Server artefacts

3d210a9

JorgeCandeias added 4 commits December 9, 2024 14:04

Comments

d1c7184

Refactored

18473ad

Refactored

22fa270

Refactored

be92c8a

JorgeCandeias changed the title ~~WIP ADO.NET Grain Directory~~ ADO.NET Grain Directory Dec 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ADO.NET Grain Directory #9263

ADO.NET Grain Directory #9263

JorgeCandeias commented Dec 7, 2024 •

edited

Loading

JorgeCandeias commented Dec 9, 2024

JorgeCandeias commented Dec 9, 2024

JorgeCandeias commented Dec 9, 2024

veikkoeeva commented Jan 11, 2025 •

edited

Loading

JorgeCandeias commented Jan 14, 2025

ADO.NET Grain Directory #9263

Are you sure you want to change the base?

ADO.NET Grain Directory #9263

Conversation

JorgeCandeias commented Dec 7, 2024 • edited Loading

Important artefacts:

OrleansGrainDirectory

CI_OrleansGrainDirectory

PostgreSQL & MySQL/MariaDB:

Microsoft Reviewers: Open in CodeFlow

JorgeCandeias commented Dec 9, 2024

JorgeCandeias commented Dec 9, 2024

JorgeCandeias commented Dec 9, 2024

veikkoeeva commented Jan 11, 2025 • edited Loading

JorgeCandeias commented Jan 14, 2025

JorgeCandeias commented Dec 7, 2024 •

edited

Loading

veikkoeeva commented Jan 11, 2025 •

edited

Loading