Chasm interface draft #6987

yycptt · 2024-12-13T08:56:58Z

What changed?

Draft chasm interface with activity as a sample use case.

Why?

How did you test it?

Potential risks

Documentation

Is hotfix candidate?

bergundy · 2024-12-13T19:38:31Z

service/history/chasm/component.go

I would put this in a top level chasm directory. There's likely going to be some chasm related code in other services.

I thought about that as well, but think having a top level chasm package that re-exports everything is probably better.

Some concepts like Field, Ref are exported as a struct and contain un-exported fields needs for the implementation. Having the implementation in the top level package instead of service/history feels a bit weird to me.

There will be some other exported functions, structs etc. in the chasm package meant to be consumed by other packages in the history service, but not the developers who's writing a chasm component. With re-export, we can choose not to re-export them, so dev will see a cleaner top level chasm package.

Synced offline. I've moved the package to top level.
Though later during implementation, we will have to comment that some exported function are meant for internal implementation use and should not be used by lib/component author.

I would just move this out of service/history though since there will be some chasm functionality that isn't strictly tied to the history service (e.g. exposing public APIs via the frontend and possibly extending matching functionality).

Top level, you mean common/chasm?

bergundy · 2024-12-13T19:40:01Z

service/history/chasm/components/activity/activity.go

+		//
+		// Framework will try to recognize the type and do serialization/deserialization
+		// proto.Message is recommended so the component get compatibility if state definition changes
+		State persistencepb.ActivityInfo // proto.Message


Let's call this standard field Data, since State is usually an enum.

Alternatively, we can let you embed a proto and treat that as the data field. That could be slightly nicer to use.

Agree with naming change. Disagree with proto embed. I'd prefer for it to just be explicit.

yeah Data is better. I won't commit this sample activity impl though :)

But I will track as low priority that the reflection logic in chasm engine needs to support embedded fields.

bergundy · 2024-12-13T19:42:49Z

service/history/chasm/components/activity/activity.go

+		Input  *chasm.Field[*common.Payload] `chasm:"lazy"`
+		Output *chasm.Field[*common.Payload] `chasm:"lazy"`
+
+		EventNotifier *chasm.Field[EventNotifier]


I know this is just to show the pointer capabilities, but the real activity implementation will need to do more than just notify its parent, it'll need to be injected with a way to read the inputs and outputs. They won't be embedded payloads in the workflow case.

I also had thought we wanted a way for the parent to be able to decide what notifications it cared about from its children, rather than the children needing to remember to notify the parent. I think that can still be implemented in terms of this a pointer just fine though.

Something as simple as the following I think would be nice, and isn't really even part of the framework, just a useful pattern. I think it keeps things a little more obvious for the activity implementer.

func NewScheduledActivity( chasmContext chasm.MutableContext, params *NewActivityRequest, onComplete func(ActivityCompletedEvent) ) { ... } // Then workflow constructs like NewScheduledActivity(ctx, params, func(ce) { me.AsComponentPointer(ctx).OnCompletion(ce) })

the real activity implementation will need to do more than just notify its parent

Yeah for sure. The interface can be anything and can have a method for like loading events for example. This field internally will just be a persisted pointer (basically the path) to another component.

onComplete func(ActivityCompletedEvent)

The framework needs to be able to persist this pointer. Not exactly sure if we can get a pointer to the method receiver in this approach though.

service/history/chasm/components/activity/activity.go

bergundy · 2024-12-13T19:50:24Z

service/history/chasm/components/activity/fx.go

+
+func (l Library) Components() []chasm.RegistrableComponent {
+	return []chasm.RegistrableComponent{
+		chasm.NewRegistrableComponent[*Activity](


I'm kinda going back and forth between a RegisterComponent function and this approach where the library returns the list of components.

The main downside to this approach is that you have to wrap with NewRegisterableComponent.

If we do want this approach, we may want a LibraryBase struct that has empty implementations for all of the methods so library implementors only provide the items they need registered.

I like that with the concept of a library, you essentially get a namespace for all components and tasks.

I am in favor of returning the list of components. Having to wrap seems like a total non-issue for something that's done this infrequently.

yeah wrapping feels ok to me.

We anyway need a way for specifying name & options of the component. Alternatively they can be methods on the component struct itself, but I feel that's mixing other concerns with the component data & behavior.

LibraryBase makes sense. Guess we also needs it for making the interface forward compatible so all library impl should embed it. Will add.

Lets go with library, I agree it's the better approach.

service/history/chasm/components/activity/fx.go

bergundy · 2024-12-13T19:51:28Z

service/history/chasm/components/activity/handler.go

+	"go.temporal.io/server/service/history/chasm"
+)
+
+// This will be nexus


👍 I'll work on a Nexus POC on top of this.

bergundy · 2024-12-13T19:52:26Z

service/history/chasm/components/activity/tasks.go

+
+type TimeoutTaskHandler struct{}
+
+func (h *TimeoutTaskHandler) Validate(


A little on the fence if Validate should be here or on the Component or the task but this works and has some advantages.

Kinda feels more natural to me if the concept belongs to the Task. and can be shared across multiple components.

bergundy · 2024-12-13T19:54:40Z

service/history/chasm/components/activity/tasks.go

+
+func (h *TimeoutTaskHandler) Validate(
+	chasmContext chasm.Context,
+	activity *Activity,


I wonder if we reuse tasks for separate components and there will need to be a way to provide a Validate function for each of them.

I am thinking the Component here doesn't have to be a concrete struct. It can be an interface and then the same task definition can be reused across components.
When validating/executing the task, the underlying struct will be determined based on the component name stored in the task and the registered concrete type for that name.

Let's try this out. Hard to tell without building some use cases on top of this.

bergundy · 2024-12-13T19:56:18Z

service/history/chasm/components/activity/handler.go

+		return nil, err
+	}
+
+	resp, startedActivityRef, err := chasm.UpdateComponent(


As discussed yesterday, I wish we could get the ref in the update function and keep the ref out of this signature. This is good to start with though.

Yeah, ideally a component author making a transition doesn't need to care where the referenced component came from

ideally a component author making a transition doesn't need to care where the referenced component came from

They have to care unfortunately. The ref also serves as a consistency token. If a second operation depends on the first one, you need to use the ref returned from the first operation or the second operation may talk to a stale copy of the component and get rejected (e.g. not found)

But yeah this means some boilerplate code in the api handler. As a start I think I can at least move the ref ser/de part into chasm.UpdateComponent, so the method will take in and return []byte. We lose some type safety but code will be simpler.

I ended up making the ref type generic as well, and it can be []byte | ComponentRef.
For api, takes in []byte makes sense.
For task processing, decoded ComponentRef makes more sense as the framework will construct ComponentRef in memory. Always takes in []byte means extra encoding/decoding. Framework also needs to put extra task validation fn in the ref as well.

I'm not too concerned with this detail for now, we can figure it out as we build use cases on top of the API.

service/history/chasm/components/activity/activity.go

bergundy · 2024-12-13T20:02:01Z

service/history/chasm/components/activity/handler.go

+		func(a *Activity, ctx chasm.Context, _ *GetActivityResultRequest) bool {
+			return a.LifecycleState() == chasm.LifecycleStateCompleted
+		},
+		func(a *Activity, ctx chasm.MutableContext, _ *GetActivityResultRequest) (*GetActivityResultResponse, error) {
+			outputPayload, err := a.Output.Get(ctx)
+			resp.Output = outputPayload.Data
+			return resp, err
+		},


I wonder if we'd be better off merging these two like so:

Suggested change

func(a *Activity, ctx chasm.Context, _ *GetActivityResultRequest) bool {

return a.LifecycleState() == chasm.LifecycleStateCompleted

},

func(a *Activity, ctx chasm.MutableContext, _ *GetActivityResultRequest) (*GetActivityResultResponse, error) {

outputPayload, err := a.Output.Get(ctx)

resp.Output = outputPayload.Data

return resp, err

},

func(a *Activity, ctx chasm.MutableContext, _ *GetActivityResultRequest) (*GetActivityResultResponse, error) {

if a.LifecycleState() != chasm.LifecycleStateCompleted {

return nil, chasm.ErrNoReady / *name TBD */

}

outputPayload, err := a.Output.Get(ctx)

resp.Output = outputPayload.Data

return resp, err

},

I think I prefer them separate, because what happens if you mutate something and then say "not ready"? That would be some weird violation that shouldn't be possible, and separate contexts enforces that at the type level.

If you need context from one to the other, make the ready function return (bool, T) and T is passed into the mutator

yeah, I separated them mainly to ensure the validation logic is readonly.

I changed the signature of the validation to return (T, bool, error), though that means the mutator is now taking 4 params...

Not sure why there needs to be a mutator in poll or why you need both an error and a bool.

bergundy · 2024-12-13T20:02:58Z

service/history/chasm/engine.go

+// 	panic("not implemented")
+// }
+
+func NewEntity[C Component, I any, O any](


Alternatively, call this CreateEntity but what you have is fine too.

bergundy · 2024-12-17T19:28:02Z

service/history/chasm/engine.go

+	panic("not implemented")
+}
+
+// Not needed for V1


True, this isn't needed but if we were to do it, there's a type safe way using a closure that gets an instance and returns field references.

Sushisource

Overall looking really good to me

Sushisource · 2024-12-18T23:21:50Z

service/history/chasm/components/activity/activity.go

+		//
+		// Framework will try to recognize the type and do serialization/deserialization
+		// proto.Message is recommended so the component get compatibility if state definition changes
+		State persistencepb.ActivityInfo // proto.Message


Agree with naming change. Disagree with proto embed. I'd prefer for it to just be explicit.

Sushisource · 2024-12-18T23:23:38Z

service/history/chasm/components/activity/activity.go

+}
+
+func (i *Activity) GetDispatchInfo(
+	chasmContext chasm.MutableContext,


Should this be an immutable context?

Good catch. yes, it should be immutable.

I still kinda prefer Read and Write context but that's just a personal non-blocking preference.

service/history/chasm/components/activity/activity.go

Sushisource · 2024-12-18T23:27:06Z

service/history/chasm/components/activity/fx.go

+
+func (l Library) Components() []chasm.RegistrableComponent {
+	return []chasm.RegistrableComponent{
+		chasm.NewRegistrableComponent[*Activity](


I am in favor of returning the list of components. Having to wrap seems like a total non-issue for something that's done this infrequently.

Sushisource · 2024-12-18T23:35:31Z

service/history/chasm/components/activity/activity.go

+		Input  *chasm.Field[*common.Payload] `chasm:"lazy"`
+		Output *chasm.Field[*common.Payload] `chasm:"lazy"`
+
+		EventNotifier *chasm.Field[EventNotifier]


I also had thought we wanted a way for the parent to be able to decide what notifications it cared about from its children, rather than the children needing to remember to notify the parent. I think that can still be implemented in terms of this a pointer just fine though.

Something as simple as the following I think would be nice, and isn't really even part of the framework, just a useful pattern. I think it keeps things a little more obvious for the activity implementer.

func NewScheduledActivity( chasmContext chasm.MutableContext, params *NewActivityRequest, onComplete func(ActivityCompletedEvent) ) { ... } // Then workflow constructs like NewScheduledActivity(ctx, params, func(ce) { me.AsComponentPointer(ctx).OnCompletion(ce) })

Sushisource · 2024-12-18T23:37:11Z

service/history/chasm/components/activity/handler.go

+		return nil, err
+	}
+
+	resp, startedActivityRef, err := chasm.UpdateComponent(


Yeah, ideally a component author making a transition doesn't need to care where the referenced component came from

Sushisource · 2024-12-18T23:38:57Z

service/history/chasm/components/activity/handler.go

+		func(a *Activity, ctx chasm.Context, _ *GetActivityResultRequest) bool {
+			return a.LifecycleState() == chasm.LifecycleStateCompleted
+		},
+		func(a *Activity, ctx chasm.MutableContext, _ *GetActivityResultRequest) (*GetActivityResultResponse, error) {
+			outputPayload, err := a.Output.Get(ctx)
+			resp.Output = outputPayload.Data
+			return resp, err
+		},


I think I prefer them separate, because what happens if you mutate something and then say "not ready"? That would be some weird violation that shouldn't be possible, and separate contexts enforces that at the type level.

If you need context from one to the other, make the ready function return (bool, T) and T is passed into the mutator

Sushisource · 2024-12-18T23:40:38Z

service/history/chasm/context.go

+)
+
+type Context interface {
+	// Context is not binded to any component,


Suggested change

// Context is not binded to any component,

// Context is not bound to any component,

Sushisource · 2024-12-18T23:45:36Z

service/history/chasm/ref.go

+		// we probably don't even need this,
+		// can make the function generic and find the name from registry
+		rootComponentName: rootComponentName,


Agree. Taking in the type and having components define their name so you operate in terms of types rather than names is nice.

I realized later that if we pass in the component type, then we'll also need a way to access the registry. Either pass in the registry or the function impl reference a global variable... Neither feels like a good idea to me.

I need to better understand how the services will be exposed as nexus and then come back to this. We might solve the routing problem in a different way and can just store shardID in the ref.

Hmm... shard ID in the ref might work. I don't think any of this is blocking the initial work.

Sushisource · 2024-12-18T23:49:54Z

service/history/chasm/context.go

+	// If we provide this method, then the method on the engine doesn't need to
+	// return a Ref
+	// NewRef(Component) (ComponentRef, bool)


You could have a CreationContext that simply doesn't include this, and that and MutableContext both embed an un-exported base interface.

Problem is new components can be created in any transition not just the first one that creates the entity.

bergundy

Overall LGTM, had some followup questions.

I think we're in agreement but just want to double check that the structure will be:

chasm/ # framework code live here
chasm/lib # libraries live here

bergundy · 2025-01-13T23:52:36Z

service/history/chasm/component.go

I would just move this out of service/history though since there will be some chasm functionality that isn't strictly tied to the history service (e.g. exposing public APIs via the frontend and possibly extending matching functionality).

bergundy · 2025-01-13T23:54:41Z

service/history/chasm/components/activity/activity.go

+}
+
+func (i *Activity) GetDispatchInfo(
+	chasmContext chasm.MutableContext,


I still kinda prefer Read and Write context but that's just a personal non-blocking preference.

bergundy · 2025-01-13T23:55:59Z

service/history/chasm/components/activity/fx.go

+
+func (l Library) Components() []chasm.RegistrableComponent {
+	return []chasm.RegistrableComponent{
+		chasm.NewRegistrableComponent[*Activity](


Lets go with library, I agree it's the better approach.

bergundy · 2025-01-13T23:57:38Z

service/history/chasm/components/activity/handler.go

+		return nil, err
+	}
+
+	resp, startedActivityRef, err := chasm.UpdateComponent(


I'm not too concerned with this detail for now, we can figure it out as we build use cases on top of the API.

bergundy · 2025-01-14T00:01:30Z

service/history/chasm/components/activity/handler.go

+		func(a *Activity, ctx chasm.Context, _ *GetActivityResultRequest) bool {
+			return a.LifecycleState() == chasm.LifecycleStateCompleted
+		},
+		func(a *Activity, ctx chasm.MutableContext, _ *GetActivityResultRequest) (*GetActivityResultResponse, error) {
+			outputPayload, err := a.Output.Get(ctx)
+			resp.Output = outputPayload.Data
+			return resp, err
+		},


Not sure why there needs to be a mutator in poll or why you need both an error and a bool.

bergundy · 2025-01-14T00:02:24Z

service/history/chasm/components/activity/tasks.go

+
+func (h *TimeoutTaskHandler) Validate(
+	chasmContext chasm.Context,
+	activity *Activity,


Let's try this out. Hard to tell without building some use cases on top of this.

bergundy · 2025-01-14T00:04:01Z

service/history/chasm/ref.go

+		// we probably don't even need this,
+		// can make the function generic and find the name from registry
+		rootComponentName: rootComponentName,


Hmm... shard ID in the ref might work. I don't think any of this is blocking the initial work.

alexshtin · 2025-01-17T01:23:50Z

service/history/chasm/component.go

+type LifecycleState int
+
+const (
+	LifecycleStateUnspecified LifecycleState = 0


Suggested change

LifecycleStateUnspecified LifecycleState = 0

LifecycleStateUnspecified = LifecycleState(0)

and then it can be moved to the end of next const block.

Chasm interface draft

017e73d

yycptt requested review from bergundy, Sushisource, alexshtin, lina-temporal and paulnpdev December 13, 2024 08:56

bergundy reviewed Dec 13, 2024

View reviewed changes

service/history/chasm/components/activity/activity.go Show resolved Hide resolved

bergundy reviewed Dec 13, 2024

View reviewed changes

service/history/chasm/components/activity/fx.go Show resolved Hide resolved

bergundy reviewed Dec 13, 2024

View reviewed changes

service/history/chasm/components/activity/activity.go Show resolved Hide resolved

bergundy reviewed Dec 13, 2024

View reviewed changes

bergundy reviewed Dec 17, 2024

View reviewed changes

Sushisource reviewed Dec 18, 2024

View reviewed changes

yycptt requested review from Sushisource and bergundy January 13, 2025 23:54

bergundy approved these changes Jan 14, 2025

View reviewed changes

yycptt changed the title ~~[Will NOT Merge] Chasm interface draft~~ Chasm interface draft Jan 14, 2025

alexshtin reviewed Jan 17, 2025

View reviewed changes


		type TimeoutTaskHandler struct{}

		func (h *TimeoutTaskHandler) Validate(

	// Context is not binded to any component,
	// Context is not bound to any component,

	LifecycleStateUnspecified LifecycleState = 0
	LifecycleStateUnspecified = LifecycleState(0)

Chasm interface draft #6987

Are you sure you want to change the base?

Chasm interface draft #6987

Conversation

yycptt commented Dec 13, 2024

What changed?

Why?

How did you test it?

Potential risks

Documentation

Is hotfix candidate?

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Sushisource left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

bergundy left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

alexshtin Jan 17, 2025 • edited Loading

Choose a reason for hiding this comment

alexshtin Jan 17, 2025 •

edited

Loading