Add documentation about workflow and task lifecycle events #1054

cdavernas · 2025-01-09T11:50:48Z

Please specify parts of this PR update:

Discussion or Issue link:
#1024
#1030

What this PR does:

Adds documentation about workflow and task lifecycle events

Closes serverlessworkflow#1030 Signed-off-by: Charles d'Avernas <[email protected]>

Signed-off-by: Charles d'Avernas <[email protected]>

fjtirado · 2025-01-09T15:07:02Z

dsl-reference.md

+    - [Task Lifecycle Events](#task-lifecycle-events)
+      + [Task Created](#task-created-event)
+      + [Task Started](#task-started-event)
+      + [Task Suspended](#task-suspended-event)


Now that I took a closer look, and after my yesterday comment related with workflow ~~cancelation~~ suspension, Im not 100% sure a task can be really suspended.
A ~~cancelation~~ suspension of a workflow can be understood as "let the current task to finish and hold execution of the next one" rather than "Interrupt current task and let it in a limbo"

If a task is a composite one (for, do, try) it can be of course be stopped, but at the end, it will executing another non composite task (basically a call), which is kind of atomic.
I think we can avoid that conundrum by remaining silent about the whole task cycle thing and just define events at workflow level (and task started and completed)

Now that I took a closer look, and after my yesterday comment related with workflow cancelation, Im not 100% sure a task can be really suspended.

Why not? Let's say you do a wait task, which sleeps for 10secs. You suspend it after 5secs, when resumed it still sleeps for another 5secs. Same can apply to consuming events, foreach enumerations, etc?

I think we can avoid that conundrum by remaining silent about the whole task cycle thing and just define events at workflow level (and task started and completed)

That's an option, but I think it's our loss. Workflow level updates are not enough in most cases, such as long running flows. An other option, though I'd prefer to leave it as is, is to only enforce workflow lifecycle events, while leaving task lifecycle events optional.

About the wait example, if we interrupt the wait (or stop listening to an event), what we should do when resuming, execute the next task after the wait or execute the wait again? I think we can avoid that question by not letting suspeding a workflow that is already waiting.
In the case of a For, what I suggested is to let the call that the loop is currently exeucting finish (the definition of finish might vary for a task, if a call, let the call finish, if a set, let the assignment finish, if a do, hold after the current task being executed finish, and so on )

Note that I edited my first comment, Im not talking about cancelation, but about suspension. Sorry for the confusion. Cancellation is clear, you cancel the workflow execution, interrupting if you can and not allowing resuming. But suspending-resuming is trickier.

About the wait example, if we interrupt the wait (or stop listening to an event), what we should do when resuming

I think you missed my point there:

Let's say you do a wait task, which sleeps for 10secs. You suspend it after 5secs, when resumed it still sleeps for another 5secs.

In the case of a For, what I suggested is to let the call that the loop is currently executing finish

If what you mean is that the iteration should be completed before suspending, I think it's a possibility, but not my personal preference, as the iteration can be a succession of perfectly suspendable tasks.

I think we should not allow suspending a workflow that is waiting (so implementors do not need that resuming logic for event/wait, which might be pretty tricky in some cases)
And for other task, I think suspending should let the operation finish (the call or the jq expression) and then freeze, so you resume in the next task.

I think we should not allow suspending a workflow that is waiting

Your opinion makes sense, it can be tricky indeed! Maybe we should let that up to the implementers, and add a couple of lines about those specifics?

Resuming tasks can be challenging. Ideally, as @cdavernas suggested, allowing the runtime to directly resume a suspended task would be the best approach. However, this is often not feasible. For a passive task, such as a wait operation, it might work. But for active tasks, like an HTTP call request, the situation becomes more complex. Even if the runtime cancels the request, the server may already be processing it, and retrying the request upon resuming could lead to unintended behaviors.

As a result, there isn't a universal solution. The suspension mechanism must depend on the nature of the task. Passive tasks, such as wait or listen, can typically be suspended and resumed. Active tasks, like call or run, would need to complete before the suspension can occur, deferring it to the subsequent task.

fjtirado · 2025-01-09T15:11:40Z

dsl-reference.md

+      + [Task Cancelled](#task-cancelled-event)
+      + [Task Faulted](#task-faulted-event)
+      + [Task Completed](#task-completed-event)
+      + [Task Status Changed](#task-status-changed-event)


Status, as I understood it, only applies to the worklow as a whole
https://github.com/serverlessworkflow/specification/blob/main/dsl.md#status-phases

Not in my opinion. Task status is extremely important in most, if not all workflow execution scenarios. Think of a long running flow which you represent in a UI. Just having flow events will let your users know the flow has started or ended, but whatever is in the middle will be unknown to them, potentially for hours or even days at a time.

ok, then we also need to edit that section to make it clear phase status applies both for workflow and task (I implemented it just for workflows because there was not hint on that section)
Note that I also implemented task created and task completed, because I concur on the usefulness for users, which is unclert to me is that they need more than task created and task completed ;), specially because we have compoiste task, so in theory you can have a do withint a do withitn a for withint a try and the four task will be running, so when you suspend them, you will have to send 4 events

ok, then we also need to edit that section to make it clear phase status applies both for workflow and task

I edited it in the PR, because as you said it only applied to workflows beforehand. Have you checked the updated section in the PR? I think it's clear, but I'd be happy to update it as you see fit!

is unclear to me is that they need more than task created and task completed

Well, I see what you mean, but having at least canceled and faulted seemed reasonable. Let's discuss it in today's daily!

@cdavernas @JBBianchi as dicussed in Slack, I think that if we have specific event for every phase change (both for workflow an task) we do not really need the status change one (or the other way around, if we have the status change, we do not need the specific ones)
Having only the specific ones (created, started, faulted, completed) covers all the status phase changes, so a user just interested in the fact that the status has changed, can consume them and ignore the cloud event data (thats the beauty of the CloudEvent structure, the same event can be used in different ways depending on the consumer needs)

The status change event is a generic event which is in my personal experience critically useful. As a matter of fact, users who want to have lifecycle notifications but that are not interested in event-specific data are usually going for those kind of events. While not necessary per say, they are an absolute plus for many users. They are usually fired after any other events, to communicate in a generic fashion with the outside world.

fjtirado · 2025-01-09T15:11:58Z

dsl-reference.md

+      + [Workflow Completed](#workflow-completed-event)
+      + [Workflow Status Changed](#workflow-status-changed-event)
+    - [Task Lifecycle Events](#task-lifecycle-events)
+      + [Task Created](#task-created-event)


I do not see how a task can be created and not started

Well, a task can be instantiated, meaning initialized and provisioned with its input, and not have yet started. That's what we do in Synapse. Unlike workflows, tasks do not exist beforehand, you therefore need to tell the user about them being created: you cannot start something that has not been created.

Lets use an example
do

firstTask

secondTask

thirdTask

I guess the sequence of events will be firstTask created, first task started, ..., first task completed, second task created, second task started, ..., second task completed

so inmediatealy after firstTask instance has been created with some input, it will start execution. (obviously you have to create something before executing it, which Im discussing is the fact that a task can be created without starting it just inmediately)

What Im trying to say is the same we do not have an event called "setting input to task instance before actually starting it" we do not need an event "taks created before starting it"

Im trying to save a redundant event, thats it, but if you feel the distinction between task created and task started is relevant, as far as it not fordibben for them to be simoulteneous in a particular implementation, Im fine

I'd rather have a simplified way and go with only started.

Lets use an example do

* firstTask * secondTask * thirdTask

I guess the sequence of events will be firstTask created, first task started, ..., first task completed, second task created, second task started, ..., second task completed

so inmediatealy after firstTask instance has been created with some input, it will start execution. (obviously you have to create something before executing it, which Im discussing is the fact that a task can be created without starting it just inmediately)

I believe this depends on the runtime implementation. It’s possible to provision tasks beforehand and start them later. For example:

firstTask created

secondTask created

thirdTask created

firstTask started

firstTask completed

secondTask started

started completed

From a human perspective, creating a task is like adding it to your to-do list, while starting a task corresponds to actually executing it.

That being said, I’m not entirely convinced that having a created state for tasks is mandatory. One could even argue that its counterpart should also exist at the workflow level if it were the case. Additionally, I see no issue if a runtime generates more events than those listed here, as long as it still produces at least the required events.

With that in mind, perhaps the created state could be omitted.

@JBBianchi Thats it. For me the moment in which is a task instance is created is an implementation detail the user is not really interested on, specially when, as far as I know, we do not really have the concept of an schedule task and users cannot arbitrarily exeucte task out of order (the task are executed according to the workflow definition) or do not explicitly intervene in the transitions (they might indirectly intervene by sending events for listen task or in custom ones, but there is not a task phase transition API the user might invoke. In other words, task are not created waiting for the user to start them manually. If that was the situation, the event will make perferct sense, but that being not the case, this state creates some ambiguity that I will prefer to avoid)

@fjtirado The problem that I see is the data carried by the created event, which I do not want to carry over to the started one, for semantic reasons. While I do not care of getting rid of the created event, I am strongly opposed to moving its payload to the started one. However, not doing so, we are forbidding the external world to know intrinsics about the task being run.

In other words, task are not created waiting for the user to start them manually.

@fjtirado No, but as @JBBianchi mentionned, you could have provisionned them beforehand, before running them in the order defined by the workflow, in a warmup procedure or whatever.

fjtirado

I have several doubts about the concept of task life cycle.
Its doable, buy pretty tricky and Im not sure we have to force implementors to keep track of every task status.
Also, for some task, for example, wait, which should be the status? suspended?

dsl-reference.md

dsl.md

cdavernas · 2025-01-09T15:25:47Z

Also, for some task, for example, wait, which should be the status? suspended?

@fjtirado That's specified in the dsl.md file:

| waiting | The workflow/task execution is temporarily paused, awaiting either inbound event(s) or a specified time interval as defined by a wait task. |

Co-authored-by: Ricardo Zanini <[email protected]>

cdavernas · 2025-01-09T15:35:16Z

I have several doubts about the concept of task life cycle.
Its doable, buy pretty tricky

I must be missing something, because I do not see the problem with it. As a matter of fact, we are already doing it in Synapse, as you can see here, for example.

Signed-off-by: Charles d'Avernas <[email protected]>

Add documentation about workflow and task lifecycle events

6a102e3

Closes serverlessworkflow#1030 Signed-off-by: Charles d'Avernas <[email protected]>

cdavernas added change: documentation Improvements or additions to documentation. It won't impact a version change. change: feature New feature or request. Impacts in a minor version change area: spec Changes in the Specification labels Jan 9, 2025

cdavernas added this to the v1.0.0 milestone Jan 9, 2025

cdavernas self-assigned this Jan 9, 2025

cdavernas requested a review from ricardozanini as a code owner January 9, 2025 11:50

cdavernas linked an issue Jan 9, 2025 that may be closed by this pull request

Cloud events to be publlished when workflow status change #1030

Open

Add the missing workflowDefinitionReference object

5221ef3

Signed-off-by: Charles d'Avernas <[email protected]>

cdavernas linked an issue Jan 9, 2025 that may be closed by this pull request

Add a new suspended workflow status phase #1024

Open

fjtirado reviewed Jan 9, 2025

View reviewed changes

ricardozanini approved these changes Jan 9, 2025

View reviewed changes

cdavernas and others added 9 commits January 9, 2025 16:26

Update dsl-reference.md

faac516

Co-authored-by: Ricardo Zanini <[email protected]>

Update dsl-reference.md

38dea37

Co-authored-by: Ricardo Zanini <[email protected]>

Update dsl-reference.md

273d09a

Co-authored-by: Ricardo Zanini <[email protected]>

Update dsl-reference.md

4fe63d2

Co-authored-by: Ricardo Zanini <[email protected]>

Update dsl-reference.md

81cd84d

Co-authored-by: Ricardo Zanini <[email protected]>

Update dsl-reference.md

4ba44d2

Co-authored-by: Ricardo Zanini <[email protected]>

Update dsl-reference.md

ef40498

Co-authored-by: Ricardo Zanini <[email protected]>

Update dsl-reference.md

1e47ca7

Co-authored-by: Ricardo Zanini <[email protected]>

Update dsl-reference.md

7ac9dd5

Co-authored-by: Ricardo Zanini <[email protected]>

cdavernas added 4 commits January 9, 2025 16:53

Merge branch 'main' into feat-lifecycle-cloud-events

8886c3a

Merge branch 'main' into feat-lifecycle-cloud-events

6a22a45

Change 'canceled' to 'cancelled', as requested by @fjtirado

d601036

Signed-off-by: Charles d'Avernas <[email protected]>

Merge branch 'main' into feat-lifecycle-cloud-events

bd6e4ed

Merge branch 'main' into feat-lifecycle-cloud-events

865464e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add documentation about workflow and task lifecycle events #1054

Add documentation about workflow and task lifecycle events #1054

cdavernas commented Jan 9, 2025 •

edited

Loading

fjtirado Jan 9, 2025 •

edited

Loading

fjtirado Jan 9, 2025

cdavernas Jan 9, 2025

cdavernas Jan 9, 2025

fjtirado Jan 9, 2025

fjtirado Jan 9, 2025 •

edited

Loading

cdavernas Jan 9, 2025

fjtirado Jan 9, 2025

cdavernas Jan 9, 2025

JBBianchi Jan 9, 2025

fjtirado Jan 9, 2025 •

edited

Loading

cdavernas Jan 9, 2025

fjtirado Jan 9, 2025 •

edited

Loading

cdavernas Jan 9, 2025

cdavernas Jan 9, 2025 •

edited

Loading

fjtirado Jan 16, 2025

cdavernas Jan 16, 2025

fjtirado Jan 9, 2025

cdavernas Jan 9, 2025

fjtirado Jan 9, 2025

fjtirado Jan 9, 2025

fjtirado Jan 9, 2025 •

edited

Loading

ricardozanini Jan 10, 2025

JBBianchi Jan 16, 2025

fjtirado Jan 16, 2025

cdavernas Jan 16, 2025 •

edited

Loading

cdavernas Jan 16, 2025 •

edited

Loading

fjtirado left a comment

cdavernas commented Jan 9, 2025 •

edited

Loading

cdavernas commented Jan 9, 2025

Add documentation about workflow and task lifecycle events #1054

Are you sure you want to change the base?

Add documentation about workflow and task lifecycle events #1054

Conversation

cdavernas commented Jan 9, 2025 • edited Loading

fjtirado Jan 9, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

fjtirado Jan 9, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

fjtirado Jan 9, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

fjtirado Jan 9, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cdavernas Jan 9, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

fjtirado Jan 9, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cdavernas Jan 16, 2025 • edited Loading

Choose a reason for hiding this comment

cdavernas Jan 16, 2025 • edited Loading

Choose a reason for hiding this comment

fjtirado left a comment

Choose a reason for hiding this comment

cdavernas commented Jan 9, 2025 • edited Loading

cdavernas commented Jan 9, 2025

cdavernas commented Jan 9, 2025 •

edited

Loading

fjtirado Jan 9, 2025 •

edited

Loading

fjtirado Jan 9, 2025 •

edited

Loading

fjtirado Jan 9, 2025 •

edited

Loading

fjtirado Jan 9, 2025 •

edited

Loading

cdavernas Jan 9, 2025 •

edited

Loading

fjtirado Jan 9, 2025 •

edited

Loading

cdavernas Jan 16, 2025 •

edited

Loading

cdavernas Jan 16, 2025 •

edited

Loading

cdavernas commented Jan 9, 2025 •

edited

Loading