Backward Compatibility: Parallel Branch does not support direct use of states definitions #413

yzhao244 · 2021-06-21T16:57:13Z

What is the question:

Hi, I opened this issue to discuss the spec: compared with spec v0.1, a parallel state branch no longer supports defining states directly.

If my understanding is correct, a parallel state branch currently supports only action definitions, which can use subFlowRef to support a more complex flow such as a switch.

However, using subFlowRef in practice requires the end user to define a subflow first in order to get a workflow ID. Only after obtaining that ID can the end user define the parallel state. I think this adds complexity when the end user is building a simple parallel workflow whose branches contain only a switch or an operation state, and has no need for a reusable subFlowRef ID.

I am sorry to bring back the example of AWS Step function as my reference lol, which allows directly defining states in branch. I feel our specs and theirs have things in common lol..

What would you like to be added:
Is it okay to add the states array back to the branch definition of the parallel state?

Why is this needed:

It ensures backward compatibility.
It is straightforward to build a workflow when the end user can directly define whatever states are needed in a branch.

tsurdilo · 2021-06-22T00:17:53Z

I can definitely understand your point of view on this.
The changes from v0.1 for this were:

We first removed the ability to use states in branches and allowed either actions or a SubFlow state only.
The reason was that states in branches let you build very complex nested structures, for example parallel states nested inside parallel states with layers of error handling. Using SubFlow states at that point seemed like a reasonable alternative, as the runtime can look up the control flow logic via ID and you can also reuse it across multiple branches.
We then removed the SubFlow state and made it into an action. That, to me, is something we should honestly look at again.
I think that making it into an action did bring some benefits, but it also introduced some problems (like forcing looping via control flow logic instead of the user-friendly "repeat" we had on SubFlow states).

Can you show me an example of ASL where you find the use of states inside branches useful?

yzhao244 · 2021-06-28T16:11:24Z

The following example is from AWS Step Functions, where the end user can define states directly in a branch. It is a very simple parallel state use case with no nested structures, and the end user does not need to get a subflow ID when building a simple workflow. However, building the same workflow with the current spec requires the end user to build a subflow first and get an ID. Only after obtaining the subflow ID can the end user define the parallel state.

The parallel state should look like this:

"MyParallelState": {
  "Type": "Parallel",
  "InputPath": "$",
  "OutputPath": "$",
  "ResultPath": "$.ParallelResultPath",
  "Next": "SetCartCompleteStatusState",
  "Branches": [
    {
      "StartAt": "UpdateMonthlyUsageState",
      "States": {
        "UpdateMonthlyUsageState": {
          "Type": "Task",
          "InputPath": "$",
          "OutputPath": "$",
          "ResultPath": "$.UpdateMonthlyUsageResultPath",
          "Resource": "LambdaARN",
          "Retry": [ {
            "ErrorEquals": ["HandledError"],
            "IntervalSeconds": 1,
            "MaxAttempts": 2,
            "BackoffRate": 2.0
          } ],
          "End": true
        }
      }
    },
    {
      "StartAt": "QueueTaxInvoiceState",
      "States": {
        "QueueTaxInvoiceState": {
          "Type": "Task",
          "InputPath": "$",
          "OutputPath": "$",
          "ResultPath": "$.QueueTaxInvoiceResultPath",
          "Resource": "LambdaARN",
          "Retry": [ {
            "ErrorEquals": ["HandledError"],
            "IntervalSeconds": 1,
            "MaxAttempts": 2,
            "BackoffRate": 2.0
          } ],
          "End": true
        }
      }
    }
  ]
}

tsurdilo · 2021-06-28T16:17:40Z

@yzhao244 thanks for the example!
May I offer a counter-point :)
If you look at UpdateMonthlyUsageState and QueueTaxInvoiceState, they are just actions, meaning they just invoke a service.
For this example using our DSL actions makes more sense.
This is the exact same as if you had in each branch an operation state with a single action :) so using actions in this case (and not dummy operation states) not only seems more intuitive, but also reduces the number of lines of json/yaml you have to write.

I was looking for an example where there is actual control-flow logic inside each branch. Based on your example, I would actually argue that what we are doing atm is indeed better :)
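To make the counter-point concrete, here is a rough Python sketch of how an ASL branch that holds a single Task state could be collapsed into a branch with a single action. The helper name `asl_branch_to_action_branch` is hypothetical, and using the ASL `Resource` value directly as the `functionRef` name is an assumption made for illustration only. Retry settings are ignored in this sketch.

```python
from typing import Any


def asl_branch_to_action_branch(asl_branch: dict[str, Any]) -> dict[str, Any]:
    """Collapse an ASL branch holding one Task state into a branch with one action."""
    states = asl_branch["States"]
    if len(states) != 1:
        # More than one state means real control flow, which one action cannot express.
        raise ValueError("branch contains control flow; a single action is not enough")
    name, state = next(iter(states.items()))
    if state.get("Type") != "Task":
        raise ValueError(f"state {name!r} is not a Task")
    # Assumption: the ASL Resource value stands in for the name of a function
    # declared elsewhere in the workflow definition.
    return {"name": name, "actions": [{"functionRef": state["Resource"]}]}


example_branch = {
    "StartAt": "UpdateMonthlyUsageState",
    "States": {
        "UpdateMonthlyUsageState": {"Type": "Task", "Resource": "LambdaARN", "End": True}
    },
}
action_branch = asl_branch_to_action_branch(example_branch)
```

A branch with two or more states raises an error here, which is exactly the case this issue is asking about.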

cdavernas · 2021-07-03T11:05:37Z

@tsurdilo though I mostly agree with your last comment, I think @yzhao244 has a point, too. What if I want to execute two switch cases in parallel? Let's imagine you have a package, and you want to publish it in parallel in both staging and prod. The publishing processes would switch on the package type to determine on which app you actually want to publish it. AFAIK, this cannot be done now without subflows or complex jq expressions.

Now, I've had a similar concern with a client of mine. We first ended up using very complex jq expressions to do so, until we realized they were extremely hard to read and understand without advanced knowledge of jq, and we ended up using a subflow, for a single condition check, basically.

In my opinion, we should think of a way to do so. Probably not using sub-states, as it looks confusing and goes against the actual state concept, but maybe by adding, for example, a condition property on actions.

@yzhao244 @tsurdilo WDYT?

tsurdilo · 2021-07-03T12:23:09Z

I was not disagreeing with this issue, I just wanted to find a use case, and you provided one :) I wonder if it would make more sense to add a branch data filter instead, where you can inject / overwrite a state data field for that branch (one sets prod, the other dev or whatever). My thinking is that if we allow explicit control flow logic in branches, it will end up creating more issues than it brings value, especially for large processes (imagine 50+ states in branches and the maintenance that involves).

yzhao244 · 2021-07-19T18:09:33Z

@tsurdilo @cdavernas Thanks for the replies, guys. :) Actually, the AWS DSL example above lets the UI console easily render a graphical model in real time, since the end user defines the control logic directly. I believe Google Workflow and Ali Serverless Workflow both allow explicit control flow as well, for the same purpose of easily rendering a graphical model.

In our spec, since using subFlowRef in practice requires a workflowId, the end user has to create the subflow first and then get a workflowId. Finally, the end user has to put that workflowId in the branches. My personal feeling is that this process is a bit complex, and it makes rendering a graphical model in the UI more complicated compared with allowing explicit control flow.

@tsurdilo I looked at your counter-point... using actions is a very good point lol. However, I updated my example to add retries, which our actions do not support. :) Actually, I totally agree that for better maintenance of large flows, using a subworkflow ID is definitely way better than explicit control flow logic. However, allowing explicit control flow logic in branches shouldn't be a problem, because it sounds like a common thing that end users would want to do. :)

Is it possible to support both explicit control flow in branches and a subworkflow ID, so that the end user can choose the latter for better maintenance of large flows?

yzhao244 · 2021-08-04T17:30:10Z

@tsurdilo @cdavernas To better support explicit control flow, I am thinking of setting a limit on the number of nested layers when the end user chooses to define explicit control flow in branches, so that our spec can ensure backward compatibility by supporting explicit flow. The following is a suggested spec for the Branch definition that supports both actions and states definitions. :)

Parallel State: Branch

| Parameter | Description | Type | Required |
| --- | --- | --- | --- |
| name | Branch name | string | yes |
| actions | Actions to be executed in this branch | array | no |
| maxLayers | Limits the maximum number of nested layers | integer | no |
| states | States to be executed in this branch | array | no |
| timeouts | Branch specific timeout settings | object | no |
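As a rough illustration of how tooling might enforce the proposed `maxLayers` limit, here is a small Python sketch. Both `maxLayers` and the branch-level `states` array are only the proposal from this comment, not part of a released spec. The helper names are hypothetical, and the sketch assumes that one "layer" means one parallel state nested inside a branch.

```python
from typing import Any


def nesting_depth(states: list[dict[str, Any]]) -> int:
    """Return how many layers of parallel states are nested inside `states`."""
    depth = 0
    for state in states:
        if state.get("type") != "parallel":
            continue
        for branch in state.get("branches", []):
            # One layer for this parallel state, plus whatever its branches nest.
            depth = max(depth, 1 + nesting_depth(branch.get("states", [])))
    return depth


def exceeds_max_layers(branch: dict[str, Any]) -> bool:
    """Check a branch against its own (optional) maxLayers setting."""
    max_layers = branch.get("maxLayers")
    if max_layers is None:
        return False  # no limit declared on this branch
    return nesting_depth(branch.get("states", [])) > max_layers


inner = {"name": "q", "type": "parallel", "branches": [{"name": "x", "actions": []}]}
outer = {"name": "p", "type": "parallel", "branches": [{"name": "inner", "states": [inner]}]}
branch = {"name": "b", "maxLayers": 1, "states": [outer]}
```

In this example the branch nests two parallel layers, so a `maxLayers` of 1 is exceeded while a `maxLayers` of 2 is not.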

tsurdilo · 2021-08-04T17:50:42Z

@yzhao244

> I updated my example with adding retries which our actions do not support retry. :)

they do now (will be hopefully added to 0.7 release :) ) - #435

> For better supporting explicit control flow, I am thinking to set a limit on number of nested layers

Overall I do think that supporting both is in the end what we want to do. The way I'd like to do it however is a little different for 3 reasons:

States inside a branch have to conform to the "workflow control flow logic", meaning they have to have a starting state and define one or more end definitions.
States inside the branches should not be able to be transitioned to from a) other branches in the parallel state or b) other states in the main workflow control flow logic.
States inside a branch should not be able to transition to a) other branch states or b) other core workflow states.

I would like to create a "grouping" of states, preferably with a label/id attached to them. From the parallel state branch you can then simply reference this group of states instead of having them hard-coded.
States in the core workflow definition would all be in the "main" group. It would also make things easier for tooling, as validation can then check whether a state in the "main" group tries to transition to a state that is not in that same group (and vice versa) and raise validation error(s).
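A minimal Python sketch of the kind of validation described above might look like the following. Everything about the shape is an assumption for illustration: groups are modelled as a mapping from a group label to its list of states, `transition` is treated as a plain state name, and `validate_groups` is a hypothetical helper. It checks only that each group has an end state and that no transition leaves its group.

```python
from typing import Any


def validate_groups(groups: dict[str, list[dict[str, Any]]]) -> list[str]:
    """Return validation errors for transitions that cross group boundaries."""
    # Map every state name to the group that owns it.
    owner = {
        state["name"]: group
        for group, states in groups.items()
        for state in states
    }
    errors = []
    for group, states in groups.items():
        # Each group must define at least one end state.
        if not any(state.get("end") for state in states):
            errors.append(f"group '{group}' has no end state")
        for state in states:
            target = state.get("transition")
            if target is None:
                continue
            if target not in owner:
                errors.append(f"{state['name']}: unknown transition target '{target}'")
            elif owner[target] != group:
                errors.append(
                    f"{state['name']}: transition to '{target}' leaves group '{group}'"
                )
    return errors


groups = {
    "main": [{"name": "a", "transition": "b"}, {"name": "b", "end": True}],
    "publish": [{"name": "c", "transition": "b"}],
}
errors = validate_groups(groups)
```

Here the "publish" group produces two errors: it has no end state, and state "c" tries to transition into the "main" group.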

WDYT?

yzhao244 · 2021-08-18T21:42:10Z

@qjl1988

yzhao244 · 2021-08-18T21:59:14Z

@tsurdilo I read the ActionRetries in the 0.7 release. Good work :)
"Grouping" of states sounds like an interesting idea. However, since the parallel state branch still needs to reference the group, how is that different from referencing a subworkflow ID in the current spec? Maybe it is just me lol having difficulty visualizing the grouping of states, or maybe I misunderstood something here. lol

tsurdilo · 2021-08-18T23:10:08Z

@yzhao244 yeah, we can/should definitely talk about it to figure it out. Let's plan on that.

yzhao244 · 2021-09-02T19:06:10Z

@tsurdilo I created this one on the discussion board :) #466

github-actions · 2021-10-18T00:51:13Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

cdavernas · 2024-05-17T08:53:25Z

Closed as resolved by 1.0.0-alpha1, and therefore as part of #843

In 1.0.0-alpha1, the concepts of states and actions are replaced by tasks, which can define subtasks.

You can therefore do something like:

document:
  dsl: 0.10
  namespace: default
  name: amazon-parallel-workflow-equivalency
do:
  myParallelState:
    execute:
      concurrently:
        updateMonthlyUsage: { ... }
        queueTask:
          execute:
            sequentially:
              invoice: { ... }
              email: { ... }

tsurdilo added the question label Jun 22, 2021

tsurdilo added this to the v0.8 milestone Aug 18, 2021

github-actions bot added the Stale Issue label Oct 18, 2021

tsurdilo removed the Stale Issue label Oct 29, 2021

tsurdilo self-assigned this Nov 9, 2021

tsurdilo added the Status: In progress label Nov 9, 2021

tsurdilo modified the milestones: v0.8, v0.9 Dec 4, 2021

ricardozanini added this to Progress Tracker May 25, 2023

ricardozanini moved this to Todo in Progress Tracker May 25, 2023

rguillome mentioned this issue Jul 27, 2023

Add parallel (and mixed) operations and events #778

Closed

cdavernas closed this as completed May 17, 2024

github-project-automation bot moved this from Todo to Done in Progress Tracker May 17, 2024

This was referenced May 20, 2024

1.0.0-alpha1: attempt#1 #846

Closed

1.0.0-alpha1 #847

Merged
