Fix hashing step ids in loops #72

albertchae · 2024-09-06T06:13:24Z

Summary

Per https://github.com/inngest/inngest/blob/main/docs/SDK_SPEC.md#512-ids-and-hashing,
add :n starting with :1 for repeated instances of a step id

Refactored to be a combination of golang and inngest-js implementation to handle edge case of user defined stepId colliding

The inngest-js SDK currently optionally warns of parallel indexing, but
this isn't in scope for beta so I left it out.

Checklist

~~Update documentation~~ N/A documented by sdk spec
Added unit/integration tests

We may also need a check here to see is this step ID already exists.

A user could do something like:

Run "my-step"

Run "my-step:1" (user explicitly specifying this string)

Run a loop of "my-step" steps

I think here this would result in us redeclaring "my-step:1" in the first iteration of the loop even though we'd like to skip it and go straight to "my-step:2".

These IDs just being strings means a user can accidentally stumble into our [ID]:[count] format and break some stuff. 😄

OK pushed a commit to combine both a hash for O(1) in most cases and a loop afterwards just in case a user used that stepId already. Correct me if I'm wrong but is this a bug in the Go SDK then https://github.com/inngest/inngestgo/blob/0a00daba0b2db68ff0f080f787cf63f0a63b44d8/internal/sdkrequest/manager.go#L123-L131 @darwin67 ?

This seems like it could potentially be worth reserving some delimiter characters for metadata if Inngest has other cases where it would want to modify the user provided stepId.

That works! Thank you.

Mm there's definitely some silly edge case that would be unlikely to hit where users are utilizing *:n step IDs explicitly, but that exists everywhere.

Looks like that'd be a bug in Go too, aye. Long-term we can start to shift this over to something safer; it'd be great to not be directly influencing the ID internally for the hash, but requires a versioned change across SDKs.

I thought of another edge case where the user could reuse a my-step:n name after a loop that used it, so I updated the test and logic to handle that too

Per https://github.com/inngest/inngest/blob/main/docs/SDK_SPEC.md#512-ids-and-hashing, add `:n` starting with `:1` for repeated instances of a step id Mostly similar to inngest-js implementation https://github.com/inngest/inngest-js/blob/79069e1a3d700624ce49b323922c113fc952bcc6/packages/inngest/src/components/execution/v1.ts#L819-L831 The inngest-js SDK currently optionally warns of parallel indexing, but this isn't in scope for beta so I left it out

The hash means we don't have to loop from 0 every time and for most cases will just correctly return us the next unused stepId, but looping afterwards guarantees we don't collide with a user defined stepId. So this will be O(1) in most cases and potentially O(n) for pathological functions that have many steps manually named with the `:n` suffix

jpwilliams · 2024-09-12T12:17:29Z

inngest/src/main/kotlin/com/inngest/State.kt

+        val stepNumber = stepIdsToSeenCount[id]
+        stepIdsToSeenCount[id] = stepIdsToSeenCount.getValue(id) + 1
+
+        return "$id:$stepNumber"


That works! Thank you.

Mm there's definitely some silly edge case that would be unlikely to hit where users are utilizing *:n step IDs explicitly, but that exists everywhere.

Looks like that'd be a bug in Go too, aye. Long-term we can start to shift this over to something safer; it'd be great to not be directly influencing the ID internally for the hash, but requires a versioned change across SDKs.

…already allocated in loop

albertchae force-pushed the albert/INN-3329-hash-steps-loop branch from 10f39be to b4b2f5b Compare September 6, 2024 06:18

albertchae commented Sep 6, 2024

View reviewed changes

albertchae requested a review from KiKoS0 September 6, 2024 06:21

albertchae marked this pull request as ready for review September 6, 2024 06:21

albertchae requested review from djfarrelly, darwin67, jpwilliams and tonyhb as code owners September 6, 2024 06:21

albertchae commented Sep 6, 2024

View reviewed changes

albertchae force-pushed the albert/INN-3329-hash-steps-loop branch from f8e69f7 to 7bf72a1 Compare September 6, 2024 14:31

KiKoS0 approved these changes Sep 6, 2024

View reviewed changes

jpwilliams reviewed Sep 6, 2024

View reviewed changes

albertchae added 3 commits September 11, 2024 20:43

Use hash of stepId counts instead of looping

303e984

albertchae force-pushed the albert/INN-3329-hash-steps-loop branch from 7bf72a1 to 43373c8 Compare September 12, 2024 04:25

jpwilliams approved these changes Sep 12, 2024

View reviewed changes

Handle edge case where user reuses step with stepNumber after it was …

1fd5308

…already allocated in loop

albertchae merged commit 5faeb64 into main Sep 13, 2024
9 checks passed

albertchae deleted the albert/INN-3329-hash-steps-loop branch September 13, 2024 00:50

+                      while (true) {
+                          possibleStepId = "$id:$stepNumber"
+                          if (possibleStepId !in stepIds) {
+                              break
+                          }
+                          stepNumber++
+                      }

+                      int runningCount = 10;
+                      for (int i = 0; i < 5; i++) {
+                          int effectivelyFinalVariableForLambda = runningCount;
+                          runningCount = step.run("add-ten", () -> effectivelyFinalVariableForLambda + 10, Integer.class);

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix hashing step ids in loops #72

Fix hashing step ids in loops #72

albertchae commented Sep 6, 2024 •

edited

Loading

linear bot commented Sep 6, 2024

albertchae Sep 6, 2024

darwin67 Sep 6, 2024 •

edited

Loading

darwin67 Sep 6, 2024

albertchae Sep 6, 2024

albertchae Sep 6, 2024

albertchae Sep 6, 2024

albertchae Sep 6, 2024

jpwilliams Sep 6, 2024

albertchae Sep 12, 2024

jpwilliams Sep 12, 2024

albertchae Sep 12, 2024

jpwilliams Sep 12, 2024

+                      RunEntry<Object> loopRun = devServer.runsByEvent(loopEvent).first();
+                      assertEquals("Completed", loopRun.getStatus());
+                      assertEquals(60, loopRun.getOutput());

Fix hashing step ids in loops #72

Fix hashing step ids in loops #72

Conversation

albertchae commented Sep 6, 2024 • edited Loading

Summary

Checklist

Related

linear bot commented Sep 6, 2024

Choose a reason for hiding this comment

darwin67 Sep 6, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

albertchae commented Sep 6, 2024 •

edited

Loading

darwin67 Sep 6, 2024 •

edited

Loading