Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Replace push.apply with dedicated function #12361

Open
wants to merge 10 commits into
base: main
Choose a base branch
from
Open

Conversation

javagl
Copy link
Contributor

@javagl javagl commented Dec 5, 2024

Fixes #12053
Makes #12176 obsolete

Description

The title of the issue is "CesiumJS should not use push.apply for potentially large arrays" , which bears the question of what is a "large" array. Given that the answer to that question is the same as to "How long is a piece of string?", this PR just replaces all appearances of
target.push.apply(target, source);
with
addAll(source, target);
using a newly introduced addAll function.

This addAll function is a top-level function, but marked as @private for now.

Issue number and link

#12053

Testing plan

The case where this issue originally came up was that of a glTF with many accessors: Each accessor causes a 'promise' to be created, and these promises had been pushed to a target array with push.apply. For many accessors, this caused a RangeError, as described in the issue.

The following is a sandcastle that loads such a glTF asset, and can be used to verify that the issue no longer appears:

const viewer = new Cesium.Viewer("cesiumContainer", {
  globe: false
});

const url = "http://localhost:8003/callStackTest_250x250.glb";

const camera = viewer.scene.camera;
camera.position = new Cesium.Cartesian3(500, 140, 140);
camera.direction = Cesium.Cartesian3.negate(
  Cesium.Cartesian3.UNIT_X, new Cesium.Cartesian3());
camera.up = Cesium.Cartesian3.clone(Cesium.Cartesian3.UNIT_Z);
camera.frustum.fov = Cesium.Math.PI_OVER_THREE;
camera.frustum.near = 0.01;
camera.frustum.far = 1000.0;
camera.lookAtTransform(Cesium.Matrix4.IDENTITY);

let totalDurationMs = 0;


async function addModel(runs, currentRun) {
  if (currentRun === runs) {
    const averageDurationMs = totalDurationMs / runs;
    console.log("runs: "+runs);
    console.log("totalDurationMs: "+totalDurationMs);
    console.log("averageDurationMs: "+averageDurationMs);
  } else {
    
    console.log("Adding, run " + currentRun + " of " + runs);
    const before = performance.now();
    const model = await Cesium.Model.fromGltfAsync({
      url: url,
    });
    viewer.scene.primitives.add(model);

    model.readyEvent.addEventListener(async () => {
      const after = performance.now();
      const durationMs = after-before;
      console.log("ready, took "+durationMs+" ms");
      totalDurationMs += durationMs;
      if (currentRun < runs - 1) {
        viewer.scene.primitives.remove(model);
      }
      await addModel(runs, currentRun + 1);
    });
  }  
}

addModel(1, 0);
//addModel(10, 0);

The following archive contains the main asset for the test:

callStackTest.zip

When using the callStackTest_250x250.glb, then the current main state causes an error. With the fixed state, it works.


The archive contains a few smaller assets as well, and the sandcastle offers a very basic "benchmarking" functionality, to repeatedly load an asset. This is intended for checking whether this change has a noticable performance impact on this particular code path. This makes limited sense: There have been many places that used push.apply. Many of them never saw anything that even remotely was a "large" array. In the specific case of the glTF models, most of the time was not spent in push.apply, but with loading the data to begin with. But as a baseline for this case that can easily be tested (beyond some "microbenchmarks", as shown in the linked PR), I ran this test with the 120x120 asset, and there doesn't seem to be a significant performance impact.


Author checklist

  • I have submitted a Contributor License Agreement
  • I have added my name to CONTRIBUTORS.md
  • I have updated CHANGES.md with a short summary of my change
  • I have added or updated unit tests to ensure consistent code coverage
  • I have updated the inline documentation, and included code examples where relevant
  • I have performed a self-review of my code

Copy link

github-actions bot commented Dec 5, 2024

Thank you for the pull request, @javagl!

✅ We can confirm we have a CLA on file for you.

@ggetz
Copy link
Contributor

ggetz commented Dec 11, 2024

Thanks @javagl!

@jjspace Can you please review?

@ggetz ggetz requested a review from jjspace December 11, 2024 20:06
Copy link
Contributor

@jjspace jjspace left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I tested with the asset you provided and did confirm it works on this branch and not in main. I did have a few comments about the new function itself though.

* Cesium.addAll(source, target);
* // The target is now [ 0, 1, 2, 3, 4, 5 ]
*/
function addAll(source, target) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think I like the name addAll. I don't feel it's descriptive enough to understand what it does without looking at the documentation or the function itself. addAllToArray or maybe combineArrays could be better? Or, given my previous comment on this topic with this essentially doing what Array.concat does, maybe concatInto or concatIntoArray would be more accurate?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No really strong opinions, just things that I thought about:

  • In the linked (soon to be obsolete) PR, I initially called that function pushAll, but I wanted to emphasize that it does exactly not push...
  • Any name that involves concat could easily be confused with Array.concat, and (as discussed in the other PR), that has a completely different semantic: People might expect it to return a concatenation. I think it's important to make clear that it operates in-place, and on the given target array
  • combine... could be OK (maybe people would expect it to return something there, but not necessarily)

Maybe something like appendAll or append(All?To?)Array or so ...?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Changed to addAllToArray - sounds reasonable and is long and clear enough for a global function.

Comment on lines 31 to 32
const s = source.length;
if (s === 0) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it's a nitpick but in general I think we should avoid single letter variable names outside the looping variables i, j, k, and sometimes l.

Maybe it's just personal preference but I find this a little more readable even if it is more verbose

  const sourceLength = source.length;
  if (sourceLength === 0) {
    return;
  }
  const targetLength = target.length;
  target.length += sourceLength;
  for (let i = 0; i < sourceLength; i++) {
    target[targetLength + i] = source[i];
  }

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I nearly espected that, and can change it accordingly.
But regarding the variable name 1: Here is an animated GIF of l and 1, switching every second:

Cesium L

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Renamed to sourceLength and targetLength

Comment on lines 22 to 25
* const target = [ 0, 1, 2 ];
* const source = [ 3, 4, 5 ];
* Cesium.addAll(source, target);
* // The target is now [ 0, 1, 2, 3, 4, 5 ]
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nitpick on semantics, but this time I do think it actually matters. When I first glanced at this code I expected the order of the arguments to be reversed: addAll(target, source).
This is largely because I am comparing this to similar functions of target.push(...source) or target.concat(source) etc. In all the builtin array functions the "result" is on the left and the "source" is on the right.
Another way I think about it is that this is replacing the syntax [...target, ...source] to put all the target elements first and the source elements second. So the target variable should be the first argument. Even in this example you're doing addAll([ 3, 4, 5 ], [ 0, 1, 2 ]) = [ 0, 1, 2, 3, 4, 5 ]
(Maybe "source" and "target" are the wrong names in the first place)

I think swapping them also aligns better with the idea that the "source" could be null, undefined or []. Then the first argument is always defined even if the second only sometimes is.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I also hesitated here. Leaving the source as the last parameter would allow omitting it, as in
addAll(target);
which is probably not ideal.

But in general, I'm open to changing it, and embarassingly have to point out that commit c00243a ...

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

From my all-time favorite presentation on API design:

Cesium Parameter order

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Swapped the parameters. Fortunately, someone posted the Regex for that, which is to ...
replace (addAllToArray)\(([^,]+), *([^,]+)\)
with $1($3, $2)

packages/engine/Source/Core/addAll.js Outdated Show resolved Hide resolved
@ggetz
Copy link
Contributor

ggetz commented Jan 13, 2025

@javagl Is this ready for review and potential merge again?

@javagl
Copy link
Contributor Author

javagl commented Jan 13, 2025

@ggetz Fine to merge from my perspective, if the comments by @jjspace are resolved.

A tangential connection to #8359 : One of the rules/transforms of Lebap is

arg-spread - use of apply() to spread operator

Which changes this:

CesiumJS lebap

But... as it turned out: The ...spread operator also places the arguments on the stack - so this wouldn't solve the issue that triggered this. It could be a rewrite for readability and more idiomatic code.

An aside: I considered to review the code for cases where normal for-loops could now be replaced by addAllToArray. When they didn't use push.apply, they aren't caught in this PR, and could still benefit in terms of readabiltiy and maybe also performance, due to the pre-allocation of the required .length. But I'm not sure whether searching and identifying these places would be worth the effort...

@ggetz
Copy link
Contributor

ggetz commented Jan 13, 2025

Fine to merge from my perspective, if the comments by @jjspace are resolved.

Great, thank you! @jjspace this is pending another pass from you,

@ggetz
Copy link
Contributor

ggetz commented Jan 13, 2025

A tangential connection to #8359 : One of the rules/transforms of Lebap is...

I think this is a good reason to be targeted about modernization pushes we do, and to not blanket apply transforms: We need to ensure that there are no performance implications when we do so.

When they didn't use push.apply, they aren't caught in this PR, and could still benefit in terms of readabiltiy and maybe also performance, due to the pre-allocation of the required .length

I agree at minimum, this is out of scope with the original scope of this PR and there's no need to do it here. Unless we confirm that there is a performance implication to motivate any other use case, I think we're getting into bikeshedding territory and we should hold off.

Copy link
Contributor

@jjspace jjspace left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is looking good now. Just 2 comments:

  1. Should there be a section added to our Coding Guide (or some other guide) explaining why we want to avoid .push.apply? maybe linking back to the discussion in this PR or the issue?
  2. Optional change but should we actually restrict usage of this function through eslint? It's possible to do that through the no-restricted-syntax rule with either a warning or a full error. @ggetz thoughts on this?
rules: {
  "no-restricted-syntax": [
    "warn",
    {
      // Link to issue here
      selector:
        "CallExpression[callee.object.property.name=push][callee.property.name=apply]",
      message:
        "We prefer to avoid Array.push.apply() in favor of addAllToArray()",
    },
  ],
},

2025-01-14_15-21

@javagl
Copy link
Contributor Author

javagl commented Jan 14, 2025

@jjspace No strong opinion from my side, just minor thoughts:

As of my comment above, one 'Lebap' modernization transform allows converting push.apply to ...spread automatically. So one could make a case that the Coding Guide could just have a hint like "Write modern, idiomatic code". But... the ...spread operator could similarly be discouraged. So these points may indeed be worth mentioning. (I'm pretty sure that even many experienced developers don't have that 'call stack' issue on the radar...).

(BTW: I wasn't aware of the no-restricted-syntax and that it could be defined so easily (well... "easy", when you know the syntax for it 😁 ))

Maybe we should just collect some ideas in some "Possible updates for the Coding Guide" issue, to better keep track of that? (I mean, this could also include other "modernization" hints, like "Don't do array.indexOf(element) !== -1, but array.includes(element)" or so....)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

CesiumJS should not use push.apply for potentially large arrays
3 participants