
Setup benchmarks at CI/CD #560

Closed
norberttech opened this issue Oct 10, 2023 · 0 comments
Assignees: stloyd
Labels: ci/cd, Developer Experience
Milestone: 0.5.0

norberttech (Member) commented Oct 10, 2023

As the project grows, it becomes easier to miss performance degradations, as happened in #558.

We should create a set of benchmarks for each adapter/core/lib and run them in CI/CD so we can at least manually check which PRs introduce bottlenecks.
The ideal solution would be to store those benchmark results as workflow artifacts after each merge to 1.x and compare them with the benchmarks of newly opened PRs; a sketch of how that comparison could work follows.
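
The issue doesn't name a tool, but assuming PHPBench were picked (an assumption, not something decided here), its tag/ref mechanism maps directly onto the artifact idea: a run after merging to 1.x stores tagged results, phpbench's local storage directory gets uploaded as a workflow artifact, and a PR run compares itself against the downloaded baseline. A minimal sketch:

```sh
# After merge to 1.x: run the suite and tag the results, then upload
# phpbench's local storage directory as a workflow artifact.
vendor/bin/phpbench run --tag=baseline --progress=none

# On a newly opened PR: download that artifact first, then run the same
# suite against the stored baseline to surface per-benchmark regressions.
vendor/bin/phpbench run --ref=baseline --report=aggregate
```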


I was thinking about creating separate benchmarks for specific building blocks, for example:

  1. Extractors - we could come up with a dataset schema, save it in all supported file formats, and benchmark extraction alone, without performing any operations on the dataset (see the sketch after this list).
  2. Transformers - since we reduced the number of transformers, keeping only the critical ones, we might want to start with the most frequently used, like the one that evaluates expressions. We can take a similar approach here, but instead of using extractors, we can pass prepared Rows directly and measure the performance of the transformations themselves.
  3. Expressions - just like with Transformers, but here we don't even need Rows; a single Row should be enough.
  4. Loaders - similarly to Transformers, prepare Rows and load them into the destination directly.
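
To make ideas 1 and 3 concrete, here is a minimal PHPBench-style sketch. Only the `PhpBench\Attributes` classes are the real PHPBench API; `CsvExtractor`, `Row::create()`, `int_entry()`, `str_entry()`, `ref()` and `lit()` approximate Flow's building blocks, and their exact names and signatures are assumptions rather than the final harness:

```php
<?php

declare(strict_types=1);

namespace Flow\ETL\Benchmark;

use PhpBench\Attributes\Iterations;
use PhpBench\Attributes\Revs;

final class BuildingBlocksBench
{
    #[Revs(5)]
    #[Iterations(3)]
    public function bench_csv_extraction() : void
    {
        // (1) Extractors: drain the generator so the full extraction cost
        // is measured, without applying any operations to the dataset.
        $extractor = new CsvExtractor(__DIR__ . '/Fixtures/dataset_10k.csv');

        foreach ($extractor->extract() as $rows) {
            // no-op, we only measure extraction itself
        }
    }

    #[Revs(1000)]
    #[Iterations(5)]
    public function bench_expression_eval() : void
    {
        // (3) Expressions: a single prepared Row is enough; evaluate the
        // expression directly, bypassing extractors and transformers.
        // (In a real suite the Row would be built in a #[BeforeMethods]
        // hook so that only eval() itself is measured.)
        $row = Row::create(int_entry('id', 1), str_entry('name', 'flow'));

        ref('id')->plus(lit(1))->eval($row);
    }
}
```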

Those are very granular benchmarks that test each building block in isolation, providing clear insights about every element. On top of that, I would still benchmark entire pipelines on a selected subset of the most frequently used extractors/loaders/transformers (we would need to develop a few scenarios here); one such scenario is sketched below.
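
Sketching that pipeline-level idea under the same caveats: the fluent `read()`/`withEntry()`/`write()`/`run()` chain below mirrors the general shape of Flow's API, but the exact DSL calls are assumptions.

```php
<?php

declare(strict_types=1);

namespace Flow\ETL\Benchmark;

use PhpBench\Attributes\Iterations;
use PhpBench\Attributes\Revs;

final class PipelineBench
{
    #[Revs(3)]
    #[Iterations(3)]
    public function bench_csv_to_parquet_scenario() : void
    {
        // One end-to-end scenario: extract from CSV, apply a frequently
        // used transformation, load to Parquet, measuring the full pipeline.
        (new Flow())
            ->read(new CsvExtractor(__DIR__ . '/Fixtures/dataset_10k.csv'))
            ->withEntry('id_plus_one', ref('id')->plus(lit(1)))
            ->write(new ParquetLoader(\sys_get_temp_dir() . '/dataset.parquet'))
            ->run();
    }
}
```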

@norberttech norberttech converted this from a draft issue Oct 10, 2023
@stloyd stloyd moved this from Todo to In Progress in Roadmap Oct 14, 2023
@stloyd stloyd self-assigned this Oct 16, 2023
@stloyd stloyd added the ci/cd and Developer Experience labels Oct 16, 2023
@stloyd stloyd changed the title Setup benchmakrs at CI/CD Setup benchmarks at CI/CD Oct 16, 2023
@norberttech norberttech moved this from In Progress to Done in Roadmap Oct 25, 2023
@norberttech norberttech added this to the 0.5.0 milestone Nov 6, 2023