[Es Archiver][Load Action] High Water Mark to 5000, Concurrency 4; Override-able #167993


wayneseymour
Member

@wayneseymour wayneseymour commented Oct 4, 2023

NEW CHANGE COMING

This PR now makes the more performant load flow the default, while still allowing it to be overridden.

Summary

To speed up the load action in general, and in serverless in particular, this PR:

Increases the stream batch size (high water mark) from 300 to 5000.
Uses the concurrency option of the bulk call API, set to 4.
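
To illustrate the two settings above, here is a minimal sketch of batching with bounded concurrency and overridable defaults. It is not the `@kbn/es-archiver` implementation: `LoadPerformanceOptions`, `resolvePerformance`, `toBatches`, `loadDocs`, and the `sendBulk` callback are hypothetical names, and plain arrays stand in for the real stream and bulk API call.

```typescript
// Hypothetical sketch; names and structure are assumptions, not the es-archiver source.
interface LoadPerformanceOptions {
  batchSize: number; // documents buffered per bulk request (the "high water mark")
  concurrency: number; // bulk requests allowed in flight at once
}

// Defaults taken from the PR description: 5000 documents per batch, concurrency 4.
const DEFAULT_PERFORMANCE: LoadPerformanceOptions = { batchSize: 5000, concurrency: 4 };

// Callers may override either value; anything left unset falls back to the default.
function resolvePerformance(
  overrides: Partial<LoadPerformanceOptions> = {}
): LoadPerformanceOptions {
  return {
    batchSize: overrides.batchSize ?? DEFAULT_PERFORMANCE.batchSize,
    concurrency: overrides.concurrency ?? DEFAULT_PERFORMANCE.concurrency,
  };
}

// Split the documents into consecutive batches of at most `batchSize` items.
function toBatches<T>(docs: T[], batchSize: number): T[][] {
  if (batchSize < 1) throw new Error('batchSize must be at least 1');
  const batches: T[][] = [];
  for (let i = 0; i < docs.length; i += batchSize) {
    batches.push(docs.slice(i, i + batchSize));
  }
  return batches;
}

// Send every batch through `sendBulk`, keeping at most `concurrency` calls in flight.
// Resolves to the number of batches sent.
async function loadDocs<T>(
  docs: T[],
  sendBulk: (batch: T[]) => Promise<void>,
  overrides: Partial<LoadPerformanceOptions> = {}
): Promise<number> {
  const { batchSize, concurrency } = resolvePerformance(overrides);
  const batches = toBatches(docs, batchSize);
  let next = 0;

  // Each worker repeatedly claims the next unsent batch until none remain.
  const worker = async (): Promise<void> => {
    while (next < batches.length) {
      const batch = batches[next++];
      await sendBulk(batch);
    }
  };

  const workerCount = Math.min(concurrency, batches.length);
  await Promise.all(Array.from({ length: workerCount }, () => worker()));
  return batches.length;
}
```

Under these assumptions, a caller who wants the older, slower behavior could pass `{ batchSize: 300, concurrency: 1 }` as the overrides, while everyone else gets the faster defaults without changing their code.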

Summary of Performance Characteristics

Using a list of over 70 archives from within the repo, we benchmarked load times before and after changing the performance characteristics.

One standout example is the x-pack/test/functional/es_archives/getting_started/shakespeare archive.
Its load time decreased from ~2.5 minutes to less than 18 seconds in a serverless environment.

For slightly more than a handful of archives the load time actually got worse, but most of those are only milliseconds slower.
None is a full second slower.
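
As a rough check on the shakespeare figures quoted above, the speedup can be computed directly. The `speedup` helper is a hypothetical name used only for this illustration, and the inputs are the approximate values from the description (~2.5 minutes before, just under 18 seconds after), so the result is a lower bound rather than a measured number.

```typescript
// Hypothetical helper: ratio of the old load time to the new one.
function speedup(beforeMs: number, afterMs: number): number {
  return beforeMs / afterMs;
}

// Approximate figures from the description, converted to milliseconds.
const shakespeareBeforeMs = 2.5 * 60 * 1000; // ~2.5 minutes
const shakespeareAfterMs = 18 * 1000; // upper bound: "less than 18 seconds"

const shakespeareSpeedup = speedup(shakespeareBeforeMs, shakespeareAfterMs);
console.log(`at least ${shakespeareSpeedup.toFixed(1)}x faster`); // at least 8.3x faster
```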

@wayneseymour wayneseymour added release_note:skip Skip the PR/issue when compiling release notes backport:skip This commit does not require backporting FTR ES-ARCHIVER labels Oct 4, 2023
@wayneseymour wayneseymour self-assigned this Oct 4, 2023
@wayneseymour wayneseymour changed the title [Es Archiver] Higher Water Mark for Load Action [Es Archiver] Higher Water Mark for Load Action w/o benchmark tool Oct 4, 2023
@wayneseymour
Member Author

@elasticmachine merge upstream

@wayneseymour
Member Author

@elasticmachine merge upstream

@wayneseymour wayneseymour force-pushed the esarch-fixed-higher-watermark-impl-minimal-code-chages-without-benchmark-tool branch 8 times, most recently from 51c2cd0 to 6bfb9b1 Compare October 26, 2023 08:54
@wayneseymour wayneseymour force-pushed the esarch-fixed-higher-watermark-impl-minimal-code-chages-without-benchmark-tool branch 6 times, most recently from fe67122 to 2b885fb Compare October 31, 2023 13:34
@wayneseymour wayneseymour force-pushed the esarch-fixed-higher-watermark-impl-minimal-code-chages-without-benchmark-tool branch 3 times, most recently from b9f0d5d to 336a21d Compare November 7, 2023 12:28
@wayneseymour wayneseymour force-pushed the esarch-fixed-higher-watermark-impl-minimal-code-chages-without-benchmark-tool branch 2 times, most recently from 8940657 to 050e4ab Compare November 8, 2023 10:14
@wayneseymour
Member Author

@elasticmachine merge upstream

@wayneseymour
Member Author

/ci

…without-benchmark-tool' of github.com:wayneseymour/kibana into esarch-fixed-higher-watermark-impl-minimal-code-chages-without-benchmark-tool
@wayneseymour
Member Author

/ci

@wayneseymour
Member Author

/ci

@wayneseymour
Member Author

/ci

@wayneseymour wayneseymour changed the title [Es Archiver][Load Action] High Water Mark to 5000, Concurrency 4 [Es Archiver][Load Action] High Water Mark to 5000, Concurrency 4, but Optional Jan 6, 2024
@wayneseymour
Member Author

/ci

@wayneseymour wayneseymour changed the title [Es Archiver][Load Action] High Water Mark to 5000, Concurrency 4, but Optional [Es Archiver][Load Action] High Water Mark to 5000, Concurrency 4; Override-able Jan 8, 2024
@wayneseymour
Member Author

@elasticmachine merge upstream

@wayneseymour
Member Author

/ci

…without-benchmark-tool' of github.com:wayneseymour/kibana into esarch-fixed-higher-watermark-impl-minimal-code-chages-without-benchmark-tool
@wayneseymour
Member Author

@elasticmachine merge upstream

@wayneseymour
Member Author

/ci

@kibana-ci
Collaborator

kibana-ci commented Jan 9, 2024

💔 Build Failed

Failed CI Steps

Test Failures

  • [job] [logs] FTR Configs #92 / dashboard Reporting Download CSV Default Saved Search Data Download CSV export of a saved search panel
  • [job] [logs] FTR Configs #92 / dashboard Reporting Download CSV Default Saved Search Data Download CSV export of a saved search panel
  • [job] [logs] FTR Configs #32 / discover Discover CSV Export Generate CSV: archived search generates a report with data
  • [job] [logs] FTR Configs #33 / discover Discover CSV Export Generate CSV: archived search generates a report with data
  • [job] [logs] FTR Configs #84 / discover Discover CSV Export Generate CSV: archived search generates a report with data
  • [job] [logs] FTR Configs #33 / discover Discover CSV Export Generate CSV: archived search generates a report with data
  • [job] [logs] FTR Configs #32 / discover Discover CSV Export Generate CSV: archived search generates a report with data
  • [job] [logs] FTR Configs #84 / discover Discover CSV Export Generate CSV: archived search generates a report with data
  • [job] [logs] FTR Configs #27 / discover Discover CSV Export Generate CSV: new search generates a report from a new search with data: default
  • [job] [logs] FTR Configs #27 / discover Discover CSV Export Generate CSV: new search generates a report from a new search with data: default
  • [job] [logs] FTR Configs #40 / Reporting APIs CSV Generation from Saved Search ID export from timebased data view timezone formatting export with custom timezone and timeRange from locator params csv file matches
  • [job] [logs] FTR Configs #40 / Reporting APIs CSV Generation from Saved Search ID export from timebased data view timezone formatting export with custom timezone and timeRange from locator params csv file matches
  • [job] [logs] FTR Configs #43 / Reporting CSV Generation from SearchSource: Dashboard unquoted values Exports CSV with all fields when using defaults
  • [job] [logs] FTR Configs #93 / Reporting CSV Generation from SearchSource: Dashboard unquoted values Exports CSV with all fields when using defaults
  • [job] [logs] FTR Configs #43 / Reporting CSV Generation from SearchSource: Dashboard unquoted values Exports CSV with all fields when using defaults
  • [job] [logs] FTR Configs #93 / Reporting CSV Generation from SearchSource: Dashboard unquoted values Exports CSV with all fields when using defaults

Metrics [docs]

Public APIs missing comments

Total count of every public API that lacks a comment. Target amount is 0. Run node scripts/build_api_docs --plugin [yourplugin] --stats comments for more detailed information.

| id | before | after | diff |
| --- | --- | --- | --- |
| @kbn/es-archiver | 14 | 18 | +4 |

Unknown metric groups

API count

| id | before | after | diff |
| --- | --- | --- | --- |
| @kbn/es-archiver | 27 | 31 | +4 |

History

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

cc @wayneseymour

@wayneseymour
Member Author

Closing this in order to open a new PR with fewer reviewers.

Labels
backport:skip This commit does not require backporting ES-ARCHIVER FTR release_note:skip Skip the PR/issue when compiling release notes
9 participants