Stabilize Infrastructure on Jenkins IntegTest regarding setup test env before running the test #4036

Closed
eirsep opened this issue Sep 18, 2023 · 8 comments · Fixed by #4039
@eirsep
Member

eirsep commented Sep 18, 2023

Describe the bug

The Security Analytics plugin is seeing integ test failures in the 2.10 RC build on Jenkins:
opensearch-project/security-analytics#594

We are unable to reproduce the client timeout exception seen in the CI failure above. We suspect it may be caused by a resource crunch on the instance, which simultaneously runs several containers, each building a different plugin.

To reproduce

opensearch-project/security-analytics#594

Expected behavior

Reserve the right amount of hardware for each container so that no test fails because of hardware settings or a CPU/memory crunch.

Screenshots

opensearch-project/security-analytics#594

Host / Environment

opensearch-project/security-analytics#594

Additional context

opensearch-project/security-analytics#594

Relevant log output

https://github.com/opensearch-project/security-analytics/issues/594
@eirsep eirsep added bug Something isn't working untriaged Issues that have not yet been triaged labels Sep 18, 2023
@peterzhuamazon peterzhuamazon self-assigned this Sep 18, 2023
@peterzhuamazon peterzhuamazon added integtest cicd docker jenkins Jenkins related issue distributions enhancement New Enhancement and removed untriaged Issues that have not yet been triaged bug Something isn't working labels Sep 18, 2023
@peterzhuamazon peterzhuamazon moved this from Backlog to In Progress in OpenSearch Engineering Effectiveness Sep 18, 2023
@peterzhuamazon
Member

peterzhuamazon commented Sep 18, 2023

@peterzhuamazon peterzhuamazon linked a pull request Sep 19, 2023 that will close this issue
@gaiksaya gaiksaya reopened this Sep 19, 2023
@github-actions github-actions bot added the untriaged Issues that have not yet been triaged label Sep 19, 2023
@gaiksaya
Member

gaiksaya commented Sep 19, 2023

Hi @eirsep @praveensameneni

We reverted the above resource constraint and it looks like the tests still went through. Since the tests were ignored in the recent RC, I believe the issue is on the plugin side. No issue was found on the infra side.

@gaiksaya gaiksaya removed the untriaged Issues that have not yet been triaged label Sep 19, 2023
@peterzhuamazon peterzhuamazon moved this from Not started to In Progress in OpenSearch Engineering Effectiveness Sep 21, 2023
@github-actions github-actions bot added the untriaged Issues that have not yet been triaged label Sep 21, 2023
@peterzhuamazon peterzhuamazon changed the title Investigate if there is a hardware crunch issue when Jenkins runs integ tests for plugins Stablize Infrastructure on Jenkins IntegTest regarding setup test env before running the test Sep 21, 2023
@peterzhuamazon
Member

peterzhuamazon commented Sep 21, 2023

Need more investigation.

Can try this new combination just for integTest:

  1. m5.2xlarge runner with 2 executors, each with 4 CPUs / 16 GB.
  2. Add pigz to speed up the stash/unstash process.
  3. Windows still needs some observation at this point.

Thanks.
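
The first item in the combination proposed above can be checked with simple arithmetic: an m5.2xlarge instance has 8 vCPUs and 32 GiB of memory, so two executors get 4 CPUs and 16 GiB each. The sketch below is illustrative only. The helper names are hypothetical, and the thread does not say how the limits were applied; the `--cpus` and `--memory` flags of `docker run` are shown as one possible way to enforce them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Host:
    """Resources of the runner instance."""
    vcpus: int
    memory_gib: int


def per_executor_limits(host: Host, executors: int) -> dict:
    """Split the host evenly across executors (hypothetical helper)."""
    if executors <= 0:
        raise ValueError("executors must be positive")
    if host.vcpus % executors or host.memory_gib % executors:
        raise ValueError("host resources do not divide evenly across executors")
    return {
        "cpus": host.vcpus // executors,
        "memory_gib": host.memory_gib // executors,
    }


def docker_run_flags(limits: dict) -> str:
    """Render the limits as `docker run` resource flags (assumed enforcement method)."""
    return f"--cpus={limits['cpus']} --memory={limits['memory_gib']}g"


# m5.2xlarge: 8 vCPUs, 32 GiB memory.
M5_2XLARGE = Host(vcpus=8, memory_gib=32)

limits = per_executor_limits(M5_2XLARGE, executors=2)
print(limits)
print(docker_run_flags(limits))
```

With an even split, the two executors together use the whole instance, leaving no headroom for the Jenkins agent itself; that trade-off is worth keeping in mind when sizing the runner.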

@praveensameneni
Member

Hi @eirsep @praveensameneni

We reverted the above resource constraint and it looks like the tests still went through. Since the tests were ignored in the recent RC, I believe the issue is on the plugin side. No issue was found on the infra side.

@gaiksaya, can we re-revert the change and run the tests until the release?

@bbarani
Member

bbarani commented Oct 2, 2023

@gaiksaya @peterzhuamazon Do we have anything pending on this issue or can we close it?

@gaiksaya
Member

gaiksaya commented Oct 2, 2023

We were unable to conclude whether infra was actually affecting the tests, since the tests were ignored before they could be evaluated. I would suggest not ignoring the tests and continuing to monitor them for 2.11.0.

@Divyaasm Divyaasm removed the untriaged Issues that have not yet been triaged label Oct 10, 2023
@Divyaasm
Collaborator

Closing the issue. Please refer to the related meta issue #3966 for more information.


6 participants