Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CI] RelocationIT testIndexSearchAndRelocateConcurrently failing #115991

Open
elasticsearchmachine opened this issue Oct 30, 2024 · 5 comments
Open
Labels
medium-risk An open issue or test failure that is a medium risk to future releases :Search/Search Search-related issues that do not fall into other categories Team:Search Meta label for search team >test-failure Triaged test failures from CI

Comments

@elasticsearchmachine
Copy link
Collaborator

Build Scans:

Reproduction Line:

./gradlew ":server:internalClusterTest" --tests "org.elasticsearch.recovery.RelocationIT.testIndexSearchAndRelocateConcurrently" -Dtests.seed=1831477A5FD3EB9A -Dtests.locale=wae-Latn-CH -Dtests.timezone=Iran -Druntime.java=23

Applicable branches:
8.x

Reproduces locally?:
N/A

Failure History:
See dashboard

Failure Message:

java.lang.Exception: Test abandoned because suite timeout was reached.

Issue Reasons:

  • [8.x] 2 failures in test testIndexSearchAndRelocateConcurrently (0.9% fail rate in 234 executions)
  • [8.x] 2 failures in pipeline elasticsearch-periodic-platform-support (33.3% fail rate in 6 executions)

Note:
This issue was created using new test triage automation. Please report issues or feedback to es-delivery.

@elasticsearchmachine elasticsearchmachine added :Distributed/Distributed A catch all label for anything in the Distributed Area. If you aren't sure, use this one. >test-failure Triaged test failures from CI Team:Distributed Meta label for distributed team needs:risk Requires assignment of a risk label (low, medium, blocker) labels Oct 30, 2024
@elasticsearchmachine
Copy link
Collaborator Author

Pinging @elastic/es-distributed (Team:Distributed)

@nicktindall nicktindall added medium-risk An open issue or test failure that is a medium risk to future releases and removed needs:risk Requires assignment of a risk label (low, medium, blocker) labels Oct 31, 2024
@nicktindall
Copy link
Contributor

Flagging this as medium risk, the test logs some resource leaks before it fails, I think there's a good chance this is a legitimate bug


1> [2024-10-31T00:07:52,271][ERROR][o.e.t.LeakTracker        ] [[Cleaner-0]] LEAK: resource was not cleaned up before it was garbage-collected. |  
-- | --
  | 1> Recent access records: |  
  | 1> Created at: |  
  | 1> 	in [Thread-193][testIndexSearchAndRelocateConcurrently] |  
  | 1> 	org.elasticsearch.action.search.ArraySearchPhaseResults.<init>(ArraySearchPhaseResults.java:28) |  
  | 1> 	org.elasticsearch.action.search.QueryPhaseResultConsumer.<init>(QueryPhaseResultConsumer.java:86) |  
  | 1> 	org.elasticsearch.action.search.SearchPhaseController.newSearchPhaseResults(SearchPhaseController.java:841) |  
  | 1> 	org.elasticsearch.action.search.TransportSearchAction$AsyncSearchActionProvider.newSearchPhase(TransportSearchAction.java:1500) |  
  | 1> 	org.elasticsearch.action.search.TransportSearchAction.executeSearch(TransportSearchAction.java:1305) |  
  | 1> 	org.elasticsearch.action.search.TransportSearchAction.executeLocalSearch(TransportSearchAction.java:1038) |  
  | 1> 	org.elasticsearch.action.search.TransportSearchAction.lambda$executeRequest$4(TransportSearchAction.java:365) |  
  | 1> 	org.elasticsearch.action.ActionListenerImplementations$ResponseWrappingActionListener.onResponse(ActionListenerImplementations.java:247) |  
  | 1> 	org.elasticsearch.index.query.Rewriteable.rewriteAndFetch(Rewriteable.java:109) |  
  | 1> 	org.elasticsearch.index.query.Rewriteable.rewriteAndFetch(Rewriteable.java:77) |  
  | 1> 	org.elasticsearch.action.search.TransportSearchAction.executeRequest(TransportSearchAction.java:531) |  
  | 1> 	org.elasticsearch.action.search.TransportSearchAction.doExecute(TransportSearchAction.java:316) |  
  | 1> 	org.elasticsearch.action.search.TransportSearchAction.doExecute(TransportSearchAction.java:121)

@ywangd
Copy link
Member

ywangd commented Oct 31, 2024

I think this is likely the same or a similar issue as #115056

@ywangd ywangd added :Search/Search Search-related issues that do not fall into other categories and removed :Distributed/Distributed A catch all label for anything in the Distributed Area. If you aren't sure, use this one. labels Oct 31, 2024
@elasticsearchmachine elasticsearchmachine added Team:Search Meta label for search team and removed Team:Distributed Meta label for distributed team labels Oct 31, 2024
@elasticsearchmachine
Copy link
Collaborator Author

Pinging @elastic/es-search (Team:Search)

@ywangd
Copy link
Member

ywangd commented Oct 31, 2024

I rerouted to the search team to take a look. Feel free to bounce it back if it does not feel right.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
medium-risk An open issue or test failure that is a medium risk to future releases :Search/Search Search-related issues that do not fall into other categories Team:Search Meta label for search team >test-failure Triaged test failures from CI
Projects
None yet
Development

No branches or pull requests

3 participants