Stop using zerolog/log.Logger and use logger from context instead #3078

michel-laterman · 2023-11-04T01:28:40Z

What is the problem this PR solves?

I tried to switch the JSON schema generation tool to a version that supports serializing to time.Time, but the change caused a lot of test failures.
Tracking down why tests failed was very difficult because much of fleet-server's codebase (the bulker, for example) writes to zerolog's global logger while other sections use embedded or per-request loggers, so test output is muddled and inconsistent.
Changing the global logger for a single test case introduces a race-condition failure within the test.
An alternate way of getting a "global" logger that can be adjusted per test would make test output much easier to read.

How does this PR solve the problem?

Stop using the global zerolog logger and use one that is passed through the context instead.
Set the context default to the same logger as the global logger (the one controlled through fleet-server's logger package) so that nothing changes when fleet-server runs normally. Test cases can easily swap it out, so log output is tied to the specific test and errors are easier to track.
Change how the fleet-server logger package reloads its config when adjusting for a new output.

How to test this PR locally

make test

Stop using the global zerolog logger and use one that is passed through the context instead. When fleet-server is running normally, the context logger defaults to the same logger as the global one. For test cases it is much easier to change, so log output can be tied to the specific test, making errors easier to track.

Change logger reload to clone the parent logger with a new output instead of creating a new logger. Fix integration tests. Use context.TODO instead of background where contexts are not passed.

Across various previous PRs we disabled setting the logger across a lot of integration tests due to race conditions. Re-enable across all integration tests.

michel-laterman

This is a very tedious PR where I mostly just replaced log. with zerolog.Ctx(ctx).

michel-laterman · 2023-11-07T01:06:29Z

internal/pkg/action/dispatcher.go

@@ -84,7 +84,7 @@ func (d *Dispatcher) Subscribe(agentID string, seqNo sqn.SeqNo) *Sub {
 	sz := len(d.subs)
 	d.mx.Unlock()

-	log.Trace().Str(logger.AgentID, agentID).Int("sz", sz).Msg("Subscribed to action dispatcher")
+	zerolog.Ctx(context.TODO()).Trace().Str(logger.AgentID, agentID).Int("sz", sz).Msg("Subscribed to action dispatcher")


context.TODO is used in functions that don't take a context.

Should we create a follow-up issue for adding a context to these functions?

michel-laterman · 2023-11-07T15:56:29Z

internal/pkg/server/fleet_integration_test.go

 	srv, err := startTestServer(t, ctx)
 	require.NoError(t, err)
+	ctx = testlog.SetLogger(t).WithContext(ctx)


I'm setting the test logger after startTestServer. Setting it before can lead to race conditions in the test code.

michel-laterman · 2023-11-07T15:59:51Z

internal/pkg/logger/logger.go

+		l.log = l.log.Output(out)
+		l.sync = wr
+
+		log.Logger = l.log
+		zerolog.DefaultContextLogger = &l.log // introduces race conditions in integration test?


The "biggest" change I made in this PR is here, in how the logger is replaced on a reload operation.
This was done in an attempt to fix race conditions in the integration tests. It didn't resolve them, but it made it very clear what this part of the reload operation actually does.

kpollich

This makes sense to me, and thanks for dealing with the tedium of replacing or updating all function signatures to accept the new logger context. We do this (passing a logger around) all over Kibana too, so I'm pretty comfortable with this pattern - especially since it makes writing tests/assertions for specific logging cases much easier. LGTM 🚀

nchaulet

LGTM 🚀

michel-laterman · 2023-11-07T19:00:49Z

This PR was to replace all global refs. We still pass zerolog.Logger objects around in some function signatures; these should at least now come from the context.
If needed, we can create a follow-up issue if we really want to remove the explicit logger passes in our codebase (I know this is a common occurrence in the api package).

elastic-sonarqube · 2023-11-07T19:17:09Z

SonarQube Quality Gate

0 Bugs
0 Vulnerabilities
0 Security Hotspots
2 Code Smells

50.3% Coverage
1.6% Duplication

michel-laterman added the Team:Fleet and tech debt labels Nov 4, 2023

michel-laterman added 2 commits November 6, 2023 12:28

Fix integration tests, change how logger is reloaded

85c5241

Change logger reload to clone the parent logger with a new output instead of creating a new logger. Fix integration tests. Use context.TODO instead of background where contexts are not passed.

Use context logger across integration tests

dd7d8f8

Across various previous PRs we disabled setting the logger across a lot of integration tests due to race conditions. Re-enable across all integration tests.

michel-laterman commented Nov 7, 2023

michel-laterman marked this pull request as ready for review November 7, 2023 16:28

michel-laterman requested a review from a team as a code owner November 7, 2023 16:28

kpollich approved these changes Nov 7, 2023

nchaulet reviewed Nov 7, 2023

Merge branch 'main' into context-logger

4cf8c67

michel-laterman enabled auto-merge (squash) November 7, 2023 19:01

michel-laterman merged commit 0d439e6 into elastic:main Nov 7, 2023
9 checks passed

michel-laterman mentioned this pull request Nov 7, 2023

context.TODO and loggers #3087

Open

michel-laterman deleted the context-logger branch November 7, 2023 23:28
