[elastic_agent] Add additional output metrics to dashboards #8834

leehinman · 2024-01-05T18:08:49Z

Proposed commit message

Add dashboards for the following metrics to the [Elastic Agent] Agent metrics Dashboard.

beat.stats.libbeat.output.events.failed
beat.stats.libbeat.output.events.toomany
beat.stats.libbeat.output.events.dropped
beat.stats.libbeat.output.batches.split

All four of these are useful to see if your agent is having non-fatal errors when using the output which can result in degraded performance.

Checklist

I have reviewed tips for building integrations and this pull request is aligned with them.
I have verified that all data streams collect metrics or logs.
I have added an entry to my package's changelog.yml file.
I have verified that Kibana version constraints are current according to guidelines.

Author's Checklist

[ ]

How to test this PR locally

checkout PR
cd packages/elastic_agent
elastic-package stack up --version <8.11.2 or greater> -d

View the [Elastic Agent] Agent metrics Dashboard

Related issues

Relates Dashboard for 30s metrics elastic-agent#3826

Screenshots

- beat.stats.libbeat.output.events.failed - beat.stats.libbeat.output.events.toomany - beat.stats.libbeat.output.events.dropped - beat.stats.libbeat.output.batches.split All four of these are useful to see if your agent is having non-fatal errors when using the output which can result in degraded performance.

packages/elastic_agent/changelog.yml

elasticmachine · 2024-01-06T17:39:16Z

Pinging @elastic/elastic-agent (Team:Elastic-Agent)

P1llus

A few comments even though I approve:

I find it a bit weird they are all 0 if you are getting data in, was it possible to test that the graphs are actually reading the stats correctly?
These are metrics from elastic_agent? Because you only modified elastic_agent_metrics datastream, but there are other metrics data_streams inside the elastic_agent package, like filebeat_metrics etc?
I would add a filter to exclude (empty) from these, there should be an option to exclude null values I believe?

cmacknz · 2024-01-12T18:19:58Z

These are metrics from elastic_agent? Because you only modified elastic_agent_metrics datastream, but there are other metrics data_streams inside the elastic_agent package, like filebeat_metrics etc?

Thanks for catching this, these changes should be applied to each individual Beat metrics data stream. The agent itself doesn't have output metrics.

cmacknz

See above, these should be on the Beat metrics data streams.

nimarezainia · 2024-01-15T07:59:43Z

@leehinman thanks for these dashboards. Is there any chance we could also add the recent output latency metrics (histogram) that were added? (elastic/beats#37258). If not I can create a new issue. thanks

leehinman · 2024-01-22T04:30:48Z

@leehinman thanks for these dashboards. Is there any chance we could also add the recent output latency metrics (histogram) that were added? (elastic/beats#37258). If not I can create a new issue. thanks

yep, I'll add it. When I started the latency histogram wasn't done yet.

leehinman · 2024-01-25T20:27:16Z

A few comments even though I approve:

1. I find it a bit weird they are all 0 if you are getting data in, was it possible to test that the graphs are actually reading the stats correctly?

these are error conditions, dropped, failed, toomany & split. So under normal operation these should be 0. If they go up then back to 0, you have a transient error you should look at, and if they stay up well, then you have a persistent problem to investigate. I'll see if I can induce some errors to get these populated.

2. These are metrics from elastic_agent? Because you only modified elastic_agent_metrics datastream, but there are other metrics data_streams inside the elastic_agent package, like filebeat_metrics etc?

So this brings up an interesting question. I cloned the existing "Total events rate /s" panel. Does this mean that the Total events rate doesn't include everything? They both use the filter data_stream.dataset: elastic_agent.*, so it does pick up filebeat & metricbeat.

3. I would add a filter to exclude `(empty)` from these, there should be an option to exclude null values I believe?

I didn't see a filter to exclude empty in Lens, but I did find visual option to use a straight line when no data, which looks more like I would expect.

botelastic · 2024-02-24T20:41:56Z

Hi! We just realized that we haven't looked into this PR in a while. We're sorry! We're labeling this issue as Stale to make it hit our filters and make sure we get back to it as soon as possible. In the meantime, it'd be extremely helpful if you could take a look at it as well and confirm its relevance. A simple comment with a nice emoji will be enough :+1. Thank you for your contribution!

botelastic · 2024-03-25T20:42:10Z

Hi! This PR has been stale for a while and we're going to close it as part of our cleanup procedure. We appreciate your contribution and would like to apologize if we have not been able to review it, due to the current heavy load of the team. Feel free to re-open this PR if you think it should stay open and is worth rebasing. Thank you for your contribution!

leehinman force-pushed the 3854_elastic_agent_output_metrics branch from ea690d4 to 9120e2f Compare January 5, 2024 18:09

leehinman marked this pull request as ready for review January 5, 2024 18:47

leehinman requested a review from a team as a code owner January 5, 2024 18:47

leehinman requested a review from P1llus January 5, 2024 18:47

cmacknz approved these changes Jan 5, 2024

View reviewed changes

packages/elastic_agent/changelog.yml Outdated Show resolved Hide resolved

pierrehilbert added the Team:Elastic-Agent Label for the Agent team label Jan 6, 2024

add more detail to changelog

1face9b

P1llus approved these changes Jan 12, 2024

View reviewed changes

cmacknz requested changes Jan 12, 2024

View reviewed changes

leehinman mentioned this pull request Jan 23, 2024

Integration for 30s metrics elastic/elastic-agent#3854

Open

3 tasks

botelastic bot added the Stalled label Feb 24, 2024

botelastic bot closed this Mar 25, 2024

cmacknz mentioned this pull request Apr 4, 2024

Agents Metrics dashboards / visualizations are incorrect since the tsds was introduced with the new counter type #9724

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[elastic_agent] Add additional output metrics to dashboards #8834

[elastic_agent] Add additional output metrics to dashboards #8834

leehinman commented Jan 5, 2024

elasticmachine commented Jan 6, 2024

P1llus left a comment

cmacknz commented Jan 12, 2024

cmacknz left a comment

nimarezainia commented Jan 15, 2024

leehinman commented Jan 22, 2024

leehinman commented Jan 25, 2024

botelastic bot commented Feb 24, 2024

botelastic bot commented Mar 25, 2024

[elastic_agent] Add additional output metrics to dashboards #8834

[elastic_agent] Add additional output metrics to dashboards #8834

Conversation

leehinman commented Jan 5, 2024

Proposed commit message

Checklist

Author's Checklist

How to test this PR locally

Related issues

Screenshots

elasticmachine commented Jan 6, 2024

P1llus left a comment

Choose a reason for hiding this comment

cmacknz commented Jan 12, 2024

cmacknz left a comment

Choose a reason for hiding this comment

nimarezainia commented Jan 15, 2024

leehinman commented Jan 22, 2024

leehinman commented Jan 25, 2024

botelastic bot commented Feb 24, 2024

botelastic bot commented Mar 25, 2024