Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can we expand the Grafana stack.... #126

Open
ewassef opened this issue Jun 23, 2022 · 3 comments
Open

Can we expand the Grafana stack.... #126

ewassef opened this issue Jun 23, 2022 · 3 comments

Comments

@ewassef
Copy link

ewassef commented Jun 23, 2022

It would be a better fit to replace promtail and the loki ingester stack with grafana agent and to add Tempo into the MLA https://grafana.com/docs/tempo/latest/grafana-agent/.

This would reduce the processing and configuration impact on the MLA stack and add a new feature for distributed tracing. I would imagine the swap wouldnt be too difficult with the MLA gateway already configured for promtail to be modified to accept the agent and then to install the tempo chart as well

@ewassef
Copy link
Author

ewassef commented Jun 30, 2022

@toschneck any one you know we can discuss this with?

@csengerszabo
Copy link
Contributor

@ewassef, can you please elaborate on:

  • your use case where this was helpful
  • how do you mean it would reduce the processing and configuration impact on the MLA stack
  • also on the new feature for distributed tracing, why this would be beneficial

@ewassef
Copy link
Author

ewassef commented Jul 27, 2022

Hi @csengerszabo

The MLA stack is built on the Grafana stack and the utilizes both the promtail configurations on the user cluster and the loki ingesters on the seed. The functionality of both of these can be replaced by the single grafana agent AND include open telemetry information as well. Furthermore, the agent provides these additional benefits:

Provides a one-stop solution for collecting metrics, logs, and traces.
Collects out-of-the-box telemetry from popular projects like MySQL through integrations
Works seamlessly with the Grafana Stack. Alternatively, metrics can be sent to any Prometheus-compatible endpoint, and traces can be sent to any OTLP-compatible endpoint.
Offers new solutions to help scale metrics collection like host_filtering and sharding
Provides the Grafana Agent Operator, which enables individual teams to manage their configurations through PodMonitors, ServiceMonitors, and Probes.
see: https://grafana.com/docs/agent/latest/

So, it would be helpful in more fine-grained log, metric and telemetry filtering (which we havent been able to do today), a CRD based way to add monitors etc vs using the KKP ui and, finally, adding open telementry onto the stack.

Because its a single agent, the load would be halved right now since we are only collecting logs and metrics, but would be a third once telemetry is added.

Finally, Distributed tracing is becoming a requirement with these microservices and distributed systems as a way to determine where items are failing and how to recover. These three components would rival any commercial product and be a no-brainer to adopt in the MLA stack.
see https://github.com/grafana/tempo

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants