[Bug]: Upstream dependencies should alert AMO developers promptly #15308

KevinMind · 2025-01-27T12:28:10Z

What happened?

We have repeatedly run into outages in upstream dependencies (SocketLabs for emails, Cinder for abuse reports) where the service was down for some period of time and we only found out indirectly, some time later.

What did you expect to happen?

When a critical upstream dependency is down, we should be alerted promptly, probably via Slack in our production channel.

Is there an existing issue for this?

I have searched the existing issues

┆Issue is synchronized with this Jira Task

KevinMind · 2025-01-27T12:31:37Z

Idea: We can reuse an existing path for sending Slack notifications by adding a scheduled GitHub Actions workflow that polls our monitor.json endpoint; if any service reports "state" == false, we post a message in Slack.

Could run it every 5 minutes.

https://addons.mozilla.org/services/monitor.json
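
A minimal sketch of the check a scheduled workflow could run, using only the Python standard library. It assumes the endpoint returns a JSON object mapping each service name to an object with a boolean "state" key, and that alerts go out through a Slack incoming webhook. The function names, the message text, and the webhook are all hypothetical and not taken from addons-server.

```python
import json
import urllib.request

MONITOR_URL = "https://addons.mozilla.org/services/monitor.json"


def failing_services(monitors: dict) -> list[str]:
    """Return the sorted names of services whose "state" is exactly False.

    Assumed payload shape: {"service_name": {"state": bool, ...}, ...}
    """
    return sorted(
        name
        for name, result in monitors.items()
        if isinstance(result, dict) and result.get("state") is False
    )


def build_alert(failed: list[str]) -> str:
    """Format the (hypothetical) Slack message text for failed services."""
    return "AMO upstream dependencies failing: " + ", ".join(failed)


def fetch_monitors(url: str = MONITOR_URL, timeout: float = 10.0) -> dict:
    """Download and decode the monitor payload."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)


def notify_slack(webhook_url: str, text: str, timeout: float = 10.0) -> None:
    """POST a plain-text message to a Slack incoming webhook (assumed setup)."""
    body = json.dumps({"text": text}).encode("utf-8")
    request = urllib.request.Request(
        webhook_url,
        data=body,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout):
        pass


def check_and_alert(webhook_url: str) -> list[str]:
    """Run one check; notify Slack only when at least one service is down."""
    failed = failing_services(fetch_monitors())
    if failed:
        notify_slack(webhook_url, build_alert(failed))
    return failed
```

A workflow on a 5 minute schedule could call `check_and_alert` with the webhook URL read from a repository secret. Note that `urllib.request.urlopen` raises `HTTPError` on non-2xx responses, so if the endpoint signals failures through its status code, that case would need separate handling.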

Idea: we could probably integrate this with PagerDuty somehow, but we don't really use PagerDuty in AMO (yet), and even then we cannot post to Slack directly from PagerDuty because our prod channel is private.

Idea: we could use an AMO-controlled cron job, but this would require some way to ping Slack directly from AMO. That might not be a bad thing to have, but it is still more work than the first idea.

KevinMind · 2025-01-27T14:02:22Z

https://mozilla-hub.atlassian.net/servicedesk/customer/portal/4/SDD-29163?created=true

KevinMind added the needs:info and repository:addons-server labels Jan 27, 2025

KevinMind mentioned this issue Jan 27, 2025

Add slack message API to send message on slack mozilla/addons-server#23030 (Draft, 5 tasks)

data-sync-user removed the needs:info label Jan 28, 2025

KevinMind mentioned this issue Jan 29, 2025

Add GitHub Actions health check workflow mozilla/addons-server#23036 (Open, 5 tasks)

data-sync-user assigned KevinMind Jan 29, 2025
