A couple of us started building an open-source tool after an on-call shift where a simple container restart turned into more hassle than it should. What should have been a 10-second task meant opening a laptop, connecting to a server, and running commands in the middle of the night.
OpsiMate is our attempt to simplify that. It provides a single place to manage Docker containers, Kubernetes pods, and systemd services. We’re also working on integrations with monitoring tools so you can act on alerts and jump directly to dashboards.
It’s still early and a bit rough, but already running in use. We’d really appreciate feedback, ideas, and contributions from the community.
That’s awesome! Grafana, Kibana, and Datadog already cover a lot of ground for observability. Are you planning to add alerting/incident response tools (like PagerDuty, Opsgenie, or Slack integrations) as well? Those could really tie into the “wake up at 3 AM and fix it fast” use case.
BTW, besides restart, what kinds of actions do you currently support for containers/pods/services?
Thanks! At the moment we’re only pulling in alerts from Grafana, but we definitely want to expand into the alerting/incident response space. PagerDuty, Opsgenie, Slack, etc. are all on our list of directions to explore.
Right now we just support the basics—start and stop for containers/pods/services—but the plan is to grow into more advanced actions over time.
A couple of us started building an open-source tool after an on-call shift where a simple container restart turned into more hassle than it should. What should have been a 10-second task meant opening a laptop, connecting to a server, and running commands in the middle of the night.
OpsiMate is our attempt to simplify that. It provides a single place to manage Docker containers, Kubernetes pods, and systemd services. We’re also working on integrations with monitoring tools so you can act on alerts and jump directly to dashboards.
It’s still early and a bit rough, but already running in use. We’d really appreciate feedback, ideas, and contributions from the community.
This looks super useful—love the idea of reducing middle-of-the-night friction. Keep going, excited to see where OpsiMate goes!
thanks!!
I like the name OpsiMate! You really do need a mate when it comes to ops
thanks!!
Which kinds of integrations do you have?
https://opsimate.vercel.app/integrations/overview
Right now we have integrations with Grafana, Kibana, and Datadog, but we’re always open to ideas for new ones.
That’s awesome! Grafana, Kibana, and Datadog already cover a lot of ground for observability. Are you planning to add alerting/incident response tools (like PagerDuty, Opsgenie, or Slack integrations) as well? Those could really tie into the “wake up at 3 AM and fix it fast” use case.
BTW, besides restart, what kinds of actions do you currently support for containers/pods/services?
Thanks! At the moment we’re only pulling in alerts from Grafana, but we definitely want to expand into the alerting/incident response space. PagerDuty, Opsgenie, Slack, etc. are all on our list of directions to explore.
Right now we just support the basics—start and stop for containers/pods/services—but the plan is to grow into more advanced actions over time.
Cool!
thanks!
You got a star from me :)
Nice