Thread

Syed Fazle Rahman

Building Bug0, an AI-native E2E testing platform for modern apps - co-founder & ceo @ Hashnode

Jan 2, 2018

What do you use to monitor your production infrastructure?

Hey awesome developers! :) Happy new year 2018.

What service(s) do you use for tracking errors, checking infrastructure health and alerting team members when production is on?

#programming

Responses(10)

ravi rajus ai

Feb 27, 2018

Bugsnag for Error Reporting
Slack, HipChat and Email for alerting Euro to dollar trends (depends on team)

Many great options have already been mentioned, but I add a few items, since I'm surprised they haven't been mentioned yet.

If you don't want/can't afford the human cost of managing your own Prometheus, _Datadog__ _is a nice managed solution which isn't free but could help you focus on your core features before you can put more energy into saving costs and move to Prometheus. Sine the ops team in my current and previous companies are very very small, this has proven to be a real key item in monitoring and debugging our stacks. (And since they bought logmatic last year, their log solution to complete the metrics and apm tools should be released at some point this year, which would make it even easier to get started and focus on developing the product)

Finally, if you want a quite long list of available quality tools, The Cloud Native Foundation regularly updates their landscape stuff, in which the Observability and Analysis section should help you making sure you consider the best options available. CloudNativeLandscape_v1.0.png and the Github repo if you want to track updates: github.com/cncf/landscape

Side note: Why is this tagged General Programming and don't have the devops or architecture tags? (this is not a rant, I often find myself missing question because I'm getting lost in tags... )

Jan Vladimir Mostert

Idea Incubator

Jan 5, 2018

Custom-built tools for everything integrated into Slack.

sivaram

Giving life to Ideas

Jan 3, 2018

Bugsnag and sentry for few servers both integrated to slack. Keymetrics for monitoring nodejs servers and mongo db monitoring. Statuscake for uptime monitoring(reported to slack).

Md Zaid Imam

Let learning process continue | Principal Product Manager

Jan 3, 2018

Tools we are using currently :

Grafana - Data visualization & Monitoring
Slack - Bit-bucket code push and CI/CD notifications
Orbitera - Cost and Usability tracking
Opas - Kafa metrics monitoring
Pingdom - Monitoring for uptime
Elasticsearch, Logstash and Kibana (ELK) - To centralized the logs for our application Environment
Plivo - Call scheduler for any production or critical downtime
Nagios - For our internal infra alert and for health check notifications
Sonarqube : For code health check
Veracode : To identify vulnerability in codes

Atul Sharma

Full Stack Developer | Cloud Native Applications

Jan 3, 2018

We use custom in house developed monitoring tool. Their is a client daemon installed on every server to collect server metrics, its difficult to monitor when you have 50+ servers :P .

We have certain filters in our code to check the load on application per server and to notify us when their is load more than threshold.

So,. basically we don't use anything fancy / opensource to monitor our infrastructure :D

Gergely Polonkai

You have to believe in things that are not true. How else would they become?

Jan 2, 2018

We use Uptime robot for external check, and Zabbix for internal ones.

Uptime robot is great at periodically checking our public facing site. On the other hand, Zabbix can collect a lot of metrics and send alerts when something goes wrong.

Pankaj Patel

Blog, Tech, Photography etc.

Jan 2, 2018

We use

Bugsnag for Error Reporting
Slack, HipChat and Email for alerting (depends on team)

And I had tested Sentry for error reporting and I liked it very much, I use it for most of my side projects.

Mev-Rael

Executive Product Leader & Mentor for High-End Influencers and Brands @ mevrael.com

Jan 2, 2018

site24x7.com

Has free options.

To monitor events from your server, like errors - bugsnag.com which integrates with Slack and, basically, just simple custom scripts sending me message in Slack if something happens or just general stats. Machine stats (RAM, Network, etc) is available from DigitalOcean.com monitoring UI and I receive emails everytime server is low on memory for example.

Search Hashnode

What do you use to monitor your production infrastructure?

Responses(10)

Recent in Forum