This repository has been archived by the owner on Nov 26, 2019. It is now read-only.

Improve Netdata Dashboards #1080

Open
1 task done
sanfordd opened this issue Jun 4, 2018 · 1 comment

Comments


sanfordd commented Jun 4, 2018

Improve the netdata dashboards for quick status checks:

  • Implement more yes/no status checks
@sanfordd sanfordd self-assigned this Jun 4, 2018
@sanfordd sanfordd added this to the post-launch milestone Jul 10, 2018
@sanfordd

We finished interviewing the relevant users, and we'll use netdata badges as simpler displays.
The goal is a single yes/no box showing the status of the application itself, plus a few boxes with similarly simple values to help people decide where to start looking when there is a problem.
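As a rough illustration of the badge approach, the sketch below builds a URL for netdata's badge endpoint (`/api/v1/badge.svg`) with its `chart`, `label`, and `after` query parameters. The host name and the chart ID `example.resque_workers` are placeholders invented for this example, not names taken from our deployment.

```python
from urllib.parse import quote, urlencode


def badge_url(base_url, chart, label, **options):
    """Build a netdata badge URL for a single chart.

    Extra keyword arguments (for example after=-60) are passed through
    as additional query parameters.
    """
    params = {"chart": chart, "label": label}
    params.update(options)
    # quote_via=quote encodes spaces as %20 instead of '+'.
    query = urlencode(params, quote_via=quote)
    return f"{base_url.rstrip('/')}/api/v1/badge.svg?{query}"


# Hypothetical host and chart ID, for illustration only.
url = badge_url(
    "http://netdata.example.org:19999",
    "example.resque_workers",
    "resque workers",
    after=-60,
)
print(url)
```

The resulting URL could be embedded as an image on a status page, so each box is a single badge.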

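From netdata we'll be able to see:

  • Resque-worker count
  • Redis seconds since last connection
  • Connections to the sufia database (1+ indicates it's there and working)
  • Connections to the fcrepo database (1+ shows the database is there and has a connection, though normal use is many more connections)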
Connections to the fcrepo database (1+ shows the database is there and has a connection though normal use is many more connections)

We can enable the tomcat metrics as well, which should let us monitor that aspect.

Fedora and Solr don't currently have any direct monitoring tools. We can look into writing them if we want, or, as a basic option, we can monitor the services they rely on, such as Tomcat and Postgres for Fedora.

Application availability will be handled by taking the uptime data from Honeybadger directly; it doesn't seem worthwhile to integrate it with netdata.
