Found 40 bookmarks
VictorOps | DevOps Alerting & Real-Time Incident Management

"VictorOps is a hub for centralizing the flow of information throughout the incident lifecycle. Driven by IT and DevOps system data, VictorOps provides a unified platform for real-time alerting, collaboration, and documentation.

Using VictorOps, teams resolve incidents faster to help minimize the impact of downtime and speed innovation."

zalando/zmon: ZMON is Zalando's monitoring tool and allows observing services and metrics on various layers, from CPU load to team KPIs.
"ZMON is Zalando's open-source platform monitoring tool, used in production since early 2014. It supports our many engineering teams in observing their services and metrics on various layers, from CPU load to team KPIs."
Stashboard: The open source status dashboard

"Stashboard is a status dashboard for APIs and software services. It's similar to the Amazon AWS Status Page or the Google Apps Status Page. Stashboard was originally written by Twilio to provide status information on its Voice and SMS APIs. Stashboard is designed to provide a generic status dashboard for any hosted service or API. The code can be downloaded, customized, and run on any Google App Engine account.

Track multiple different API or SaaS services
Set custom status messages and icons such as Up/Down
Show historical status for each service
Runs on Google App Engine so it's independent of your infrastructure (unless your app is on GAE)
Full REST API for both getting and setting status information
CNAME to http://status.yourapp.com

Stashboard is written in Python and hosted on Google App Engine."

clickalicious/phpMemAdmin

"Detailed statistics, charts & graphs
Data management for Memcached (full CRUD support)
Memcached cluster health dashboard
Memcached host dashboard
Update check
Nice & comfortable UI
Clean & well documented code
Responsive and mobile ready"

zendframework/ZendDiagnostics

"Simple component for performing diagnostic tests in real-world PHP applications.

It currently ships with the following Diagnostic Checks:

ApcFragmentation - check if APC memory fragmentation is below given threshold
ApcMemory - check available APC memory
Callback - call a user-defined diagnostic function
ClassExists - make sure class exists in current environment
CpuPerformance - check server CPU performance is above baseline
DirReadable - make sure given path is readable
DirWritable - make sure given path is writable
DiskFree - check there's enough free space on given path
ExtensionLoaded - make sure extension is loaded
HttpService - check if given http host is responding
Memcache - check if memcache extension is loaded and given server is reachable
PhpVersion - make sure that PHP version matches constraint
PhpFlag - make sure that given PHP flag (feature) is turned on or off
ProcessRunning - check if a process with given name or ID is currently running
SecurityAdvisory - check installed composer dependencies against SensioLabs SA database
StreamWrapperExists - make sure given stream wrapper is available

File validation checks:

IniFile - check if given INI file is available and valid
JsonFile - check if given JSON file is available and valid
XmlFile - check if given XML file is available and valid
YamlFile - check if given YAML file is available and valid"
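To make the idea of a diagnostic check concrete, here is a minimal sketch in Python. It is an analogue of the concept only, not ZendDiagnostics' PHP API: the function names `check_disk_free`, `check_module_available`, and `run_checks` are hypothetical, and the module lookup merely stands in for checks like ClassExists or ExtensionLoaded.

```python
import importlib.util
import shutil


def check_disk_free(path, min_bytes):
    """Pass when the filesystem holding `path` has at least `min_bytes` free."""
    free = shutil.disk_usage(path).free
    return free >= min_bytes, f"{free} bytes free on {path} (need {min_bytes})"


def check_module_available(name):
    """Pass when a top-level Python module can be located (ExtensionLoaded analogue)."""
    found = importlib.util.find_spec(name) is not None
    return found, f"module {name!r} {'found' if found else 'missing'}"


def run_checks(checks):
    """Run each named check; a check that raises is reported as a failure."""
    results = {}
    for label, check in checks.items():
        try:
            results[label] = check()
        except Exception as exc:
            results[label] = (False, f"check raised {exc!r}")
    return results


results = run_checks({
    "disk_free": lambda: check_disk_free(".", 0),
    "json_module": lambda: check_module_available("json"),
    "missing_module": lambda: check_module_available("no_such_module_xyz"),
})
for label, (ok, detail) in results.items():
    print("OK  " if ok else "FAIL", label, "-", detail)
```

Each check returns a pass/fail flag plus a human-readable detail, so a runner can aggregate them without knowing what any individual check does.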

ElasticHQ - ElasticSearch monitoring and management application.
"From monitoring individual cluster nodes to viewing real-time threads, ElasticHQ enables up-to-the-second insight into ElasticSearch cluster runtime metrics and configurations, using the ElasticSearch REST API. ElasticHQ's real-time update feature works by polling your ElasticSearch cluster intermittently, always pulling the latest aggregate information and deltas, keeping you up-to-date with the internals of your working cluster."
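The poll-and-delta approach described here can be sketched as follows. This is not ElasticHQ's code: `poll_deltas` and `compute_deltas` are hypothetical names, and `fetch` is a stand-in callable for whatever HTTP request would read counters from the cluster's REST API.

```python
import time
from typing import Callable, Dict, Iterator

Snapshot = Dict[str, float]


def compute_deltas(previous: Snapshot, current: Snapshot) -> Snapshot:
    """Return the change in every counter present in both snapshots."""
    return {key: current[key] - previous[key] for key in current if key in previous}


def poll_deltas(
    fetch: Callable[[], Snapshot],
    rounds: int,
    interval_s: float = 5.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Iterator[Snapshot]:
    """Poll `fetch` repeatedly and yield the delta between consecutive snapshots."""
    previous = fetch()
    for _ in range(rounds):
        sleep(interval_s)
        current = fetch()
        yield compute_deltas(previous, current)
        previous = current


# Canned snapshots replace a live cluster so the sketch runs offline.
samples = iter([
    {"docs": 100, "searches": 7},
    {"docs": 130, "searches": 9},
    {"docs": 130, "searches": 15},
])
deltas = list(poll_deltas(lambda: next(samples), rounds=2, interval_s=0, sleep=lambda s: None))
print(deltas)
```

Passing `sleep` as a parameter keeps the loop testable without waiting for real intervals.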
Dashing - The exceptionally handsome dashboard framework.

"Dashing is a Sinatra based framework that lets you build beautiful dashboards.

Check out a demo over here. Here's another one, optimized for 1080p screens.

Key features:

Use premade widgets, or fully create your own with scss, html, and coffeescript.
Widgets harness the power of data bindings to keep things DRY and simple. Powered by batman.js.
Use the API to push data to your dashboards, or make use of a simple ruby DSL for fetching data.
Drag & Drop interface for re-arranging your widgets.
Host your dashboards on Heroku in less than 30 seconds.

This project was created at Shopify for displaying custom dashboards on TVs around the office."

liip/LiipMonitorBundle · GitHub
This bundle provides a way to run a series of application-related health checks. Health checks in the scope of this bundle go beyond simple actions like pinging a server to see if it's alive. For example, a Memcache server can be alive and not displaying any errors in your Nagios, but you might not be able to access it from your PHP application. Each health check should then implement some application logic that you want to make sure always works. Another usage can be testing for specific requirements, like availability of PHP extensions.
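The difference between "the server answers a ping" and "the application can actually use it" can be illustrated with a write-then-read check. The sketch below is Python rather than the bundle's PHP, and everything in it is hypothetical: `InMemoryCache` and `BrokenCache` are stand-ins for a real cache client, assumed to expose `set` and `get`.

```python
import uuid


class InMemoryCache:
    """Hypothetical stand-in for a cache client exposing set() and get()."""

    def __init__(self):
        self._data = {}

    def set(self, key, value):
        self._data[key] = value

    def get(self, key):
        return self._data.get(key)


class BrokenCache(InMemoryCache):
    """Accepts writes without error but silently drops them."""

    def set(self, key, value):
        pass


def cache_round_trip_ok(cache):
    """Pass only if a freshly written value can be read back unchanged."""
    key = f"healthcheck:{uuid.uuid4()}"
    token = uuid.uuid4().hex
    try:
        cache.set(key, token)
        return cache.get(key) == token
    except Exception:
        return False


print(cache_round_trip_ok(InMemoryCache()))  # healthy client
print(cache_round_trip_ok(BrokenCache()))    # reachable but unusable
```

A random key and token per run avoid a false pass caused by a stale value left over from an earlier check.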
OpenTSDB - A Distributed, Scalable Monitoring System

OpenTSDB is a distributed, scalable Time Series Database (TSDB) written on top of HBase. OpenTSDB was written to address a common need: store, index and serve metrics collected from computer systems (network gear, operating systems, applications) at a large scale, and make this data easily accessible and graphable. Thanks to HBase's scalability, OpenTSDB allows you to collect many thousands of metrics from thousands of hosts and applications, at a high rate (every few seconds). OpenTSDB will never delete or downsample data and can easily store billions of data points. As a matter of fact, StumbleUpon uses it to keep track of hundreds of thousands of time series and collects over 100 million data points per day in their main production cluster.

Imagine having the ability to quickly plot a graph showing the number of active worker threads in your web servers, the number of threads used by your database, and correlate this with your service's latency. OpenTSDB makes generating such graphs on the fly a trivial operation, while manipulating millions of data points for very fine grained, real-time monitoring.
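As a rough illustration of what a single stored data point looks like, the sketch below formats one line for OpenTSDB's telnet-style `put` command (metric name, Unix timestamp, value, then `key=value` tags). The helper name `format_put`, the metric name, and the tag values are made up for the example; sending the line to a server is left out.

```python
import time


def format_put(metric, value, tags, timestamp=None):
    """Return one data point as a telnet-style `put` line."""
    if not tags:
        raise ValueError("each data point needs at least one tag")
    if timestamp is None:
        timestamp = int(time.time())
    # Sort tags so the same data point always produces the same line.
    tag_str = " ".join(f"{key}={val}" for key, val in sorted(tags.items()))
    return f"put {metric} {timestamp} {value} {tag_str}"


line = format_put(
    "sys.cpu.user", 42.5, {"host": "web01", "cpu": "0"}, timestamp=1356998400
)
print(line)  # put sys.cpu.user 1356998400 42.5 cpu=0 host=web01
```

Tags such as `host` are what later allow series from many machines to be filtered or grouped when graphing.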

SystemTap

SystemTap provides free software (GPL) infrastructure to simplify the gathering of information about the running Linux system. This assists diagnosis of a performance or functional problem. SystemTap eliminates the need for the developer to go through the tedious and disruptive instrument, recompile, install, and reboot sequence that may be otherwise required to collect data.

SystemTap provides a simple command line interface and scripting language for writing instrumentation for a live running kernel. We are publishing samples, as well as enlarging the internal "tapset" script library to aid reuse and abstraction.

Among other tracing/probing tools, SystemTap is the tool of choice for complex tasks that may require live analysis, programmable on-line response, and whole-system symbolic access. SystemTap can also handle simple tracing jobs.

Ganglia Monitoring System
Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and Grids. It is based on a hierarchical design targeted at federations of clusters. It leverages widely used technologies such as XML for data representation, XDR for compact, portable data transport, and RRDtool for data storage and visualization. It uses carefully engineered data structures and algorithms to achieve very low per-node overheads and high concurrency. The implementation is robust, has been ported to an extensive set of operating systems and processor architectures, and is currently in use on thousands of clusters around the world. It has been used to link clusters across university campuses and around the world and can scale to handle clusters with 2000 nodes.
Home - Graylog2 - Free Open Source remote TCP/UDP Syslog daemon with Web Interface

Graylog2 is an open source syslog implementation that stores your logs in MongoDB. It consists of a server written in Java that accepts your syslog messages via TCP or UDP and stores them in the database. The second part is a Ruby on Rails web interface that allows you to view the log messages.

FEATURES
Accepts syslog messages via TCP/UDP
Fast backend via MongoDB
GELF (Graylog Extended Log Format)
Blacklists
Filtering
Statistics
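For a sense of what a GELF message looks like, here is a minimal sketch that builds a compressed JSON payload. The field names follow the GELF 1.1 layout, which is an assumption: the Graylog2 release described above may expect an older version of the format. The helper names, the server address, and the port (12201 is the conventional GELF UDP port) are placeholders.

```python
import json
import socket
import time
import zlib


def build_gelf_payload(host, short_message, level=6, timestamp=None, **extra):
    """Build a zlib-compressed GELF message (field names assumed from GELF 1.1)."""
    message = {
        "version": "1.1",
        "host": host,
        "short_message": short_message,
        "timestamp": time.time() if timestamp is None else timestamp,
        "level": level,  # syslog severity, 6 = informational
    }
    # Additional, application-specific fields carry a leading underscore.
    for key, value in extra.items():
        message["_" + key] = value
    return zlib.compress(json.dumps(message).encode("utf-8"))


def send_gelf_udp(payload, server="127.0.0.1", port=12201):
    """Send one unchunked datagram; server and port are placeholders."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.sendto(payload, (server, port))


payload = build_gelf_payload(
    "web01", "Login failed", level=4, timestamp=1385053862.0, user="alice"
)
decoded = json.loads(zlib.decompress(payload))
print(decoded["short_message"], decoded["_user"])
```

Unlike plain syslog lines, the structured fields (here `_user`) arrive already separated, so the server does not need to parse them out of free text.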

mysql-snmp - monitoring MySQL with SNMP

mysql-snmp is a Net-SNMP Perl subagent that connects to a MySQL server and reports its statistics through SNMP. Additionally, this project contains material to graph MySQL variables/counters in OpenNMS.

The graphs are based on the excellent MySQL Cacti Templates by Baron Schwartz.

mycheckpoint | openark forge

mycheckpoint is an open source monitoring utility for MySQL, with strong emphasis on user accessibility to monitored data.

It is SQL oriented: charts, reports and advanced metrics are generated on the fly with views. There is no need for an external program to diagnose the data. In fact, mycheckpoint’s main duty is to do a one-time creation of a special purpose schema.

kontrollbase | kontrollsoft
Kontrollbase is a MySQL monitoring, analytics, reporting, and historical analysis webapp for MySQL database administrators and advanced users of MySQL databases. Its goal is to give you enterprise level tools that are built on the open-source software model. We want your servers to perform at their best and these tools will help you get the most out of your investment.