[colug-432] unix monitoring
travissidelinger at gmail.com
Wed Feb 24 20:15:24 EST 2016
I ended up rolling my own rpm's and had to create a number of patches for
We looked at NagiosXI. Maybe the newer version is better, but the previous
version was garbage under the covers. Their install scripts had numerous
blatant issues and even hard coded passwords.
On Wed, Feb 24, 2016 at 8:02 PM, Zach Villers <zachvatwork at gmail.com> wrote:
> The Fedora Infrastructure team uses Nagios. You can access the ansible
> repo to review how it is configured and look at the Infrastructure group
> wiki page to find the url/login for their nagios web deployment (read only
> of course). I believe this requires a Fedora account, which is quite
> On Wed, Feb 24, 2016 at 7:38 PM Travis Sidelinger <
> travissidelinger at gmail.com> wrote:
>> Zenoss is another good one.
>> We were also using SCOM for Linux. The failed heartbeat messages were in
>> the thousands daily and completely useless.
>> We went with Nagios+NRPE+NagiosQL+Nagstamon+Cacti. Yes, its
>> configuration management is not automated, but we did our own scripting to
>> automated the configs to build from our CMDB. We needed 5 different
>> instances for separate isolated environments, so using a solution where we
>> were not charged 5x was important. Also, it is a fairly lightweight
>> solution. It's not perfect, but got it done and onto the next task.
>> On Wed, Feb 24, 2016 at 6:14 PM, Rick Hornsby <richardjhornsby at gmail.com>
>>> We're presently using Microsoft SCOM to monitor our enterprise (mix of
>>> Windows and Linux) ... and it's ... horrible. There's simply no other way
>>> to describe the experience for a UNIX admin. We've given up trying to
>>> automate the unix agent installation. It's a broken in ways we cannot fix.
>>> So we're pondering a better solution for our UNIX environment. BMC
>>> Patrol is out - it's got big stompy feet and an even larger price tag.
>>> We're looking at free/OSS options to cover ~1000 unix hosts, mostly RHEL
>>> and SUSE but some Solaris and AIX.
>>> Start with the basics - CPU load, disk space, ports listening, processes
>>> running, etc - and have the ability to grow into application level
>>> monitoring. It would be nice if the OSS version supported LDAP auth. We
>>> plan to integrate the solution into our eventual server provisioning stuff
>>> that we're planning to build with Puppet. It would also be nice if the
>>> dashboard was pretty.
>>> For an idea, some of the ones we're considering are Zabbix, Nagios,
>>> Sensu, and PandoraFMS.
>>> Zabbix - I have some past experience with 1.x and 2.0/2.2. The web UI
>>> is a little painful. Just now I quick-like spun up a Zabbix 3.0 instance
>>> --- and things on the dashboard are blinking. No really, green blocks on
>>> the screen are blinking. Please stop blinking, giant blocks.
>>> Nagios - It's been around long enough to have earned a bad rep for
>>> basically being old, never very user friendly, and generally brittle by
>>> modern standards.
>>> Sensu - It looks pretty? Don't know much, but it's weird that the OSS
>>> version is Ruby but the enterprise version is Java?
>>> PandoraFMS - Don't know much about this one.
>>> What are you guys running? It feels like there must be more options out
>>> there that we're not aware of.
>>> colug-432 mailing list
>>> colug-432 at colug.net
>> "Remember never to be discouraged by those who cling to their beliefs.
>> They are worthy of our pity, not our scorn."
>> "A careful reading of history clearly demonstrates ...
>> that people don't read history carefully.”
>> “We can't solve problems by using the same kind of thinking we used when
>> we created them.”
>> —Albert Einstein
>> colug-432 mailing list
>> colug-432 at colug.net
> colug-432 mailing list
> colug-432 at colug.net
"Remember never to be discouraged by those who cling to their beliefs.
They are worthy of our pity, not our scorn."
"A careful reading of history clearly demonstrates ...
that people don't read history carefully.”
“We can't solve problems by using the same kind of thinking we used when we
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the colug-432