2. Get all of the required modules listed. You can find them at your nearest mirror or at the CPAN archive (www.cpan.org). I am not very familiar with Perl, so I downloaded them from the CPAN archive as .tar.gz packages and installed them the usual way (perl Makefile.PL && make && make test && make install).
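As a rough sketch, the usual way of installing one downloaded module can be wrapped in a small shell function. The function name install_cpan_tarball, the DRY_RUN variable and the tarball name in the usage line are made up for illustration. The sketch also assumes the tarball unpacks into a directory named after itself, which is the common CPAN convention but is not guaranteed. Note that the build file shipped in CPAN distributions is named Makefile.PL (capital PL).

```shell
#!/bin/sh
# Sketch only: build and install one CPAN module from a .tar.gz tarball.
# install_cpan_tarball and DRY_RUN are hypothetical names, not part of mon or CPAN.
install_cpan_tarball() {
    tarball=$1
    # Assumption: Foo-Bar-1.0.tar.gz unpacks into a directory called Foo-Bar-1.0
    dir=$(basename "$tarball" .tar.gz)

    if [ -n "$DRY_RUN" ]; then
        # Only print what would be done
        echo "gzip -dc $tarball | tar xf -"
        echo "cd $dir && perl Makefile.PL && make && make test && make install"
        return 0
    fi

    gzip -dc "$tarball" | tar xf - || return 1
    # Subshell, so the caller's working directory is left unchanged
    ( cd "$dir" && perl Makefile.PL && make && make test && make install )
}
```

For example, "DRY_RUN=1; install_cpan_tarball Some-Module-1.0.tar.gz" only prints the commands. Without DRY_RUN the last step (make install) normally needs root.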
3. Mon is software for monitoring network resources. It can ping computers, connect to various ports, and monitor WWW, MySQL, etc. When a resource fails, it triggers scripts.
4. Unpack mon into a directory. The best starting point is the README file. Complete documentation is in <dir>/doc, where <dir> is the place where you unpacked the mon package.
5. For a fast start, do the following steps:
copy all subdirs found in <dir> to /usr/lib/mon
create dir /etc/mon
copy auth.cf from <dir>/etc to /etc/mon
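The steps above can be sketched as a small shell function. The name mon_quick_install and its optional second and third arguments are made up for illustration. They default to the /usr/lib/mon and /etc/mon paths used in this text and can be overridden to try the steps in a scratch directory first.

```shell
#!/bin/sh
# Sketch only: the quick-start steps as one function.
# mon_quick_install is a hypothetical helper, not shipped with mon.
# Usage: mon_quick_install <dir> [libdir] [etcdir]
mon_quick_install() {
    src=$1
    libdir=${2:-/usr/lib/mon}
    etcdir=${3:-/etc/mon}

    mkdir -p "$libdir" "$etcdir" || return 1

    # Copy every subdirectory found in <dir> into the lib directory
    for sub in "$src"/*/; do
        [ -d "$sub" ] || continue
        # Strip the trailing slash so the directory itself is copied
        cp -R "${sub%/}" "$libdir"/ || return 1
    done

    # auth.cf goes to the configuration directory
    cp "$src/etc/auth.cf" "$etcdir"/ || return 1
}
```

Run it as root as "mon_quick_install <dir>", where <dir> is the place where you unpacked mon.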
Now mon is ready to work. You need to create your own mon.cf file, in which
you specify the resources mon should watch and the actions mon will start
when a resource fails and when it becomes available again. All monitoring
scripts are in /usr/lib/mon/mon.d/. At the beginning of every script you can
find an explanation of how to use it.
All alert scripts are placed in /usr/lib/mon/alert.d/. These are the scripts
triggered when something goes wrong. If you are using ipvs, you can find
scripts for adding and removing servers from the ipvs list on its homepage
(www.linuxvirtualserver.org).
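To give an idea of what a mon.cf can look like, here is a minimal sketch wrapped in a shell function that writes it to a file. The function name write_sample_mon_cf, the host names www1 and www2, and the mail address are made up. The fping.monitor and mail.alert scripts are assumed to be present in the mon.d and alert.d directories of your mon version, so check those directories and the mon documentation before using any of this.

```shell
#!/bin/sh
# Sketch only: write a minimal example mon.cf to the given path.
# write_sample_mon_cf is a hypothetical helper; hosts and address are placeholders.
write_sample_mon_cf() {
    cat > "$1" <<'EOF'
# Where mon finds its files (paths as used in this text)
cfbasedir = /etc/mon
mondir    = /usr/lib/mon/mon.d
alertdir  = /usr/lib/mon/alert.d

# Resources to watch (placeholder host names)
hostgroup webservers www1 www2

watch webservers
    service ping
        interval 1m
        monitor fping.monitor
        period wd {Sun-Sat}
            # Started when the resource fails
            alert mail.alert admin@example.com
            # Started when the resource is available again
            upalert mail.alert admin@example.com
            alertevery 1h
EOF
}
```

For example, "write_sample_mon_cf /etc/mon/mon.cf" creates the file, which you then edit for your own hosts and alerts. With ipvs, the alert and upalert lines are where the scripts for removing and adding servers would go.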
Yes! Use the ipfail plug-in. For each interface you wish to monitor, specify one or more "ping" nodes in your configuration. Each node in your cluster will monitor these ping nodes. Should one node detect a failure in one of these ping nodes, it will contact the other node in order to determine whether it or the ping node has the problem. If the cluster node has the problem, it will try to fail over its resources (if it has any).
To use ipfail, you will need to add the following to your /etc/ha.d/ha.cf
files:
respawn hacluster /usr/lib/heartbeat/ipfail
ping <IPaddr1> <IPaddr2> ... <IPaddrN>
IPaddr1..N are your ping nodes. NOTE: ipfail requires the "nice_failback on"
option.
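A quick way to see whether an ha.cf contains the three things mentioned above is a small grep-based check. This is only a sketch: check_ipfail_config is a made-up helper, not something shipped with heartbeat, and it only looks for the directives. It does not validate the rest of the file.

```shell
#!/bin/sh
# Sketch only: check that ha.cf has the directives ipfail needs.
# check_ipfail_config is a hypothetical helper, not part of heartbeat.
check_ipfail_config() {
    cf=${1:-/etc/ha.d/ha.cf}
    status=0

    # Patterns are anchored at line start, so commented-out lines do not match
    grep -Eq '^[[:space:]]*respawn[[:space:]]+hacluster[[:space:]]+[^[:space:]]*/ipfail' "$cf" \
        || { echo "missing: respawn hacluster /usr/lib/heartbeat/ipfail"; status=1; }

    grep -Eq '^[[:space:]]*ping[[:space:]]+[^[:space:]]' "$cf" \
        || { echo "missing: ping line with at least one ping node"; status=1; }

    grep -Eq '^[[:space:]]*nice_failback[[:space:]]+on' "$cf" \
        || { echo "missing: nice_failback on"; status=1; }

    return $status
}
```

Run "check_ipfail_config" on each node. It prints one line per missing directive and returns non-zero if anything is missing.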
This isn't a problem with heartbeat, but rather is caused by various versions of net-tools. Upgrade to the most recent version of net-tools and it will go away. You can test it with ifconfig manually.
This means you need to send 6 or 8 files. Include 6 if your debug output goes into the same file as your normal output and 8 otherwise. For each machine you need to send:
Rev 0.0.5
(c) 2000 Rudy Pawul rpawul@iso-ne.com
(c) 2001 Dusan Djordjevic dj.dule@linux.org.yu