DESCRIPTION =========== Job Monarch is a set of tools to monitor and optionally archive (batch)job information. It is a addon for the Ganglia monitoring system and plugs in to a existing Ganglia setup. To view a operational setup with Job Monarch, have a look here: http://ganglia.sara.nl/?c Job Monarch stands for 'Job Monitoring and Archiving' tool and consists of three (3) components: * jobmond The Job Monitoring Daemon. Gathers PBS/Torque batch statistics on jobs/nodes and submits them into Ganglia's XML stream. Through this daemon, users are able to view the PBS/Torque batch system and the jobs/nodes that are in it (be it either running or queued). * jobarchived The Job Archiving Daemon (optionally). Listens to Ganglia's XML stream and archives the job and node statistics. It stores the job statistics in a Postgres SQL database and the node statistics in RRD files. Through this daemon, users are able to lookup a old/finished job and view all it's statistics. Optionally: You can either choose to use this daemon if your users have use for it. As it can be a heavy application to run - even though optimized (staged/buffered writes and multi threaded) - and not everyone may have a need for it. * web The Job Monarch web interface. This interfaces with the jobmond data and (optionally) the jobarchived and presents the data and graphs. It does this in a similar layout/setup as Ganglia itself, so the navigation and usage is intuitive. REQUIREMENTS ============ jobmond: - pbs_python v2.8.1 or higher ftp://ftp.sara.nl/pub/outgoing/pbs_python.tar.gz - gmond 3.0.1 or higher http://www.ganglia.info jobarchived: - Postgres SQL v7.xx http://www.postgres.org - rrdtool v1.xx http://people.ee.ethz.ch/~oetiker/webtools/rrdtool/ - gmetad 3.x.x http://www.ganglia.info web: - PHP 4.1 or higher http://www.php.net INSTALLATION ============ Prior to installing the software make sure you meet the necessary requirements as mentioned above. 1. copy CONFIGURATION ============= START ===== CONTACT ======= To contact the author for anything from bugfixes to flame/hate mail: Ramon Bastiaans