Custom Query (101 matches)



Results (16 - 18 of 101)

Ticket Resolution Summary Owner Reporter
#22 fixed consumes too much cpu system time when there are no jobs bastiaans gastineau@…


I run the latest stable version of jobmonarch on an IA64 server with Red Hat Linux AS 4 and Torque 2.1.8. I installed jobmonarch yesterday and it worked fine until a long job (which had been running for about 12 hours) finished last night. Since then, the process consumes about 20% of "cpu system time". With the ps command, I can see that pbs_iff is being run very frequently.

root 1772 4.8 0.1 67280 8096 ? S Apr25 60:21 /usr/bin/python -v /usr/local/sbin/ -c /etc/jobmond.conf
root 4146 0.0 0.0 752 320 ? R 09:58 0:00 /usr/local/torque.2.1.8.fPIC/sbin/pbs_iff localhost 15001 4

How can I correct this problem?

I have attached the contents of my /etc/jobmond.conf file.

#23 fixed implement native gmetric support bastiaans bastiaans

Nick Galbreath created a 100% python native gmetric module:

It does not require a C module, SWIG, or anything else.

Hi Ramon,

Correct, the Python code is one file and "does it all" without any dependencies. No C, no SWIG, no non-standard modules.

Take a peek
that's really all it is.

The main difference between my code and the official gmetric is that my code will only send metrics to ONE multicast or ONE UDP port. The official gmetric reads the gmond.conf file and can send metrics to multiple ports with multiple protocols at once. But I think 99% of Ganglia installs just send data to one port.

Feel free to package it up however you like. It's under the MIT license, which just says "Don't sue me if it blows up, and preserve the copyright notice". If you need a different license, let me know (but MIT is the least restrictive).

have fun!


Need to implement this ASAP, as I think this will speed up jobmond quite a bit and make it more flexible.
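
For illustration, a minimal sketch of what a pure-Python gmetric sender to a single UDP or multicast destination might look like. This is not Nick Galbreath's module. The function names (xdr_uint, xdr_string, build_metric_packet, send_metric) are hypothetical, and the packet layout is an assumption modelled on the Ganglia 3.0-era gmetric message; it should be checked against the gmond version in use.

```python
import socket
import struct


def xdr_uint(value):
    """Pack an unsigned 32-bit integer in XDR (big-endian) form."""
    return struct.pack("!I", value)


def xdr_string(text):
    """Pack a string as XDR: 4-byte length, the bytes, zero padding to a multiple of 4."""
    data = text.encode("utf-8")
    padding = (4 - len(data) % 4) % 4
    return struct.pack("!I", len(data)) + data + b"\x00" * padding


def build_metric_packet(name, value, type_="string", units="",
                        slope=3, tmax=60, dmax=0):
    """Build one metric message.

    ASSUMPTION: field order and the leading 0 ("user defined metric")
    follow the Ganglia 3.0-era gmetric format; slope=3 is assumed to
    mean "both". Verify against your gmond before relying on it.
    """
    return b"".join([
        xdr_uint(0),
        xdr_string(type_),
        xdr_string(name),
        xdr_string(str(value)),
        xdr_string(units),
        xdr_uint(slope),
        xdr_uint(tmax),
        xdr_uint(dmax),
    ])


def send_metric(packet, host, port, ttl=1):
    """Send the packet to exactly one UDP (or multicast) destination."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    try:
        # Only relevant when host is a multicast group address.
        sock.setsockopt(socket.IPPROTO_IP, socket.IP_MULTICAST_TTL, ttl)
        sock.sendto(packet, (host, port))
    finally:
        sock.close()
```

A caller would build a packet with build_metric_packet( "some_metric", 42 ) and pass it to send_metric together with the host and port that gmond listens on. Because only the standard socket and struct modules are used, no external gmetric binary has to be spawned per metric.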

#25 fixed global Names not defined bastiaans anonymous


In v0.2, two global names are used but not defined. The attached patch fixes this. The first error:

Traceback (most recent call last):
  File "/usr/sbin/", line 816, in ?
  File "/usr/sbin/", line 811, in main
  File "/usr/sbin/", line 341, in run = self.getJobData( )
  File "/usr/sbin/", line 530, in getJobData
    except PBSError:
NameError: global name 'PBSError' is not defined

And the second error:

Traceback (most recent call last):
  File "/usr/sbin/", line 819, in ?
  File "/usr/sbin/", line 814, in main
  File "/usr/sbin/", line 341, in run = self.getJobData( )
  File "/usr/sbin/", line 532, in getJobData
    time.sleep( TORQUE_POLL_INTERVAL )
NameError: global name 'TORQUE_POLL_INTERVAL' is not defined
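
The attached patch is not reproduced here. As a rough sketch of the kind of fix both tracebacks call for, the two names have to exist at module level before getJobData( ) runs. Everything below is illustrative: the value of TORQUE_POLL_INTERVAL, the stand-in PBSError class, and the get_job_data helper are assumptions, not the actual jobmond code.

```python
import time

# ASSUMPTION: poll interval in seconds; the real default is not stated in the ticket.
TORQUE_POLL_INTERVAL = 10


class PBSError(Exception):
    """Stand-in for the exception raised by the PBS/Torque binding.

    In the real daemon this name would presumably be imported from the
    PBS Python binding rather than defined locally.
    """


def get_job_data(query, poll_interval=TORQUE_POLL_INTERVAL,
                 max_attempts=3, sleep=time.sleep):
    """Call query(), sleeping and retrying whenever it raises PBSError.

    Returns the query result, or None if every attempt failed.
    """
    for _ in range(max_attempts):
        try:
            return query()
        except PBSError:
            # Both names used here are now defined, so no NameError occurs.
            sleep(poll_interval)
    return None
```

With both names defined, a failing query leads to a sleep and a retry instead of a NameError that kills the daemon.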


