Custom Query (101 matches)

Results (16 - 18 of 101)

Ticket | Resolution | Summary | Owner | Reporter
#22 | fixed | jobmond.py consumes too much cpu system time when there are no jobs | bastiaans | gastineau@…
Description

Hi,

I run the latest stable version of jobmonarch on an IA64 server with Red Hat Linux AS 4 and torque 2.1.8. I installed jobmonarch yesterday and it worked fine until a long job (which had been running for about 12 hours) finished last night. Since then, the jobmond.py process consumes about 20% of "cpu system time". With the ps command, I can see that jobmond.py runs pbs_iff very frequently.

root 1772 4.8 0.1 67280 8096 ? S Apr25 60:21 /usr/bin/python -v /usr/local/sbin/jobmond.py -c /etc/jobmond.conf
root 4146 0.0 0.0 752 320 ? R 09:58 0:00 /usr/local/torque.2.1.8.fPIC/sbin/pbs_iff localhost 15001 4

How can I correct this problem?

Thanks,

Mickael

I have attached the contents of my /etc/jobmond.conf file.

#23 | fixed | implement native gmetric support | bastiaans | bastiaans
Description

Nick Galbreath created a 100% Python native gmetric module:

http://code.google.com/p/embeddedgmetric/wiki/GmetricPython

It does not require a C module, SWIG, or anything else.

Hi Ramon,

Correct, the Python code is one file and "does it all" without any dependencies. No C, no SWIG, no non-standard modules.

Take a peek:
http://embeddedgmetric.googlecode.com/svn/trunk/python/gmetric.py
That's really all it is.

The main difference between my code and the official gmetric is that my code will only send metrics to ONE multicast or ONE UDP port. The official gmetric reads in the gmond.conf file and can send metrics to multiple ports with multiple protocols at once. But I think 99% of Ganglia installs just send data to one port.
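
To illustrate the single-destination design described above, here is a minimal sketch of sending one datagram to exactly one UDP or multicast address. It is not taken from gmetric.py: the function names `send_metric_datagram` and `is_multicast` are hypothetical, and the payload is treated as opaque bytes rather than the real Ganglia wire format.

```python
import ipaddress
import socket


def is_multicast(host):
    """Return True if host is a literal multicast IP address."""
    try:
        return ipaddress.ip_address(host).is_multicast
    except ValueError:
        # Not an IP literal (e.g. a hostname): treat it as unicast.
        return False


def send_metric_datagram(payload, host, port, multicast_ttl=1):
    """Send payload (bytes) to ONE destination; return the bytes sent.

    Hypothetical helper: the payload is opaque here, not a real
    Ganglia-encoded metric packet.
    """
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    try:
        if is_multicast(host):
            # Limit how many router hops a multicast packet may cross.
            sock.setsockopt(socket.IPPROTO_IP,
                            socket.IP_MULTICAST_TTL, multicast_ttl)
        return sock.sendto(payload, (host, port))
    finally:
        sock.close()
```

Sending to several ports or protocols at once, as the official gmetric can, would mean looping over a list of destinations read from a config file; the sketch deliberately leaves that out.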

Feel free to package it up however you like. It's under the MIT license, which just says "Don't sue me if it blows up, and preserve the copyright notice". If you need a different license, let me know (but MIT is the least restrictive).

have fun!

--nickg

Need to implement this ASAP, as I think this will speed up jobmond quite a bit and make it more flexible.

#25 | fixed | global Names not defined | bastiaans | anonymous
Description

Hello,

In jobmond.py v0.2, two global names are used but not defined. The attached patch fixes it. The first error:

Traceback (most recent call last):
  File "/usr/sbin/jobmond.py", line 816, in ?
    main()
  File "/usr/sbin/jobmond.py", line 811, in main
    gather.run()
  File "/usr/sbin/jobmond.py", line 341, in run
    self.jobs = self.getJobData( self.jobs )
  File "/usr/sbin/jobmond.py", line 530, in getJobData
    except PBSError:
NameError: global name 'PBSError' is not defined

And the second error:

Traceback (most recent call last):
  File "/usr/sbin/jobmond.py", line 819, in ?
    main()
  File "/usr/sbin/jobmond.py", line 814, in main
    gather.run()
  File "/usr/sbin/jobmond.py", line 341, in run
    self.jobs = self.getJobData( self.jobs )
  File "/usr/sbin/jobmond.py", line 532, in getJobData
    time.sleep( TORQUE_POLL_INTERVAL )
NameError: global name 'TORQUE_POLL_INTERVAL' is not defined

Regards,

Peter
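
Both tracebacks come from the same error-handling branch in getJobData: the `except PBSError:` clause and the `time.sleep( TORQUE_POLL_INTERVAL )` call that follows it. The attached patch is not reproduced on this page, so the following is only a sketch of how defining the two names at module level could look. The `PBSError` class here is a stand-in (the real one would presumably come from the PBS bindings), and the 30-second value, the `max_attempts` limit, and the `fetch_job_data` helper are hypothetical.

```python
import time

# Hypothetical default: seconds to wait before retrying a failed query.
TORQUE_POLL_INTERVAL = 30


class PBSError(Exception):
    """Stand-in for the exception raised by the PBS bindings (assumption)."""


def fetch_job_data(query, max_attempts=3, sleep=time.sleep):
    """Call query(); on PBSError wait TORQUE_POLL_INTERVAL and retry.

    Returns the result of query(), or None if every attempt failed.
    """
    for _ in range(max_attempts):
        try:
            return query()
        except PBSError:
            # Both names now resolve, so this branch no longer
            # raises NameError when the server is unreachable.
            sleep(TORQUE_POLL_INTERVAL)
    return None
```

Because both names are only looked up when the query fails, the NameError would stay hidden until the first time the server could not be reached.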
