Custom Query (101 matches)

Filters
 
Or
 
  
 
Columns

Show under each result:


Results (19 - 21 of 101)

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17
Ticket Resolution Summary Owner Reporter
#22 fixed jobmond.py consumes too much cpu system time when there are no jobs bastiaans gastineau@…
Description

Hi,

I run the last stable version of jobmonarch on an IA64 server with redhat Linux AS 4 and torque 2.1.8. I install yesterday jobmarch and it works fine until a long job (which was running about 12hours) finished this night. After that, the process jobmond.py consumes about 20% of "cpu system time". With the ps command , I see that jobmond.py run very frequently pbs_iff.

root 1772 4.8 0.1 67280 8096 ? S Apr25 60:21 /usr/bin/python -v /usr/local/sbin/jobmond.py -c /etc/jobmond.conf root 4146 0.0 0.0 752 320 ? R 09:58 0:00 /usr/local/torque.2.1.8.fPIC/sbin/pbs_iff localhost 15001 4

How can I correct this problem ?

Thanks,

Mickael,

I attach the content of my file /etc/jobmond.conf

#23 fixed implement native gmetric support bastiaans bastiaans
Description

Nick Galbreath created a 100% python native gmetric module:

http://code.google.com/p/embeddedgmetric/wiki/GmetricPython

It does not require a C module, swig, nothing.

Hi Ramon,

correct, the python code is one file and "does it all" without any dependencies.  No C, No Swig, No non-standard modules.

Take a peek
http://embeddedgmetric.googlecode.com/svn/trunk/python/gmetric.py
that's really all it is.



The main difference between my code and the official gmetric is that my code only will send metrics to  ONE mutlicast or  ONE udp port.  official gmetric reads in the gmond.conf file and can send metrics to multiple ports with multiple protocols at once.  But I think 99% of the ganglia installs just sends data to 1 port.


Feel free to package it up however you like.  Its the MIT license, which just says "Don't sue me if it blows up, and preserve the copyright notice"   If you need a different license let me know ( but MIT has is least restrictive).

have fun!

--nickg

Need to implement this ASAP, as I think this will speed up jobmond quite a bit and make it more flexible.

#24 worksforme SGE support broken ramonb bastiaans
Description
After going through the instructions, I attempted to execute
jobmond.py. When I did that, I received the following error message:
cluster1:/usr/local/sbin # /usr/local/sbin/jobmond.py -c /etc/jobmond.conf
Traceback (most recent call last):
 File "/usr/local/sbin/jobmond.py", line 814, in ?
   main()
 File "/usr/local/sbin/jobmond.py", line 807, in main
   gather.daemon()
UnboundLocalError: local variable 'gather' referenced before assignment

An examination of the code reveals that the SGE data gathering code
was commented out on line 792. Uncommenting it had the following
effect:
cluster1:/usr/local/sbin # /usr/local/sbin/jobmond.py -c /etc/jobmond.conf
 File "/usr/local/sbin/jobmond.py", line 797
   debug_msg( 0, "fatal error: BATCH_API set to 'sge' but python
module 'sge_drmaa' is not installed' )

                               ^
SyntaxError: EOL while scanning single-quoted string

Commenting out everything but "gather = SgeDataGatherer()" gave me the
following error:
cluster1:/usr/local/sbin # /usr/local/sbin/jobmond.py -c /etc/jobmond.conf
Traceback (most recent call last):
 File "/usr/local/sbin/jobmond.py", line 814, in ?
   main()
 File "/usr/local/sbin/jobmond.py", line 800, in main
   gather = SgeDataGatherer()
 File "/usr/local/sbin/jobmond.py", line 419, in __init__
   self.initSgeJobInfo()
 File "/usr/local/sbin/jobmond.py", line 426, in initSgeJobInfo
   self.qstatparser = SgeQstatXMLParser( SGE_QSTAT_XML_FILE )
NameError: global name 'SGE_QSTAT_XML_FILE' is not defined

At this point, I decided to search my systems for references to drmaa.
I saw several references to C++ example and header files related to
it. Is the sge_drmaa module supposed to be provided by Job Monarch or
Sun Grid Engine? 
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17
Note: See TracQuery for help on using queries.