Custom Query (101 matches)
Results (34 - 36 of 101)
Ticket | Resolution | Summary | Owner | Reporter |
---|---|---|---|---|
#76 | worksforme | jobarchived does not change status to "F" | ramonb | j.kasiak@… |
Description |
Jobarchived does not update a jobs status to "F" once it finishes. Jobmond runs on the head node. gmetad runs on a seperate box. I've narrowed down the problem: when I do on my gmetad box telnet -l ganglia localhost 8651 | grep -i monarch | grep -i 23055 <METRIC NAME="MONARCH-JOB-23055-0" VAL="status=R start_timestamp=1269222985 name=STDIN poll_interval=30 queue=batch reported=1269223164 requested_time=100:00:00 queued_timestamp=1269222984 owner=user1 nodes=p340050" TYPE="string" UNITS="" TN="442" TMAX="60" DMAX="0" SLOPE="both" SOURCE="gmond"> Connection closed by foreign host. The job is still there!!! Only a restart of gmetad clears this. This is a problem, since jobarchived parses this xml file and puts this node in an array of active nodes, and never gets to set the job_status to "F". How can I fix this? Thanks, Jan |
|||
#75 | fixed | create a proper debian/ directory and .spec file | olahaye | pk@… |
Description |
Hello, the usual Debian way for packages is to have a debian/ directory with all the meta information included. I'd like to convert the current mechanism (make deb) to the debian way (dpkg-buildpackage). I have done this for many other packages already so I know I can do it here as well. But before I do, I'd like to know:
Thanks,
|
|||
#74 | fixed | Error while trying to run jobmond | somebody | lari.lloret@… |
Description |
When trying to run /usr/local/sbin/jobmond.py -c /etc/jobmond.conf I get the following message: [root@hostname ganglia_jobmonarch-0.3.1]# /usr/local/sbin/jobmond.py -c /etc/jobmond.conf Traceback (most recent call last):
ValueError?: invalid literal for int(): 1 |