Custom Query (101 matches)


Show under each result:

Results (7 - 9 of 101)

1 2 3 4 5 6 7 8 9 10 11 12 13
Ticket Resolution Summary Owner Reporter
#76 worksforme jobarchived does not change status to "F" ramonb j.kasiak@…

Jobarchived does not update a jobs status to "F" once it finishes. Jobmond runs on the head node. gmetad runs on a seperate box. I've narrowed down the problem: when I do on my gmetad box

telnet -l ganglia localhost 8651 | grep -i monarch | grep -i 23055

<METRIC NAME="MONARCH-JOB-23055-0" VAL="status=R start_timestamp=1269222985 name=STDIN poll_interval=30 queue=batch reported=1269223164 requested_time=100:00:00 queued_timestamp=1269222984 owner=user1 nodes=p340050" TYPE="string" UNITS="" TN="442" TMAX="60" DMAX="0" SLOPE="both" SOURCE="gmond"> Connection closed by foreign host.

The job is still there!!! Only a restart of gmetad clears this. This is a problem, since jobarchived parses this xml file and puts this node in an array of active nodes, and never gets to set the job_status to "F".

How can I fix this? Thanks, Jan

#77 worksforme When would Job Monarch start working with newer Ganglia (3.1.*) somebody prakash.velayutham@…


Do you have a time line on when Job Monarch would be able to work with newer Ganglia (3.1.x) versions?

Thanks, Prakash

#31 duplicate with no jobs jobmond fills log files in no time somebody pk@…


the attached patch will make jobmond sleep() not when a PBSError occurs but when there are no jobs in the queue.

1 2 3 4 5 6 7 8 9 10 11 12 13
Note: See TracQuery for help on using queries.