Opened 17 years ago

Closed 11 years ago

#24 closed defect (worksforme)

SGE support broken

Reported by: bastiaans Owned by: ramonb
Priority: normal Milestone: 1.0
Component: jobmond Version: 0.3
Keywords: Cc: d.love@…
Estimated Number of Hours:

Description

After going through the instructions, I attempted to execute
jobmond.py. When I did that, I received the following error message:
cluster1:/usr/local/sbin # /usr/local/sbin/jobmond.py -c /etc/jobmond.conf
Traceback (most recent call last):
 File "/usr/local/sbin/jobmond.py", line 814, in ?
   main()
 File "/usr/local/sbin/jobmond.py", line 807, in main
   gather.daemon()
UnboundLocalError: local variable 'gather' referenced before assignment

An examination of the code reveals that the SGE data gathering code
was commented out on line 792. Uncommenting it had the following
effect:
cluster1:/usr/local/sbin # /usr/local/sbin/jobmond.py -c /etc/jobmond.conf
 File "/usr/local/sbin/jobmond.py", line 797
   debug_msg( 0, "fatal error: BATCH_API set to 'sge' but python
module 'sge_drmaa' is not installed' )

                               ^
SyntaxError: EOL while scanning single-quoted string

Commenting out everything but "gather = SgeDataGatherer()" gave me the
following error:
cluster1:/usr/local/sbin # /usr/local/sbin/jobmond.py -c /etc/jobmond.conf
Traceback (most recent call last):
 File "/usr/local/sbin/jobmond.py", line 814, in ?
   main()
 File "/usr/local/sbin/jobmond.py", line 800, in main
   gather = SgeDataGatherer()
 File "/usr/local/sbin/jobmond.py", line 419, in __init__
   self.initSgeJobInfo()
 File "/usr/local/sbin/jobmond.py", line 426, in initSgeJobInfo
   self.qstatparser = SgeQstatXMLParser( SGE_QSTAT_XML_FILE )
NameError: global name 'SGE_QSTAT_XML_FILE' is not defined

At this point, I decided to search my systems for references to drmaa.
I saw several references to C++ example and header files related to
it. Is the sge_drmaa module supposed to be provided by Job Monarch or
Sun Grid Engine? 

Attachments (2)

sge.diff (29.9 KB) - added by anonymous 16 years ago.
patch for SGE 6.0
sge-2.diff (39.5 KB) - added by d.love@… 16 years ago.
untested merge of changes against revision 496

Download all attachments as: .zip

Change History (15)

comment:1 Changed 17 years ago by bastiaans

  • Owner changed from somebody to bastiaans
  • Priority changed from normal to major
  • Status changed from new to assigned

reported by Jeremy Stout

comment:2 Changed 17 years ago by bastiaans

made some fixes in changeset r347

comment:3 Changed 17 years ago by bastiaans

I downloaded the trunk version and tried it. When I started the
script, it failed to daemonize. If I comment out lines 326-328 and
start the program, I get the following output:

cluster1:/usr/local/sbin # /usr/local/sbin/jobmond.py -c /etc/jobmond.conf
Traceback (most recent call last):
 File "/usr/local/sbin/jobmond.py", line 813, in ?
   main()
 File "/usr/local/sbin/jobmond.py", line 806, in main
   gather.daemon()
 File "/usr/local/sbin/jobmond.py", line 334, in daemon
   self.run()
 File "/usr/local/sbin/jobmond.py", line 341, in run
   self.jobs = self.getJobData( self.jobs )
TypeError: getJobData() takes exactly 1 argument (2 given)

If I remove the "self.jobs" argument from lines 342 and 343, the
script daemonizes itself. However, I don't think it will produce the
desired output if I try to do that.

Thank you for your help. 

comment:4 Changed 17 years ago by bastiaans

done some additional fixes in changeset r348, to make datagatherer superclass more uniform for different types of datagatherer's

comment:5 Changed 17 years ago by bastiaans

  • Milestone changed from 0.2.1 to 0.3

comment:6 Changed 16 years ago by bastiaans

  • Priority changed from major to minor
  • Summary changed from SGE support bit buggy to SGE support broken

I can't fix this, I don't have access to SGE systems

Changed 16 years ago by anonymous

patch for SGE 6.0

comment:7 Changed 16 years ago by d.love@…

I've attached diffs tested with SGE 6.0u11. Unfortunately, they're against svn v. 450, and I can't currently test. I hope to set up jobmonarch with SGE 6.1 before long.

comment:8 Changed 16 years ago by bastiaans

  • Cc d.love@… added
  • Milestone 0.4 deleted
  • Type changed from defect to enhancement
  • Version changed from 0.2 to trunk

I will try to backport your patch into current trunk and send it out to the testers.

right now I'm working on release 0.3.1 with some small bugfixes first though.

Changed 16 years ago by d.love@…

untested merge of changes against revision 496

comment:9 Changed 16 years ago by bastiaans

  • Cc d.love@… removed
  • Milestone set to 0.4
  • Priority changed from minor to normal

this is now committed, awaiting test user results.

without any significant bug reports it will be included in version 0.4

comment:10 Changed 16 years ago by ramonb

  • Owner changed from bastiaans to ramonb
  • Status changed from assigned to new

my username changed

comment:11 Changed 15 years ago by ramonb

  • Cc d.love@… added
  • Status changed from new to assigned

On hold until I can receive confirmation from Dave Love or other SGE users.

comment:12 Changed 15 years ago by davel

I belatedly added some fixes to the trunk and 0.3 branch to make them consistent with what I've had running under a few versions of SGE 6.0. The result may work with 6.1, and should be easily fixable if it doesn't. The output has changed radically in 6.2, though, and that currently won't work. I'll see if I can fix it later, though there's no relevant doc, unfortunately.

comment:13 Changed 11 years ago by ramonb

  • Resolution set to worksforme
  • Status changed from assigned to closed
  • Type changed from enhancement to defect
  • Version changed from trunk to 0.3
Note: See TracTickets for help on using tickets.