Custom Query (101 matches)
Results (1 - 3 of 101)
Ticket | Resolution | Summary | Owner | Reporter |
---|---|---|---|---|
#279 | fixed | patch for SLURM usage | somebody | mrobbert@… |
Description |
related to recent changes in the pyslurm library that jobmond.py uses to interface with Slurm. It was recently re-worked to bring it up to date with the most recent Slurm API and along the way the seem to have changed some of their data structures. Below is a patch I used to get it to run at our site. I hope this helps. --- jobmond/jobmond.py 2014-01-20 09:24:08.000000000 -0700 +++ /usr/local/sbin/jobmond.py 2014-12-16 17:22:00.501223234 -0700 @@ -1306,7 +1306,7 @@ for node, attrs in slurm_nodes.items(): - ( num_state, name_state ) = attrs['node_state'] + name_state = attrs['node_state'] if name_state == 'DOWN': @@ -1371,7 +1371,7 @@ else: ppn = min_cpus - ( something, status_long ) = self.getAttr( attrs, 'job_state' ) + status_long = self.getAttr( attrs, 'job_state' ) status = 'Q' Thanks, Mike Robbert HPC Engineer Colorado School of Mines |
|||
#176 | fixed | jobmond sometimes crashes after a while | ramonb | ramonb |
Description |
with this Traceback (most recent call last): File "/usr/sbin/jobmond", line 2203, in <module> File "/usr/sbin/jobmond", line 2198, in main File "/usr/sbin/jobmond", line 1073, in run File "/usr/sbin/jobmond", line 909, in submitJobData File "/usr/sbin/jobmond", line 759, in multicastGmetric File "/usr/sbin/jobmond", line 1969, in __init__ File "/usr/lib/python2.7/socket.py", line 187, in __init__ socket.error: [Errno 24] Too many open files |
|||
#175 | fixed | conf.php file missing or misplaced | somebody | keberflores@… |
Description |
I've read in the point 2. of web subsection of configuration section in the readme file, that changes to /var/www/ganglia/addons/job_monarch/conf.php should be made to reflect own settings, but this file doesn't exists. In 1.0 the file exists in web/addons/job_monarch/conf.php, In 1.1 and 1.1.1 the file doesn't exists in that location, but in web/conf.php.in maybe the file should be relocated/renamed, or the install instructions be updated. |
Note: See TracQuery
for help on using queries.