Custom Query (101 matches)
Results (25 - 27 of 101)
Ticket | Resolution | Summary | Owner | Reporter |
---|---|---|---|---|
#279 | fixed | patch for SLURM usage | somebody | mrobbert@… |
Description |
related to recent changes in the pyslurm library that jobmond.py uses to interface with Slurm. It was recently re-worked to bring it up to date with the most recent Slurm API and along the way the seem to have changed some of their data structures. Below is a patch I used to get it to run at our site. I hope this helps. --- jobmond/jobmond.py 2014-01-20 09:24:08.000000000 -0700 +++ /usr/local/sbin/jobmond.py 2014-12-16 17:22:00.501223234 -0700 @@ -1306,7 +1306,7 @@ for node, attrs in slurm_nodes.items(): - ( num_state, name_state ) = attrs['node_state'] + name_state = attrs['node_state'] if name_state == 'DOWN': @@ -1371,7 +1371,7 @@ else: ppn = min_cpus - ( something, status_long ) = self.getAttr( attrs, 'job_state' ) + status_long = self.getAttr( attrs, 'job_state' ) status = 'Q' Thanks, Mike Robbert HPC Engineer Colorado School of Mines |
|||
#53 | worksforme | Error trying to run jobarchive | ramonb | mhanafi@… |
Description |
Looks like it doesn't find all the hosts and give the following error... I have tried version 0.3.1 and 0.4 [root@aphrodite-adm jobarchived]# python jobarchived.py Mon 17 Mar 2008 15:37:36 - Checking database.. Mon 17 Mar 2008 15:37:36 - Check done. Mon 17 Mar 2008 15:37:36 - Checking rrd archive.. Mon 17 Mar 2008 15:37:36 - Check done. Mon 17 Mar 2008 15:37:36 - torque_xml_thread(): started. Mon 17 Mar 2008 15:37:36 - torque_xml_thread(): Retrieving XML data.. Mon 17 Mar 2008 15:37:36 - torque_xml_thread(): Done retrieving. Mon 17 Mar 2008 15:37:36 - ganglia_parse_thread(): Parsing XML.. Mon 17 Mar 2008 15:37:36 - main threading started. Mon 17 Mar 2008 15:37:36 - XML: Processed 1492 elements - found 1 (updated) jobs Mon 17 Mar 2008 15:37:36 - ganglia_xml_thread(): started. Mon 17 Mar 2008 15:37:36 - ganglia_xml_thread(): Sleeping.. (15s) Mon 17 Mar 2008 15:37:36 - torque_xml_thread(): Storing.. Mon 17 Mar 2008 15:37:36 - ganglia_parse_thread(): started. Mon 17 Mar 2008 15:37:36 - ganglia_parse_thread(): Retrieving XML data.. Mon 17 Mar 2008 15:37:36 - ganglia_parse_thread(): Done retrieving. Mon 17 Mar 2008 15:37:36 - ganglia_parse_thread(): Parsing XML.. Mon 17 Mar 2008 15:37:36 - ganglia_store_metric_thread(): started. Mon 17 Mar 2008 15:37:36 - ganglia_store_metric_thread(): Storing data.. Mon 17 Mar 2008 15:37:36 - ganglia_store_thread(): started. Mon 17 Mar 2008 15:37:36 - ganglia_store_thread(): Sleeping.. (360s) Mon 17 Mar 2008 15:37:36 - Entering storeMetrics() Mon 17 Mar 2008 15:37:36 - size of cluster 'aphrodite': 3 hosts 71 metrics 71 values 1027 bits 128 bytes Exception in thread store_metric_thread: Traceback (most recent call last): File "/usr/lib64/python2.4/threading.py", line 442, in __bootstrap self.run() File "/usr/lib64/python2.4/threading.py", line 422, in run self.__target(*self.__args, **self.__kwargs) File "jobarchived.py", line 1378, in storeThread ret = self.myXMLHandler.storeMetrics() File "jobarchived.py", line 1104, in storeMetrics ret = rrdh.storeMetrics() File "jobarchived.py", line 1752, in storeMetrics create_ret = self.createCheck( hostname, metricname, period ) File "jobarchived.py", line 1891, in createCheck heartbeat = 8 * int( interval ) TypeError: int() argument must be a string or a number Mon 17 Mar 2008 15:37:36 - ganglia_parse_thread(): Done parsing. Mon 17 Mar 2008 15:37:36 - ganglia_parse_thread(): finished. Mon 17 Mar 2008 15:37:36 - torque_xml_thread(): Done storing. Mon 17 Mar 2008 15:37:36 - ganglia_parse_thread(): Done parsing. Mon 17 Mar 2008 15:37:36 - torque_xml_thread(): Sleeping.. (15s) |
|||
#58 | worksforme | Error on php with 0.3.1 | ramonb | leotavaneiro@… |
Description |
Hi, I'm getting a blank image on cluster page on ganglia and it is not showing the running jobs. [Fri Jul 04 13:45:52 2008] [error] [client 10.25.28.31] PHP Warning: Invalid argument supplied for foreach() in /var/www/html/mpet_linux/addons/job_monarch/libtoga.php on line 39, referer: http://ganglia2/mpet_linux/?m=load_one&r=hour&s=descending&c=MPET+Linux+bwr4&h=&sh=1&hc=4 |
Note: See TracQuery
for help on using queries.