Custom Query (101 matches)

Results (25 - 27 of 101)

Ticket Resolution Summary Owner Reporter
#279 fixed patch for SLURM usage somebody mrobbert@…
Description
related to recent changes in the pyslurm library that jobmond.py uses to interface with Slurm. It was recently reworked to bring it up to date with the most recent Slurm API, and along the way they seem to have changed some of their data structures. Below is a patch I used to get it to run at our site. I hope this helps.

--- jobmond/jobmond.py  2014-01-20 09:24:08.000000000 -0700
+++ /usr/local/sbin/jobmond.py  2014-12-16 17:22:00.501223234 -0700
@@ -1306,7 +1306,7 @@

        for node, attrs in slurm_nodes.items():

-            ( num_state, name_state ) = attrs['node_state']
+            name_state = attrs['node_state']

            if name_state == 'DOWN':

@@ -1371,7 +1371,7 @@
            else:
                ppn = min_cpus

-            ( something, status_long ) = self.getAttr( attrs, 'job_state' )
+            status_long = self.getAttr( attrs, 'job_state' )

            status = 'Q'

Thanks,
Mike Robbert
HPC Engineer
Colorado School of Mines
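
The patch above replaces tuple unpacking with a plain assignment, which would break again on a site still running the older pyslurm. A minimal sketch of a helper that accepts either shape is shown below. The name `state_name` and the sample dictionaries are hypothetical and not part of jobmond.py; the two value shapes are inferred only from the before and after lines of the patch.

```python
def state_name(value):
    """Return the state name from either shape seen in the patch.

    Before the patch the value was unpacked as a (number, name) pair;
    after the patch it is used directly as the name.
    """
    if isinstance(value, (tuple, list)) and len(value) == 2:
        # Old shape: ( num_state, name_state )
        return value[1]
    # New shape: the name itself
    return value


# Hypothetical sample data mimicking both shapes of attrs['node_state'].
old_style_attrs = {'node_state': (1, 'DOWN')}
new_style_attrs = {'node_state': 'DOWN'}

for attrs in (old_style_attrs, new_style_attrs):
    name_state = state_name(attrs['node_state'])
    assert name_state == 'DOWN'
```

The same helper could presumably wrap the `self.getAttr( attrs, 'job_state' )` call touched by the second hunk, assuming it changed in the same way.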
#53 worksforme Error trying to run jobarchive ramonb mhanafi@…
Description

Looks like it doesn't find all the hosts and gives the following error. I have tried versions 0.3.1 and 0.4.

[root@aphrodite-adm jobarchived]# python jobarchived.py 
Mon 17 Mar 2008 15:37:36 - Checking database..
Mon 17 Mar 2008 15:37:36 - Check done.
Mon 17 Mar 2008 15:37:36 - Checking rrd archive..
Mon 17 Mar 2008 15:37:36 - Check done.
Mon 17 Mar 2008 15:37:36 - torque_xml_thread(): started.
Mon 17 Mar 2008 15:37:36 - torque_xml_thread(): Retrieving XML data..
Mon 17 Mar 2008 15:37:36 - torque_xml_thread(): Done retrieving.
Mon 17 Mar 2008 15:37:36 - ganglia_parse_thread(): Parsing XML..
Mon 17 Mar 2008 15:37:36 - main threading started.
Mon 17 Mar 2008 15:37:36 - XML: Processed 1492 elements - found 1 (updated) jobs
Mon 17 Mar 2008 15:37:36 - ganglia_xml_thread(): started.
Mon 17 Mar 2008 15:37:36 - ganglia_xml_thread(): Sleeping.. (15s)
Mon 17 Mar 2008 15:37:36 - torque_xml_thread(): Storing..
Mon 17 Mar 2008 15:37:36 - ganglia_parse_thread(): started.
Mon 17 Mar 2008 15:37:36 - ganglia_parse_thread(): Retrieving XML data..
Mon 17 Mar 2008 15:37:36 - ganglia_parse_thread(): Done retrieving.
Mon 17 Mar 2008 15:37:36 - ganglia_parse_thread(): Parsing XML..
Mon 17 Mar 2008 15:37:36 - ganglia_store_metric_thread(): started.
Mon 17 Mar 2008 15:37:36 - ganglia_store_metric_thread(): Storing data..
Mon 17 Mar 2008 15:37:36 - ganglia_store_thread(): started.
Mon 17 Mar 2008 15:37:36 - ganglia_store_thread(): Sleeping.. (360s)
Mon 17 Mar 2008 15:37:36 - Entering storeMetrics()
Mon 17 Mar 2008 15:37:36 - size of cluster 'aphrodite': 3 hosts 71 metrics 71 values 1027 bits 128 bytes 
Exception in thread store_metric_thread:
Traceback (most recent call last):
  File "/usr/lib64/python2.4/threading.py", line 442, in __bootstrap
    self.run()
  File "/usr/lib64/python2.4/threading.py", line 422, in run
    self.__target(*self.__args, **self.__kwargs)
  File "jobarchived.py", line 1378, in storeThread
    ret = self.myXMLHandler.storeMetrics()
  File "jobarchived.py", line 1104, in storeMetrics
    ret = rrdh.storeMetrics()
  File "jobarchived.py", line 1752, in storeMetrics
    create_ret = self.createCheck( hostname, metricname, period )
  File "jobarchived.py", line 1891, in createCheck
    heartbeat   = 8 * int( interval )
TypeError: int() argument must be a string or a number

Mon 17 Mar 2008 15:37:36 - ganglia_parse_thread(): Done parsing.
Mon 17 Mar 2008 15:37:36 - ganglia_parse_thread(): finished.
Mon 17 Mar 2008 15:37:36 - torque_xml_thread(): Done storing.
Mon 17 Mar 2008 15:37:36 - ganglia_parse_thread(): Done parsing.
Mon 17 Mar 2008 15:37:36 - torque_xml_thread(): Sleeping.. (15s)
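
The traceback ends at `heartbeat = 8 * int( interval )`, so `interval` held something that is neither a string nor a number. One plausible cause, which the ticket does not confirm, is that no interval was found for a host and the value was `None`. The sketch below shows how such a conversion could fail with a clearer message or fall back to a default. The function name `heartbeat_for` and its `default` parameter are hypothetical and do not exist in jobarchived.py.

```python
def heartbeat_for(interval, default=None):
    """Compute 8 * int(interval), the expression from the traceback.

    Assumption: a missing interval arrives as None. If a default is
    given it is used instead; otherwise a descriptive error is raised.
    """
    if interval is None:
        if default is None:
            raise ValueError("no polling interval available for this host")
        interval = default
    return 8 * int(interval)


# A string interval, as it might come out of parsed XML attributes.
assert heartbeat_for("15") == 120

# A missing interval with a fallback value supplied by the caller.
assert heartbeat_for(None, default=20) == 160
```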
#58 worksforme Error on php with 0.3.1 ramonb leotavaneiro@…
Description

Hi, I'm getting a blank image on the cluster page in Ganglia, and it is not showing the running jobs.

[Fri Jul 04 13:45:52 2008] [error] [client 10.25.28.31] PHP Warning: Invalid argument supplied for foreach() in /var/www/html/mpet_linux/addons/job_monarch/libtoga.php on line 39, referer: http://ganglia2/mpet_linux/?m=load_one&r=hour&s=descending&c=MPET+Linux+bwr4&h=&sh=1&hc=4
