ticket summary component version milestone type owner status created _changetime _description _reporter 281 global name 'cfgcluster' is not defined jobarchived trunk 1.2 defect ramonb assigned 2015-08-25T14:18:51+02:00 2015-08-27T11:31:44+02:00 "If ARCHIVE_PATH does not exists, jobarchived crashes trying to create it. {{{ Tue 25 Aug 2015 13:59:59 - Housekeeping: checking RRD archive for cluster DIGITEO Cluster (may take a while).. Traceback (most recent call last): File ""/usr/sbin/jobarchived"", line 2398, in main() File ""/usr/sbin/jobarchived"", line 2345, in main run() File ""/usr/sbin/jobarchived"", line 2303, in run myGangliaProcessors.append( GangliaXMLProcessor( myXMLSource, myDataStore, archive_cluster ) ) File ""/usr/sbin/jobarchived"", line 1552, in __init__ self.myXMLHandler = GangliaXMLHandler( self.config, self.ds, self.clusterName ) File ""/usr/sbin/jobarchived"", line 1214, in __init__ self.gatherCluster() File ""/usr/sbin/jobarchived"", line 1235, in gatherCluster cluster_dir = '%s/%s' %( check_dir(ARCHIVE_PATH), cfgcluster ) NameError: global name 'cfgcluster' is not defined }}}" olahaye 177 jobmond crash jobmond 1.1.2 1.1.3 defect ramonb reopened 2014-01-20T12:02:18+01:00 2014-02-01T18:22:37+01:00 "{{{ +2+12+19+4root+0+0+1+4NULL+2+12+59+4rootTraceback (most recent call last): File ""/usr/sbin/jobmond"", line 2220, in main() File ""/usr/sbin/jobmond"", line 2215, in main gather.run() File ""/usr/sbin/jobmond"", line 1085, in run self.getJobData() File ""/usr/sbin/jobmond"", line 1796, in getJobData joblist = self.pq.getjobs() File ""/usr/lib/pymodules/python2.7/pbs/PBSQuery.py"", line 350, in getjobs self._statjob('', attrib_list) File ""/usr/lib/pymodules/python2.7/pbs/PBSQuery.py"", line 331, in _statjob jobs = pbs.pbs_statjob(self.con, job_name, self.attribs, 'NULL') TypeError: not a list }}} This could also be a pbs_python bug" ramonb 168 graph start/end times don't reset between single result archive searches web 1.1 1.2 defect ramonb assigned 2013-05-18T18:01:12+02:00 2013-08-27T15:00:48+02:00 "If you first search a specific job in archive: * id = something While in search results, you alter the query, still the same graph start/end times are used" ramonb 173 archiving multiple clusters malfunction jobarchived 1.1 1.2 defect ramonb assigned 2013-07-31T15:13:00+02:00 2014-01-20T17:04:56+01:00 "seems archiving multiple clusters does not work properly in 1.1.0 for starters, if you set 2 ARCHIVE_DATASOURCES, there seems to be host overlap and clashes i.e. ""Render Cluster"" : 22 hosts and ""LISA Cluster"" : 628 hosts but jobarchived's debug output shows: {{{ Wed 31 Jul 2013 14:22:02 - size of cluster 'Render Cluster': 0 hosts 0 metrics 0 values 0 bits 0 bytes Wed 31 Jul 2013 14:22:02 - size of cluster 'LISA Cluster': 1 hosts 23 metrics 23 values 367 bits 45 bytes Wed 31 Jul 2013 14:23:02 - size of cluster 'Render Cluster': 650 hosts 81378 metrics 273561 values 4325519 bits 540689 bytes Wed 31 Jul 2013 14:30:03 - size of cluster 'Render Cluster': 650 hosts 81378 metrics 1057378 values 16719410 bits 2089926 bytes Wed 31 Jul 2013 14:37:04 - size of cluster 'Render Cluster': 650 hosts 81378 metrics 829343 values 13115240 bits 1639405 bytes Wed 31 Jul 2013 14:43:26 - size of cluster 'LISA Cluster': 650 hosts 81378 metrics 369914 values 5850154 bits 731269 bytes Wed 31 Jul 2013 14:44:04 - size of cluster 'Render Cluster': 650 hosts 81378 metrics 361926 values 5723618 bits 715452 bytes Wed 31 Jul 2013 14:44:50 - size of cluster 'LISA Cluster': 650 hosts 81378 metrics 313158 values 4953612 bits 619201 bytes Wed 31 Jul 2013 14:51:05 - size of cluster 'Render Cluster': 650 hosts 81378 metrics 212615 values 3364312 bits 420539 bytes Wed 31 Jul 2013 14:51:40 - size of cluster 'LISA Cluster': 650 hosts 81378 metrics 188598 values 2984378 bits 373047 bytes Wed 31 Jul 2013 14:52:38 - size of cluster 'LISA Cluster': 650 hosts 82770 metrics 144556 values 2286402 bits 285800 bytes Wed 31 Jul 2013 14:58:04 - size of cluster 'Render Cluster': 650 hosts 82983 metrics 162484 values 2569767 bits 321220 bytes Wed 31 Jul 2013 14:58:21 - size of cluster 'LISA Cluster': 650 hosts 82983 metrics 157824 values 2496428 bits 312053 bytes Wed 31 Jul 2013 15:05:04 - size of cluster 'Render Cluster': 650 hosts 82983 metrics 309337 values 4896702 bits 612087 bytes Wed 31 Jul 2013 15:05:07 - size of cluster 'LISA Cluster': 650 hosts 82983 metrics 315044 values 4987062 bits 623382 bytes Wed 31 Jul 2013 15:10:54 - size of cluster 'LISA Cluster': 650 hosts 82983 metrics 290641 values 4597175 bits 574646 bytes }}} it looks like all hosts are added to both clusters: 628 + 22 = 650. This results in hosts being stored twice in RRD. In addition jobs from Render Cluster do not seem to be picked up, but only LISA Cluster's jobs." ramonb 280 patch for bind_hostname general 1.1.1 1.2 defect somebody new 2015-08-24T14:48:19+02:00 2015-08-24T14:48:19+02:00 "{{{ I've attached a patch which adds support for the udp_send_channel bind_hostname option. This addresses issues running on systems with multiple network adapters. }}}" joshua.weage@… 282 Cannot find python module general 1.1.1 1.2 defect somebody new 2017-07-07T16:15:51+02:00 2017-07-07T16:15:51+02:00 "giving this error Starting SYSV: jobmond reports information on batch jobs as Ganglia metrics... Jul 08 01:14:10 p-test jobmond[30239]: Starting Job Monitoring Daemon: FATAL ERROR: BATCH_API set to 'pbs' but python module 'pbs_python' is not found or installed Jul 08 01:14:10 p-test jobmond[30239]: /usr/lib64/python2.7/site-packages/pbs/_pbs.so: undefined symbol: pbs_submit_hash " anonymous 283 I have successfully monitored the usage of PBS but in slurm not able to get the jobarchive working. Do we need a different database to dump. general 1.1.2 1.2 defect somebody new 2017-07-26T14:09:56+02:00 2017-07-26T14:09:56+02:00 saurabhaninave@… 284 THis New Ticket general 1.1 1.2 defect somebody new 2018-04-04T13:46:38+02:00 2018-04-04T13:46:38+02:00 "THis New Ticket"">