[342] | 1 | |
---|
[363] | 2 | LEGEND f: fixed - c: changed - a: added - r: removed |
---|
[361] | 3 | |
---|
[498] | 4 | 0.3.1: |
---|
| 5 | |
---|
| 6 | other) |
---|
| 7 | f: updated INSTALL since "addons" directory is not included by default anymore in Ganglia |
---|
| 8 | thanks to: Steven DuChene linux(d.a.s.h)clusters(a.t)mindspring(d.o.t)com |
---|
| 9 | for reporting it |
---|
| 10 | |
---|
| 11 | rpm) |
---|
| 12 | f: add "addons" directory since it's not included by default anymore in Ganglia |
---|
| 13 | |
---|
| 14 | web) |
---|
| 15 | f: typo in empty_cpu variable: causing incorrect 'free cpu' count and similar errors |
---|
| 16 | thanks to: Craig West - cwest(a.t)astro(d.o.t)umass(d.o.t)edu |
---|
| 17 | for reporting it |
---|
| 18 | f: changed domain detection a little |
---|
| 19 | thanks to: Jeffrey Sarlo - JSarlo(a.t)Central(d.o.t)UH(d.o.t)EDU |
---|
| 20 | for reporting it |
---|
| 21 | a: now properly detects whether or not to use FQDN or short hostnames w/o domain |
---|
| 22 | thanks to: Jeffrey Sarlo - JSarlo(a.t)Central(d.o.t)UH(d.o.t)EDU |
---|
| 23 | for the many testing and reporting it |
---|
| 24 | |
---|
| 25 | SPECIAL THANKS to the University of Houston for sending me a shirt! |
---|
| 26 | |
---|
[500] | 27 | jobarchived) |
---|
| 28 | f: properly catch postgres exception |
---|
| 29 | f: don't use debug_message while loading config file |
---|
| 30 | |
---|
[452] | 31 | 0.3: |
---|
[342] | 32 | |
---|
| 33 | web) |
---|
| 34 | a: allow per-cluster settings/override options: see CLUSTER_CONFS option |
---|
| 35 | a: clusterimage can now draw nodes at x,y position parsed from hostname |
---|
[427] | 36 | see SORTBY_HOSTNAME for this in clusterconf/example.php |
---|
[342] | 37 | a: clusterimage nodes are now clickable: has link to all jobs from that host |
---|
[427] | 38 | a: clusterimage nodes now have a tooltip: displays hostname and jobids for now |
---|
[345] | 39 | a: jobmonarch logo image |
---|
| 40 | thank to: Robin Day |
---|
| 41 | for the design |
---|
[414] | 42 | a: rrd graph of running/queued jobs to overview |
---|
[460] | 43 | a: per-cluster settings for archive database |
---|
| 44 | thanks to: Alexis Michon - alexis(d.o.t)michon(a.t)ibcp(d.o.t)fr |
---|
| 45 | for the patch |
---|
[342] | 46 | |
---|
[414] | 47 | c: host archive view is now more complete and detailed in the same manner as |
---|
| 48 | Ganglia's own host view |
---|
[427] | 49 | c: host archive view available metric list is now compiled from disk, |
---|
| 50 | so that the detailed archive host view works even when the node is currently down. |
---|
[400] | 51 | c: removed size restrictions from detailed host archive view |
---|
| 52 | |
---|
[465] | 53 | f: compatibility: removed php5 call |
---|
[460] | 54 | thanks to: Alexis Michon - alexis(d.o.t)michon(a.t)ibcp(d.o.t)fr |
---|
| 55 | for the patch |
---|
[458] | 56 | f: prevent negative cpu/node calculation |
---|
| 57 | thanks to: aloga(a.t)ifca(d.o.t)unican(d.o.t)es |
---|
| 58 | for the patch |
---|
[364] | 59 | f: archive search not properly resetting nodes list |
---|
| 60 | thanks to: Alexis Michon - alexis(d.o.t)michon(a.t)ibcp(d.o.t)fr |
---|
| 61 | for the patch |
---|
[400] | 62 | f: detailed host view from jobarchive was broken since hostbased support of 0.2 |
---|
| 63 | now host view is properly set and parsed again |
---|
| 64 | thanks to: Alexis Michon - alexis(d.o.t)michon(a.t)ibcp(d.o.t)fr |
---|
| 65 | for reporting the bug and suggesting a patch |
---|
[403] | 66 | f: bug where jobstart redline indicator in host detail graphs was set incorrectly |
---|
[414] | 67 | or not at all due to a miscalculation in job times |
---|
[427] | 68 | f: bug where hostimage headertext xoffset was miscalculated, causing the column names |
---|
| 69 | to overlap their position when the columnname was longer than the columnvalues |
---|
[364] | 70 | |
---|
[342] | 71 | jobmond) |
---|
| 72 | |
---|
[376] | 73 | a: syslog support |
---|
[427] | 74 | a: report number of running/queued jobs as seperate metrics |
---|
| 75 | a: native gmetric support, much faster and cleaner! |
---|
[361] | 76 | thanks to: Nick Galbreath - nickg(a.t)modp(d.o.t)com |
---|
| 77 | for writing it and allowing inclusion in jobmond |
---|
| 78 | |
---|
[452] | 79 | f: crashing jobmond when multiple nodes amounts are requested in |
---|
| 80 | a queued job: numeric_node variable not initialized properly |
---|
| 81 | thanks to: aloga(a.t)ifca(d.o.t)unican(d.o.t)es |
---|
| 82 | for supplying the patch |
---|
| 83 | and many others for reporting and helping debug this |
---|
[361] | 84 | f: hanging/blocked, increased cpu usage and halted reporting |
---|
| 85 | thanks to: Bas van der Vlies - basv(a.t)sara(d.o.t)nl |
---|
| 86 | for discovering the origin of the bug |
---|
| 87 | thanks to: Mickael Gastineau - gastineau(a.t)imcce(d.o.t)fr |
---|
| 88 | for reporting it and testing the fix |
---|
| 89 | thanks to: Craig West - cwest(a.t)astro(d.o.t)umass(d.o.t)edu |
---|
| 90 | for reporting it and testing the fix |
---|
[342] | 91 | f: uninitialized variable in checkGmetricVersion() |
---|
| 92 | thanks to: Peter Kruse - pk(a.t)q-leap(d.o.t)com |
---|
| 93 | for the patch |
---|
[364] | 94 | f: undefined PBSError |
---|
| 95 | thanks to: Peter Kruse - pk(a.t)q-leap(d.o.t)com |
---|
| 96 | for reporting it |
---|
[342] | 97 | |
---|
[363] | 98 | r: SGE support broken |
---|
| 99 | |
---|
[361] | 100 | jobarchived) |
---|
| 101 | |
---|
[427] | 102 | a: can now use py-rrdtool api instead of pipes, much faster! |
---|
[376] | 103 | install py-rrdtool to use this |
---|
| 104 | backwards compatible fails back to pipes if module not installed |
---|
[367] | 105 | |
---|
[427] | 106 | c: all XML input was uniencoded, which could cause errors, |
---|
| 107 | now all properly converted to normal strings |
---|
| 108 | |
---|
[470] | 109 | f: when XML data source (gmetad) is unavailable parsethread didn't return correctly |
---|
| 110 | which caused a large number of threads to spawn while consuming large amounts of memory |
---|
[376] | 111 | f: autocreate clusterdirs in archivedir |
---|
| 112 | f: unhandled gather exception |
---|
[361] | 113 | f: incorrect stop_timestamping when jobs finished |
---|
| 114 | thanks to: Alexis Michon - alexis(d.o.t)michon(a.t)ibcp(d.o.t)fr |
---|
[376] | 115 | for finding and debugging/testing it |
---|
[361] | 116 | |
---|
[308] | 117 | 0.2: |
---|
| 118 | |
---|
| 119 | web) |
---|
[342] | 120 | f: misc. optimization and bugfixes |
---|
| 121 | f: now fully compatible with latest PHP5 and PHP4 |
---|
[308] | 122 | |
---|
[342] | 123 | c: cluster image now incorporates small text descr. |
---|
| 124 | c: monarch (cluster/host) images no longer displayed |
---|
| 125 | for clusters that are not jobmond enabled |
---|
| 126 | c: pie chart percentages are now cpu-based instead of node-based |
---|
[308] | 127 | |
---|
[342] | 128 | a: host template for Ganglia |
---|
| 129 | adds a extra monarch host image to Ganglia's host overview |
---|
| 130 | which displays/links to the jobs on that host |
---|
| 131 | NOTE!: be sure to copy/install new template from addons/templates |
---|
| 132 | a: (optional) nodes hostnames column |
---|
| 133 | thanks to: Daniel Barthel - daniel(d.o.t)barthel(a.t)nottingham(d.o.t)ac(d.o.t)uk |
---|
| 134 | for the suggestion |
---|
[308] | 135 | |
---|
| 136 | jobmond) |
---|
| 137 | |
---|
[342] | 138 | f: when a job metric is longer than maximum metric length, |
---|
| 139 | the info is split up amongst multiple metrics |
---|
| 140 | f: no longer exit when batch server is unavailable |
---|
| 141 | thanks to: Peter Kruse - pk(a.t)q-leap(d.o.t)com |
---|
| 142 | for the patch |
---|
| 143 | f: fd closure bug causing stderr/stdout to remain open after daemonizing |
---|
[308] | 144 | |
---|
[342] | 145 | c: rearranged code to allow support for other batch systems |
---|
[308] | 146 | |
---|
[342] | 147 | a: (experimental) SGE (Sun Grid Engine) support as batch server |
---|
| 148 | thanks to: Babu Sundaram - babu(a.t)cs(d.o.t)uh(d.o.t)edu |
---|
| 149 | who developed it for a OSCAR's Google-SoC project |
---|
| 150 | a: pidfile support |
---|
| 151 | thanks to: Michael Jeanson - michael(a.t)ccs(d.o.t)usherbrooke(d.o.t)ca |
---|
| 152 | for the patch |
---|
| 153 | a: usage display |
---|
| 154 | thanks to: Michael Jeanson - michael(a.t)ccs(d.o.t)usherbrooke(d.o.t)ca |
---|
| 155 | for the patch |
---|
| 156 | a: queue selection support: ability to specify which QUEUE's to get jobinfo from |
---|
| 157 | thanks to: Michael Jeanson - michael(a.t)ccs(d.o.t)usherbrooke(d.o.t)ca |
---|
| 158 | for the patch |
---|
[308] | 159 | |
---|
| 160 | jobarchived) |
---|
| 161 | |
---|
[342] | 162 | f: XML retrieval for Ganglia version >= 3.0.3 working properly again |
---|
| 163 | f: database storing for Ganglia version >= 3.0.3 working properly again |
---|
| 164 | f: fd closure bug causing stderr/stdout to remain open after daemonizing |
---|
[308] | 165 | |
---|
[342] | 166 | c: misc. bugfixes to optimize XML connections |
---|
| 167 | c: misc. bugfixes for misc. minor issues |
---|
[308] | 168 | |
---|
[342] | 169 | a: cleaning of stale jobs in dbase: see JOB_TIMEOUT option |
---|
[308] | 170 | |
---|
[283] | 171 | 0.1.1: |
---|
[249] | 172 | |
---|
| 173 | web) |
---|
| 174 | |
---|
[342] | 175 | f: misc. layout bugs for overview & search |
---|
| 176 | f: bug that occured when calculating the number of nodes when there |
---|
| 177 | was more than one job running on a machine |
---|
[253] | 178 | |
---|
[342] | 179 | c: column requested memory is now optional through conf.php |
---|
| 180 | c: search and overview tables are now full screen (100%) |
---|
| 181 | c: overview jobnames are now cutoff at max 9 characters |
---|
| 182 | to prevent (layout) scews in the tables |
---|
| 183 | c: overview graphs are no longer downsized |
---|
[253] | 184 | |
---|
[342] | 185 | a: (optional) column 'queued' (since) in overview |
---|
| 186 | a: search results (can) now have a SEARCH_RESULT_LIMIT |
---|
| 187 | this increases performance of the query's significantly! |
---|
| 188 | a: date/time format as displayed is now configurable through conf.php |
---|
[249] | 189 | |
---|
| 190 | jobmond) |
---|
| 191 | |
---|
[342] | 192 | a: now reports 'queued since' (or creation time) of jobs |
---|
[249] | 193 | |
---|
| 194 | documentation) |
---|
| 195 | |
---|
[342] | 196 | f: wrong e-mail adress in INSTALL (doh!) |
---|
[249] | 197 | |
---|
[342] | 198 | 0.1: |
---|
[249] | 199 | |
---|
| 200 | - First public release |
---|