source: branches/1.1/CHANGELOG @ 937

Last change on this file since 937 was 937, checked in by ramonb, 10 years ago


  • corrected old init.d script names to new names


  • updated
2        LEGEND  f: fixed - c: changed - a: added - r: removed
6    packaging)
8        f: Correctly set the JOBARCHIVE_RRDS in both jobarchived.conf and
9           web/
10        f: debian init.d script names in post/pre pkg corrected to new name
12    web)
14        c: column nodes renamed to: hosts
15        a: sorting by hosts now implemented
17    jobarchived)
19        f: now properly exits on fatal xml errors
20        f: prevent an exception from occurring when no timed-out jobs are found during
21           Housekeeping
23    jobmond)
25        f: BATCH_HOST_TRANSLATE no longer required in jobmond.conf
29    web)
31        a: archive search now has "include running jobs" option
32        c: rewritten short versus FQDN hostname detection: now works properly
33           with ganglia hosts not using FQDN hostnames
34        f: display of xml parsetime for overview. no longer display parsetime
35           for archive (no parsing done)
36        f: down/offline nodes are now properly marked in cluster image again
37        f: bug where "Unavailable" row would not be shown in overview summary
38           table
40    packaging)
42        c: completely redone and rewritten by Olivier Lahaye - thanks!
44    jobmond)
46        a: now supports SLURM Workload Manager!
47        a: warning if connecting to remote BATCH_SERVER is not supported by
48           selected BATCH_API
49        f: bug where incorrect commandline option would trigger traceback in
50           usage()
52    jobarchived)
54        a: now performs regular database Housekeeping every 20 job XML
55           iterations (previously only once at startup)
56        a: now checks if ARCHIVE_DATASOURCES are present in gmetad.conf
57        f: prevent an exception from occurring when determining datasource polling
58           interval
59        f: bug where config file handle was not closed
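Illustration for the jobarchived Housekeeping entry above: a minimal sketch of running a maintenance task every 20 iterations instead of only once at startup. The class and function names are hypothetical and are not taken from jobarchived itself. Whether jobarchived also still runs Housekeeping at startup is not stated in this changelog, so the sketch leaves that out.

```python
HOUSEKEEPING_INTERVAL = 20  # from the changelog entry: every 20 job XML iterations


class HousekeepingScheduler:
    """Counts job XML iterations and reports when housekeeping is due."""

    def __init__(self, interval=HOUSEKEEPING_INTERVAL):
        self.interval = interval
        self.iterations = 0

    def tick(self):
        """Register one iteration; return True when housekeeping should run."""
        self.iterations += 1
        return self.iterations % self.interval == 0


def run_iterations(count, scheduler=None):
    """Simulate `count` iterations and return the iteration numbers
    on which housekeeping would have been triggered."""
    if scheduler is None:
        scheduler = HousekeepingScheduler()
    ran_at = []
    for n in range(1, count + 1):
        if scheduler.tick():
            ran_at.append(n)  # a real daemon would call its cleanup routine here
    return ran_at
```

Counting iterations rather than wall-clock time keeps the cleanup tied to how often job XML is actually processed.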
63    jobmond)
65        a: now supports multiple udp send channels
66        a: now supports job arrays
67        c: updated Gmetric XDR protocol to version 3.1+ compatible
69        c: gmond.conf parsing has been rewritten to handle includes and
70           multiple send channels
71        c: METRIC_MAX_VAL_LEN is now determined from gmond.conf
72        c: utilize new job monarch protocol
74        f: can now handle new PBSQuery / pbs_python versions
75        f: default gmond.conf search location is now /etc/ganglia/gmond.conf
76        f: fatal errors are now printed to shell upon startup, not just syslog
77        f: more error checking and miscellaneous bugfixes
79    jobarchived)
81        r: no longer use pyPgSQL for postgres database
82        c: now use psycopg2 module for postgres database
84        a: job thread now utilizes db commits and rollbacks
85        a: now use USER/PASS authentication to database (instead of hostbased)
87        c: database schema: changed job_id to varchar to support job arrays
88        c: database schema: changed job_name max length to 255, just like
89           torque
90        c: database schema: added username/password role authentication
91        c: utilize new job monarch protocol
93        f: job thread no longer hangs when insert/update of a job in database
94           fails
95        f: rewrite of job (finished) detection: all finished jobs again
96           properly detected
97        f: job checking now done post-parsing not while parsing
98        f: more error checking and miscellaneous bugfixes
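Illustration for the "db commits and rollbacks" entry above: a minimal sketch of the commit-on-success, rollback-on-failure pattern. It uses the standard-library sqlite3 module as a stand-in so that it runs without a database server; jobarchived itself uses psycopg2 against postgres, where the same DB-API calls (commit, rollback) apply but the placeholder style is %s rather than ?. The table name "jobs" and the function names are hypothetical; the job_id and job_name columns follow the schema entries above.

```python
import sqlite3


def store_job(conn, job_id, job_name):
    """Insert one job; commit on success, roll back on failure.

    Returns True if the row was stored, False if the statement failed
    and the transaction was rolled back.
    """
    try:
        cur = conn.cursor()
        cur.execute(
            "INSERT INTO jobs (job_id, job_name) VALUES (?, ?)",
            (job_id, job_name),
        )
        conn.commit()
        return True
    except sqlite3.Error:
        # Undo the failed statement so the connection stays usable
        conn.rollback()
        return False


def demo():
    """Store one valid and one invalid job in an in-memory database."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE jobs ("
        "job_id VARCHAR PRIMARY KEY, "
        "job_name VARCHAR(255) NOT NULL)"
    )
    ok = store_job(conn, "1234[5]", "array-job")  # varchar id, as for job arrays
    failed = store_job(conn, "1235", None)        # violates NOT NULL
    rows = conn.execute("SELECT job_id FROM jobs ORDER BY job_id").fetchall()
    conn.close()
    return ok, failed, rows
```

Rolling back after a failed insert or update means one bad job record does not block the jobs that follow it.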
100    web)
102        r: removed Pie chart
103        r: removed TemplatePower
104        r: removed php ini_set's and time limit directive: should be handled in
105           php.ini
106        r: removed "Get Fresh Data" button: served no purpose anymore
107        a: now utilize Dwoo templates for html output
109        a: now use USER/PASS authentication to database (instead of hostbased)
110        a: ClusterImage now drops a shadow below nodes
111        a: RRDs now show "Last: Min: Avg: Max:" values in legend
113        c: utilize new job monarch protocol
114        c: all templates rewritten from TemplatePower to Dwoo
115        c: graph.php now used for overview and archive
116        c: RRDs job start/finish line is now dashed green/red line with legend
118        f: some dbase fields are now CAST to INT for php since postgres now
119           requires explicit casts
120        f: sort order descending/ascending is now correct
121        f: many, many speed and memory improvements
122        f: more error checking and miscellaneous bugfixes
126        jobmond)
127                a:      SGE support
128                        thanks to: Dave Love - d(d.o.t)love(a.t)liverpool(d.o.t)ac(d.o.t)uk
129                        for writing it!
130                a:      LSF support
131                        thanks to: Mahmoud Hanafi - mhanafi(a.t)csc(d.o.t)com
132                        for writing it!
133                a:      GMETRIC_TARGET is now parsed from gmond.conf
134                a:      GMETRIC_BINARY is now looked for in PATH
135                f:      queue selection support is now working
136                        thanks to: Craig West - cwest(a.t)astro(d.o.t)umass(d.o.t)edu
137                        for the patch
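Illustration for the GMETRIC_BINARY entry above: a minimal sketch of looking a binary up in PATH with the standard library. The function name and the rule that an explicitly configured path takes precedence are assumptions, not documented jobmond behaviour.

```python
import shutil


def find_gmetric(configured=None, name="gmetric"):
    """Return the path to the gmetric binary.

    An explicitly configured path wins (assumption); otherwise the first
    match in PATH is returned, or None when nothing is found.
    """
    if configured:
        return configured
    return shutil.which(name)
```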
138        web)
139                a:      large graphs link for job report
140                        thanks to: Craig West - cwest(a.t)astro(d.o.t)umass(d.o.t)edu
141                a:      SHOW_EMPTY_COLUMN, SHOW_EMPTY_ROW options for ClusterImage hostname parsing
145        other)
146                f:      updated INSTALL since "addons" directory is not included by default anymore in Ganglia
147                        thanks to: Steven DuChene linux(d.a.s.h)clusters(a.t)mindspring(d.o.t)com
148                        for reporting it
150        rpm)
151                f:      add "addons" directory since it's not included by default anymore in Ganglia
152                f:      properly rewrite WEBDIR path in %files when rebuilding rpms with Makefile
154        web)
155                f:      typo in empty_cpu variable: causing incorrect 'free cpu' count and similar errors
156                        thanks to: Craig West - cwest(a.t)astro(d.o.t)umass(d.o.t)edu
157                        for reporting it
158                f:      changed erroneous domain detection a little
159                        thanks to: Craig West - cwest(a.t)astro(d.o.t)umass(d.o.t)edu
160                        for reporting it
161                a:      now properly detects whether or not to use FQDN or short hostnames w/o domain
162                        thanks to: Craig West - cwest(a.t)astro(d.o.t)umass(d.o.t)edu
163                        thanks to: Jeffrey Sarlo - JSarlo(a.t)Central(d.o.t)UH(d.o.t)EDU
164                        for the extensive testing and for reporting it
166                        SPECIAL THANKS to the University of Houston for sending me a shirt!
168        jobarchived)
169                f:      properly catch postgres exception
170                f:      don't use debug_message while loading config file
174        web)
175                a:      allow per-cluster settings/override options: see CLUSTER_CONFS option
176                a:      clusterimage can now draw nodes at x,y position parsed from hostname
177                        see SORTBY_HOSTNAME for this in clusterconf/example.php
178                a:      clusterimage nodes are now clickable: has link to all jobs from that host
179                a:      clusterimage nodes now have a tooltip: displays hostname and jobids for now
180                a:      jobmonarch logo image
181                        thanks to: Robin Day
182                        for the design
183                a:      rrd graph of running/queued jobs to overview
184                a:      per-cluster settings for archive database
185                        thanks to: Alexis Michon - alexis(d.o.t)michon(a.t)ibcp(d.o.t)fr
186                        for the patch
188                c:      host archive view is now more complete and detailed in the same manner as
189                        Ganglia's own host view
190                c:      host archive view available metric list is now compiled from disk,
191                        so that the detailed archive host view works even when the node is currently down.
192                c:      removed size restrictions from detailed host archive view
194                f:      compatibility: removed php5 call
195                        thanks to: Alexis Michon - alexis(d.o.t)michon(a.t)ibcp(d.o.t)fr
196                        for the patch
197                f:      prevent negative cpu/node calculation
198                        thanks to: aloga(a.t)ifca(d.o.t)unican(d.o.t)es
199                        for the patch
200                f:      archive search not properly resetting nodes list
201                        thanks to: Alexis Michon - alexis(d.o.t)michon(a.t)ibcp(d.o.t)fr
202                        for the patch
203                f:      detailed host view from jobarchive was broken since hostbased support of 0.2
204                        now host view is properly set and parsed again
205                        thanks to: Alexis Michon - alexis(d.o.t)michon(a.t)ibcp(d.o.t)fr
206                        for reporting the bug and suggesting a patch
207                f:      bug where jobstart redline indicator in host detail graphs was set incorrectly
208                        or not at all due to a miscalculation in job times
209                f:      bug where hostimage headertext xoffset was miscalculated, causing the column names
210                        to overlap their position when the columnname was longer than the columnvalues
212        jobmond)
214                a:      syslog support
215                a:      report number of running/queued jobs as separate metrics
216                a:      native gmetric support, much faster and cleaner!
217                        thanks to: Nick Galbreath - nickg(a.t)modp(d.o.t)com
218                        for writing it and allowing inclusion in jobmond
220                f:      jobmond crash when multiple node amounts are requested in
221                        a queued job: numeric_node variable not initialized properly
222                        thanks to: aloga(a.t)ifca(d.o.t)unican(d.o.t)es
223                        for supplying the patch
224                        and many others for reporting and helping debug this
225                f:      hanging/blocked, increased cpu usage and halted reporting
226                        thanks to: Bas van der Vlies - basv(a.t)sara(d.o.t)nl
227                        for discovering the origin of the bug
228                        thanks to: Mickael Gastineau - gastineau(a.t)imcce(d.o.t)fr
229                        for reporting it and testing the fix
230                        thanks to: Craig West - cwest(a.t)astro(d.o.t)umass(d.o.t)edu
231                        for reporting it and testing the fix
232                f:      uninitialized variable in checkGmetricVersion()
233                        thanks to: Peter Kruse - pk(a.t)q-leap(d.o.t)com
234                        for the patch
235                f:      undefined PBSError
236                        thanks to: Peter Kruse - pk(a.t)q-leap(d.o.t)com
237                        for reporting it
239                r:      SGE support broken
241        jobarchived)
243                a:      can now use py-rrdtool api instead of pipes, much faster!
244                        install py-rrdtool to use this
245                        backwards compatible: falls back to pipes if module not installed
247                c:      all XML input was unicode-encoded, which could cause errors,
248                        now all properly converted to normal strings
250                f:      when XML data source (gmetad) is unavailable parsethread didn't return correctly
251                        which caused a large number of threads to spawn while consuming large amounts of memory
252                f:      autocreate clusterdirs in archivedir
253                f:      unhandled gather exception
254                f:      incorrect stop_timestamping when jobs finished
255                        thanks to: Alexis Michon - alexis(d.o.t)michon(a.t)ibcp(d.o.t)fr
256                        for finding and debugging/testing it
260        web)
261                f:      misc. optimization and bugfixes
262                f:      now fully compatible with latest PHP5 and PHP4
264                c:      cluster image now incorporates small text descr.
265                c:      monarch (cluster/host) images no longer displayed
266                        for clusters that are not jobmond enabled
267                c:      pie chart percentages are now cpu-based instead of node-based
269                a:      host template for Ganglia
270                        adds an extra monarch host image to Ganglia's host overview
271                        which displays/links to the jobs on that host
272                        NOTE!: be sure to copy/install new template from addons/templates
273                a:      (optional) nodes hostnames column
274                        thanks to: Daniel Barthel - daniel(d.o.t)barthel(a.t)nottingham(d.o.t)ac(d.o.t)uk
275                        for the suggestion
277        jobmond)
279                f:      when a job metric is longer than maximum metric length,
280                        the info is split up amongst multiple metrics
281                f:      no longer exit when batch server is unavailable
282                        thanks to: Peter Kruse - pk(a.t)q-leap(d.o.t)com
283                        for the patch
284                f:      fd closure bug causing stderr/stdout to remain open after daemonizing
286                c:      rearranged code to allow support for other batch systems
288                a:      (experimental) SGE (Sun Grid Engine) support as batch server
289                        thanks to: Babu Sundaram - babu(a.t)cs(d.o.t)uh(d.o.t)edu
290                        who developed it for an OSCAR Google SoC project
291                a:      pidfile support
292                        thanks to: Michael Jeanson - michael(a.t)ccs(d.o.t)usherbrooke(d.o.t)ca
293                        for the patch
294                a:      usage display
295                        thanks to: Michael Jeanson - michael(a.t)ccs(d.o.t)usherbrooke(d.o.t)ca
296                        for the patch
297                a:      queue selection support: ability to specify which QUEUEs to get jobinfo from
298                        thanks to: Michael Jeanson - michael(a.t)ccs(d.o.t)usherbrooke(d.o.t)ca
299                        for the patch
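Illustration for the jobmond fix above that splits a job metric longer than the maximum metric length across multiple metrics: a minimal sketch of chunking a value. The function name and the numbered-suffix naming scheme are hypothetical and do not necessarily match what jobmond emits.

```python
def split_metric(name, value, max_len):
    """Split `value` into chunks of at most `max_len` characters.

    Returns a list of (metric_name, chunk) pairs. A value that already
    fits is returned under its original name; otherwise each chunk gets
    a numbered suffix (hypothetical naming scheme).
    """
    if max_len <= 0:
        raise ValueError("max_len must be positive")
    if len(value) <= max_len:
        return [(name, value)]
    chunks = [value[i:i + max_len] for i in range(0, len(value), max_len)]
    return [("%s_%d" % (name, n), chunk) for n, chunk in enumerate(chunks)]
```

A receiver can rebuild the original value by concatenating the chunks in suffix order.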
301        jobarchived)
303                f:      XML retrieval for Ganglia version >= 3.0.3 working properly again
304                f:      database storing for Ganglia version >= 3.0.3 working properly again
305                f:      fd closure bug causing stderr/stdout to remain open after daemonizing
307                c:      misc. bugfixes to optimize XML connections
308                c:      misc. bugfixes for misc. minor issues
310                a:      cleaning of stale jobs in dbase: see JOB_TIMEOUT option
314        web)
316                f:      misc. layout bugs for overview & search
317                f:      bug that occurred when calculating the number of nodes when there
318                        was more than one job running on a machine
320                c:      column requested memory is now optional through conf.php
321                c:      search and overview tables are now full screen (100%)
322                c:      overview jobnames are now cutoff at max 9 characters
323                        to prevent (layout) skews in the tables
324                c:      overview graphs are no longer downsized
326                a:      (optional) column 'queued' (since) in overview
327                a:      search results (can) now have a SEARCH_RESULT_LIMIT
328                        this increases performance of the queries significantly!
329                a:      date/time format as displayed is now configurable through conf.php
331        jobmond)
333                a:      now reports 'queued since' (or creation time) of jobs
335        documentation)
337                f:      wrong e-mail address in INSTALL (doh!)
341        - First public release