Make HeatMap more general #849

philippjfr · 2016-09-05T16:14:19Z

The HeatMap Element has gone through a lot of redesign, initially being defined as a Raster type and then allowing columnar data. This PR refactors HeatMap to improve the simplify aggregation step and allow any number of value dimensions which can appear as hover information in bokeh. Still a WIP and I need to ensure the code is backward compatible (it should be).

philippjfr · 2017-01-08T14:27:11Z

@jlstevens The PR is now ready for review, I'll rebuild the test data shortly. The test data will have to be updated because HeatMap no longer pads with NaN values in the constructor, instead it computes a 2D gridded aggregate, which is used for display. I've also readded the previously supported raster as a property so it can go through a deprecation cycle. Representing the HeatMap as a gridded aggregate is much more flexible since it allows multiple value dimensions.

philippjfr · 2017-01-08T14:31:16Z

Still need to add thorough unit tests for the aggregation. I'm also now wondering whether I should add a warning when you pass non-aggregated data to a HeatMap, e.g. there are two different values for index ('A', 'a') in the heatmap, it would just silently ignore the second value. Really it should warn you that you should aggregate your data using some function (mean, max, min) before passing it to the HeatMap.

philippjfr · 2017-01-08T14:36:17Z

Also deprecates support for unpickling the original HeatMap format (which was replaced in 1.4).

jlstevens · 2017-01-08T16:44:58Z

holoviews/element/util.py

+
+
+def get_2d_aggregate(obj):
+    """


Perhaps this would be better expressed as an operation? Then maybe it could have a minimal docstring example in the class docstring?

jlstevens · 2017-01-08T16:45:18Z

holoviews/element/raster.py

-                                 for k1, k2 in product(keys1, keys2)})
-            return dense_map.dframe()
-        return super(HeatMap, self).dframe()
+        self.gridded = get_2d_aggregate(self)


Nice to see how much HeatMap has been simplified!

That said, it isn't immediately obvious that gridded is now a Dataset. Not sure I am necessarily recommending changing the name as gridded_dataset is awkward...

jlstevens · 2017-01-08T16:45:59Z

holoviews/element/raster.py

@@ -383,85 +381,18 @@ class HeatMap(Dataset, Element2D):

    vdims = param.List(default=[Dimension('z')])

-    def __init__(self, data, extents=None, **params):
+    depth = 1


I might have forgotten...what is this depth class attribute?

I think this may be wrong now, will have to look into it.

Wasn't needed at all in the end, removed it.

jlstevens · 2017-01-08T16:50:30Z

holoviews/plotting/bokeh/raster.py

@@ -130,26 +136,31 @@ class HeatmapPlot(ColorbarPlot):
    def _axes_props(self, plots, subplots, element, ranges):
        dims = element.dimensions()
        labels = self._get_axis_labels(dims)
-        xvals, yvals = [element.dimension_values(i, False)
+        agg = element.gridded
+        xvals, yvals = [unique_array(agg.dimension_values(i, False))


I thought gridded Datasets have the 1D coordinate arrays available. Is the uniqueness being applied over the 2D set of samples or the 1D sequence?

Yes, good point, no longer any need for the unique_array here.

jlstevens · 2017-01-08T16:51:37Z

holoviews/plotting/bokeh/raster.py

                            for i in range(2)]
            data = {x: xvals, y: yvals, z: zvals}

+        if 'hover' in self.tools+self.default_tools:
+            for vdim in element.vdims[1:]:
+                data[vdim.name] = ['' if is_nan(v) else v


Wondering if an empty string really suggests NaN. 'NaN' would be explicit but might look noisy.

Good point, I'm now using masked arrays to represent the data, in matplotlib the NaNs are therefore represented by -, which might be better.

Yes, I think - might be a good compromise.

jlstevens · 2017-01-08T16:53:24Z

holoviews/plotting/mpl/raster.py

+        shape = data.shape
+        cmap_name = style.pop('cmap', None)
+        cmap = copy.copy(plt.cm.get_cmap('gray' if cmap_name is None else cmap_name))
+        cmap.set_bad('w', 1.)


Might want to make this a plot option at some point instead of hard coding 'w'.

Again good point, indeed we already expose this via clipping_colors, should hook that in here.

I also find it curious that you are using copy.copy on a colormap - which suggests you are mutating it. I guess set_bad must have side-effects which explains the copying...

jlstevens · 2017-01-08T16:59:38Z

Ok, I've made my comments for now (and you have already replied to most of them). The biggest suggestion is that get_2d_aggregate might be better expressed as an operation (if that makes sense).

philippjfr · 2017-01-08T17:02:47Z

The biggest suggestion is that get_2d_aggregate might be better expressed as an operation (if that makes sense).

Yes, get_2d_aggregate is basically a fairly crude approximation of datashader aggregation for categorical key dimensions (i.e. 2D aggregation without the binning). Unfortunately a fair bit of complexity is required to allow aggregating without sorting the key dimensions (just added topological sorting to make that work properly).

philippjfr · 2017-01-08T19:11:58Z

@jlstevens Once tests are passing this is ready for a second review and then merge.

philippjfr · 2017-01-08T21:49:31Z

@jlstevens Tests now passing on the PR build.

jlstevens · 2017-01-09T15:54:45Z

holoviews/core/util.py

@@ -647,6 +647,30 @@ def walk_depth_first(name):
                                    (names_by_level.get(i, None)
                                     for i in itertools.count())))

+
+def is_cyclic(graph):
+    """Return True if the directed graph g has a cycle."""


What is the representation of the graph? A list of edges as tuples? Would be good to mention in the docstring.

I'm guessing the representation is similar as in one_to_one...even so, probably worth mentioning..

Right, all three methods here (sort_topologically, cyclical and one_to_one) use the same representation, which is mapping between nodes and edges, will add the docstring.

jlstevens · 2017-01-09T15:57:13Z

holoviews/element/util.py

+    return np.NaN
+
+
+class categorical_aggregate2d(ElementOperation):


Looks great! I was just wondering if you want to keep this class in util or move it to operation.element?

It's imported there but can't be moved, cyclical imports again.

Ok, having it available for operation.element is fine.

jlstevens · 2017-01-09T15:58:40Z

holoviews/element/util.py

+        Generates a categorical 2D aggregate by inserting NaNs at all
+        cross-product locations that do not already have a value assigned.
+        Returns a 2D gridded Dataset object.
+        """


Quite a long method...if you see chunks that could be split up into helper methods, that might be sensible. Up to you though!

Happy to split it up.

jlstevens · 2017-01-09T15:59:27Z

I made three more comments for you to reply to. Otherwise looks good and I expect this will be merged very soon!

philippjfr · 2017-01-09T18:09:42Z

Latest comments addressed, ready to merge when tests pass.

jlstevens · 2017-01-09T18:35:06Z

Tests passed. Merging!

philippjfr added status: WIP in progress labels Sep 5, 2016

philippjfr mentioned this pull request Sep 13, 2016

Bokeh colorbars #861

Merged

4 tasks

philippjfr force-pushed the nd_heatmap branch from 9ee8034 to 6d2eca3 Compare September 19, 2016 11:28

philippjfr added this to the v1.7.0 milestone Nov 16, 2016

philippjfr force-pushed the nd_heatmap branch from 6d2eca3 to 097680b Compare December 10, 2016 23:55

philippjfr added 4 commits January 8, 2017 00:49

Added is_nan utility

bec8024

Added functions to generate dense 2D aggregate from coordinates

339f988

Simplified HeatMap and allowed any number of value dimensions

1d3d57e

Fixes for HeatMap implementations

69a9793

philippjfr force-pushed the nd_heatmap branch from 097680b to 69a9793 Compare January 8, 2017 00:50

Fixed missing imports

efd4bd9

philippjfr force-pushed the nd_heatmap branch from 125c2a4 to efd4bd9 Compare January 8, 2017 02:21

philippjfr added tag: API tag: component: data and removed in progress status: WIP labels Jan 8, 2017

philippjfr added 4 commits January 8, 2017 12:51

Added backward compatible raster property on HeatMap

17651f0

HeatMap now pre-computes gridded representation

f3543e6

Fixes for HeatMap aggregation

843387c

Made the get_2d_aggregate helper function general

29f47c9

philippjfr requested a review from jlstevens January 8, 2017 14:32

Fixed bug in HeatmapPlot

3f4b073

philippjfr force-pushed the nd_heatmap branch from b8289fe to 3f4b073 Compare January 8, 2017 15:54

Added unit tests for HeatMap aggregation

d68485f

jlstevens reviewed Jan 8, 2017

View reviewed changes

Retain global ordering of y-value dimensions

143c301

philippjfr added 5 commits January 8, 2017 17:49

Made categorical_aggregate2d an ElementOperation

0a91dce

Small optimizations for categorical_aggregate2D

03cebf6

Cleaned up HeatMap plotting classes

844c1ad

Improved formatting for NaNs in HeatMap hover and annotations

fb4b207

Removed depth on HeatMap

fcac23e

philippjfr added 3 commits January 8, 2017 19:14

Removed unused variable

d380d08

Fixes for categorical_aggregate2d ordering

dcae11f

Fixed and simplified one-to-one mapping function

9082070

jlstevens reviewed Jan 9, 2017

View reviewed changes

philippjfr added 2 commits January 9, 2017 18:08

Added docstrings for graph utility functions

f5998f2

Split categorical_aggregate2d into a few methods

050c4c7

jlstevens merged commit 66901bc into master Jan 9, 2017

philippjfr mentioned this pull request Jan 9, 2017

Followup fixes for HeatMap generalization #1043

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make HeatMap more general #849

Make HeatMap more general #849

philippjfr commented Sep 5, 2016 •

edited

Loading

philippjfr commented Jan 8, 2017

philippjfr commented Jan 8, 2017 •

edited

Loading

philippjfr commented Jan 8, 2017

jlstevens Jan 8, 2017

jlstevens Jan 8, 2017

jlstevens Jan 8, 2017

jlstevens Jan 8, 2017

philippjfr Jan 8, 2017

philippjfr Jan 8, 2017

jlstevens Jan 8, 2017

philippjfr Jan 8, 2017

jlstevens Jan 8, 2017

philippjfr Jan 8, 2017

jlstevens Jan 8, 2017

jlstevens Jan 8, 2017

philippjfr Jan 8, 2017

jlstevens Jan 8, 2017

jlstevens commented Jan 8, 2017

philippjfr commented Jan 8, 2017 •

edited

Loading

philippjfr commented Jan 8, 2017

philippjfr commented Jan 8, 2017

jlstevens Jan 9, 2017

jlstevens Jan 9, 2017

philippjfr Jan 9, 2017

jlstevens Jan 9, 2017 •

edited

Loading

philippjfr Jan 9, 2017

jlstevens Jan 9, 2017

jlstevens Jan 9, 2017

philippjfr Jan 9, 2017

jlstevens commented Jan 9, 2017

philippjfr commented Jan 9, 2017

jlstevens commented Jan 9, 2017

		return np.NaN


		class categorical_aggregate2d(ElementOperation):

Make HeatMap more general #849

Make HeatMap more general #849

Conversation

philippjfr commented Sep 5, 2016 • edited Loading

philippjfr commented Jan 8, 2017

philippjfr commented Jan 8, 2017 • edited Loading

philippjfr commented Jan 8, 2017

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jlstevens commented Jan 8, 2017

philippjfr commented Jan 8, 2017 • edited Loading

philippjfr commented Jan 8, 2017

philippjfr commented Jan 8, 2017

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jlstevens Jan 9, 2017 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jlstevens commented Jan 9, 2017

philippjfr commented Jan 9, 2017

jlstevens commented Jan 9, 2017

philippjfr commented Sep 5, 2016 •

edited

Loading

philippjfr commented Jan 8, 2017 •

edited

Loading

philippjfr commented Jan 8, 2017 •

edited

Loading

jlstevens Jan 9, 2017 •

edited

Loading