Optimize gaussian performance #331

Closed
wants to merge 20 commits

Conversation

fehiepsi (Member) commented May 16, 2020

This small PR adds some optimizations explored in #315:

  • Lazily add two BlockVectors/BlockMatrixes. Currently, we convert them to numeric arrays and then add the results (see the sketch after this list).
  • Some optimizations to Cholesky/triangular_solve for scalar (1x1) matrices.
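A minimal sketch of what lazy block-wise addition looks like, using a simplified stand-in for funsor's BlockVector (the class and names here are illustrative only, not funsor's actual API):

```python
import torch


class SimpleBlockVector:
    """Simplified stand-in for funsor's BlockVector (illustrative only)."""

    def __init__(self, shape):
        self.shape = shape
        self.parts = {}  # maps (start, stop) slice bounds to a tensor part

    def as_tensor(self):
        # Densify: concatenate the parts in slice order.
        return torch.cat([self.parts[key] for key in sorted(self.parts)], dim=-1)

    def __add__(self, other):
        # Lazy add: combine parts block-by-block instead of densifying
        # both operands first.
        assert self.shape == other.shape
        result = SimpleBlockVector(self.shape)
        for key in set(self.parts) | set(other.parts):
            # NOTE: direct indexing assumes both operands share the same
            # block layout; the review discussion below is about exactly this.
            result.parts[key] = self.parts[key] + other.parts[key]
        return result


x = SimpleBlockVector((5,))
x.parts[(0, 2)], x.parts[(2, 5)] = torch.ones(2), torch.zeros(3)
y = SimpleBlockVector((5,))
y.parts[(0, 2)], y.parts[(2, 5)] = torch.ones(2), torch.ones(3)
print((x + y).as_tensor())  # tensor([2., 2., 1., 1., 1.])
```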

I explored a 2D cat to convert BlockMatrix to a tensor, but it does not improve performance. The number of cat ops does go down (for the Gaussian HMM with time_dim=6000, from 160 calls to 91), but the .reshape ops that follow them become expensive.
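For the scalar-matrix optimization in the second bullet, the intuition is that a 1x1 Cholesky factorization is just a square root and a 1x1 triangular solve is just a division. A hedged sketch of such a dispatch (written against the modern torch.linalg API; funsor's actual ops layer may differ):

```python
import torch


def cholesky(x):
    # A 1x1 Cholesky factor is just an elementwise sqrt, which avoids
    # dispatching to the much heavier batched factorization kernel.
    if x.size(-1) == 1:
        return x.sqrt()
    return torch.linalg.cholesky(x)


def triangular_solve(b, a, upper=False):
    # Solving the 1x1 triangular system a @ x = b is just a division
    # (broadcasts over batch dims and over the columns of b).
    if a.size(-1) == 1:
        return b / a
    return torch.linalg.solve_triangular(a, b, upper=upper)
```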

Testing GaussianHMM with batch_dim, time_dim, obs_dim, hidden_dim = 5, 6000, 3, 2, the time to evaluate log_prob drops from 192ms to 160ms.

Profiling code:

```python
import torch

import funsor
funsor.set_backend("torch")

from funsor.pyro.hmm import GaussianHMM
from funsor.testing import random_mvn
from pyro.distributions.util import broadcast_shape

batch_dim, time_dim, obs_dim, hidden_dim = 5, 6000, 3, 2

init_shape = (batch_dim,)
trans_mat_shape = trans_mvn_shape = obs_mat_shape = obs_mvn_shape = (batch_dim, time_dim)
init_dist = random_mvn(init_shape, hidden_dim)
trans_mat = torch.randn(trans_mat_shape + (hidden_dim, hidden_dim))
trans_dist = random_mvn(trans_mvn_shape, hidden_dim)
obs_mat = torch.randn(obs_mat_shape + (hidden_dim, obs_dim))
obs_dist = random_mvn(obs_mvn_shape, obs_dim)

actual_dist = GaussianHMM(init_dist, trans_mat, trans_dist, obs_mat, obs_dist)
shape = broadcast_shape(init_shape + (1,),
                        trans_mat_shape, trans_mvn_shape,
                        obs_mat_shape, obs_mvn_shape)
data = obs_dist.expand(shape).sample()
assert data.shape == actual_dist.shape()

# %time is an IPython magic; run this line in IPython/Jupyter.
%time actual_log_prob = actual_dist.log_prob(data)
```

@fehiepsi added the Blocked (blocked by other issues) label on May 16, 2020.
@eb8680 added and removed the Blocked label on May 17, 2020.
@fehiepsi removed the Blocked label on Jul 11, 2020.
fritzo (Member) left a comment:
Nice speedup. I believe there is a subtler compatibility condition though.

```python
shape = broadcast_shape(self.shape, other.shape)
matrix = BlockMatrix(shape)
for part in set(self.parts.keys()) | set(other.parts.keys()):
    a = self.parts[part]
```
fritzo (Member):

Won't this error if part in other.parts but part not in self.parts? And conversely for the following line?
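One way to avoid the KeyError is to merge with dict.get so a part present on only one side is copied through. A hypothetical helper illustrating the idea (not necessarily the fix that was merged):

```python
def merged_parts(lhs_parts, rhs_parts):
    # Merge two block-part dicts; a key present on only one side is copied
    # through instead of raising KeyError via direct indexing.
    out = {}
    for part in set(lhs_parts) | set(rhs_parts):
        a, b = lhs_parts.get(part), rhs_parts.get(part)
        out[part] = a + b if a is not None and b is not None else (b if a is None else a)
    return out


# e.g. merged_parts({"x": 1.0, "y": 2.0}, {"y": 3.0}) == {"x": 1.0, "y": 5.0}
```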

```python
keep_block = isinstance(lhs_info_vec, BlockVector)
rhs_info_vec, rhs_precision = align_gaussian(inputs, rhs, try_keeping_block_form=keep_block)

if keep_block and not isinstance(rhs_info_vec, BlockVector):
```
fritzo (Member):

I think you'll also need to test whether the two block forms are compatible. E.g., the following two block forms are not compatible (see the sketch after this list):

  • [0:1], [1:3]
  • [0:2], [2:3]
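A hypothetical helper making the compatibility condition concrete (illustration only, not part of the PR): two layouts are compatible when any two slices from opposite sides are either identical or disjoint, i.e. no partial overlaps.

```python
def blocks_compatible(lhs_slices, rhs_slices):
    # Slices are (start, stop) pairs. Incompatible means some slice from
    # one layout partially overlaps a slice from the other.
    for a0, a1 in lhs_slices:
        for b0, b1 in rhs_slices:
            overlaps = max(a0, b0) < min(a1, b1)
            if overlaps and (a0, a1) != (b0, b1):
                return False
    return True


assert blocks_compatible([(0, 1), (1, 3)], [(0, 1), (1, 3)])      # same layout
assert not blocks_compatible([(0, 1), (1, 3)], [(0, 2), (2, 3)])  # example above
```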

```python
        result = ops.cat(-1, *parts)
        if not get_tracing_state():
            assert result.shape == self.shape
        return result

    def __add__(self, other):
        assert isinstance(other, BlockVector)
```
fritzo (Member):

I think we'll also want either (1) an assertion that the two block forms are compatible, or (2) a branch that converts both via .as_tensor() and adds the results. Two block forms are incompatible if any of their slices partially overlap, e.g. [0:3], [3:10] versus [0:4], [4:10].
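Reusing the hypothetical SimpleBlockVector and blocks_compatible sketches above, option (2) might look like this (illustrative only, not the code that was merged):

```python
def add_block_vectors(lhs, rhs):
    # Take the cheap lazy path when the layouts line up; otherwise densify
    # both operands and add the results (option (2) above).
    if blocks_compatible(list(lhs.parts), list(rhs.parts)):
        return lhs + rhs  # stays in block form
    return lhs.as_tensor() + rhs.as_tensor()  # plain dense tensor
```

Note the mixed return type in this sketch: callers that always need a dense tensor can call .as_tensor() on the block-form result.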

fehiepsi (Member, Author) replied:

I remember being worried about this issue too, but I later found there is no test for it. So I assumed that each block corresponds to a variable and that we only add two BlockVectors when their common variables have consistent dimensions. If that is true, I'll add an assertion. Otherwise, you are right that we need a branch.

fehiepsi (Member, Author) replied:

Either way, we need a check for that consistency, so adding a branch does no harm after all. I'll do it.
