Add flag parameters for topic diff #1448

parulsethi · 2017-06-25T18:59:32Z

Add flag parameters to optionally calculate the diff matrix, the diff diagonal, and annotations.

menshikh-iv · 2017-07-05T07:02:42Z

The main idea of this PR is that the diagonal case should calculate only the diagonal values (with or without annotations for the diagonal). This gives a free speed-up for specific use cases (for example, the visdom integration). Please change the logic (check for the diagonal case first) and refactor the code (remove the duplication).
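A minimal sketch of the requested logic, not the code from this PR: the diagonal case is checked first and computes only the pairwise distances between topics with the same index. The function name `topic_diff`, the helper `l1_distance`, and the toy arrays are assumptions made for illustration.

```python
import numpy as np


def topic_diff(d1, d2, distance_func, diagonal=False):
    """Hypothetical helper: distances between topic rows of d1 and d2."""
    if diagonal:
        # Diagonal case first: compare topic i of d1 only with topic i of d2.
        return np.array([distance_func(a, b) for a, b in zip(d1, d2)])
    # Full matrix: every topic of d1 against every topic of d2.
    z = np.zeros((len(d1), len(d2)))
    for i, a in enumerate(d1):
        for j, b in enumerate(d2):
            z[i, j] = distance_func(a, b)
    return z


def l1_distance(a, b):
    # Stand-in distance, chosen only to keep the example small.
    return float(np.abs(a - b).sum())


d1 = np.array([[0.5, 0.5], [0.9, 0.1]])
d2 = np.array([[0.5, 0.5], [0.1, 0.9]])
full = topic_diff(d1, d2, l1_distance)
diag = topic_diff(d1, d2, l1_distance, diagonal=True)
```

With `diagonal=True` the number of distance evaluations grows linearly with the number of topics instead of quadratically, which is where the speed-up comes from.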

parulsethi · 2017-07-09T23:08:55Z

gensim/models/ldamodel.py

-        annotation = [[None] * t1_size for _ in range(t2_size)]
+        if diagonal:
+            t_size = min(t1_size, t2_size)
+            z = np.zeros(t_size)


If the number of topics differs between the two models (so the diff is not a square matrix), what should be returned in the diagonal case? Either raise an error requiring num_topics to be the same, or return the diff of identically numbered topics up to the smaller num_topics of the two models? (The latter is implemented right now.)

menshikh-iv · 2017-07-11

I think you should raise an error.

parulsethi · 2017-07-12

Changed it to raise an error instead.
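The agreed behaviour could look roughly like the guard below. The thread only says that an error is raised; the function name, the exception type, and the message are assumptions.

```python
def check_same_num_topics(t1_size, t2_size, diagonal):
    """Hypothetical guard: the diagonal diff is only defined for equal topic counts."""
    if diagonal and t1_size != t2_size:
        raise ValueError(
            "Diagonal diff needs the same number of topics in both models, "
            "got %d and %d" % (t1_size, t2_size)
        )
```

The full-matrix case is left unrestricted, since a rectangular matrix is still well defined there.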

menshikh-iv · 2017-08-02T12:36:29Z

gensim/models/ldamodel.py

+                diff_terms[topic] = [pos_tokens, neg_tokens]
+
+        if normed:
+                if np.abs(np.max(z)) > 1e-8:


excess indent
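For reference, the same check with consistent four-space indentation might read as follows. Only the `normed` flag and the `1e-8` threshold appear in the hunk above; the helper name and the division by the maximum are assumptions about what the elided body does.

```python
import numpy as np


def normalize_diff(z, normed=True):
    """Hypothetical helper: scale distances so the largest value becomes 1."""
    z = np.asarray(z, dtype=float)
    if normed:
        max_val = np.max(z)
        # Skip the division when the maximum is numerically zero.
        if np.abs(max_val) > 1e-8:
            z = z / max_val
    return z
```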

menshikh-iv · 2017-08-02T12:39:10Z

gensim/models/ldamodel.py

+            # initialize z and annotation matrix
+            z = np.zeros((t1_size, t2_size))
+            if annotation:
+                diff_terms = np.zeros((t1_size, t2_size), dtype=list)


These are terms for both the difference and the intersection (not only diff terms); please rename this variable back to the old name (annotation).

parulsethi · 2017-08-02

annotation is already being used as a boolean parameter to indicate whether annotations should be calculated, so I've renamed the variable to annotation_terms.
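A small sketch of the naming after the rename: `annotation` stays the boolean flag and `annotation_terms` holds the per-cell word lists. The sizes and the example words are made up for illustration.

```python
import numpy as np

t1_size, t2_size = 2, 3
annotation = True  # boolean flag: should annotations be computed at all?

z = np.zeros((t1_size, t2_size))
annotation_terms = None
if annotation:
    # dtype=list is stored as a generic object array; every cell starts as 0.
    annotation_terms = np.zeros((t1_size, t2_size), dtype=list)
    # Each cell later holds [intersection words, difference words].
    annotation_terms[0, 1] = [["shared"], ["only_in_one"]]
```

Keeping the flag and the matrix under different names avoids overwriting the parameter inside the function body.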

menshikh-iv · 2017-08-02T12:40:21Z

gensim/models/ldamodel.py

-        for topic1 in range(t1_size):
-            for topic2 in range(t2_size):
+            z[topic] = distance_func(d1[topic1], d2[topic2])
+            if annotation:
                pos_tokens = fst_topics[topic1] & snd_topics[topic2]
                neg_tokens = fst_topics[topic1].symmetric_difference(snd_topics[topic2])

                pos_tokens = sample(pos_tokens, min(len(pos_tokens), n_ann_terms))


As I remember, you wanted to remove sample in another PR. Have you already done that in a different PR?

parulsethi · 2017-08-02

Yep, it's in #1484
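To make the discussed step concrete, here is a rough sketch of the sampled annotation as it appears in the hunk above. The function name `annotate` is an assumption, and the `sorted()` calls are an addition: `random.sample` no longer accepts sets on Python 3.11 and later, so the sets are converted to sequences first.

```python
from random import sample


def annotate(topic_a, topic_b, n_ann_terms):
    """Hypothetical helper mirroring the sampled annotation in the diff above."""
    pos_tokens = topic_a & topic_b                      # words in both topics
    neg_tokens = topic_a.symmetric_difference(topic_b)  # words in exactly one topic
    # Draw at most n_ann_terms words from each group.
    pos_tokens = sample(sorted(pos_tokens), min(len(pos_tokens), n_ann_terms))
    neg_tokens = sample(sorted(neg_tokens), min(len(neg_tokens), n_ann_terms))
    return [pos_tokens, neg_tokens]
```

Because the words are drawn at random, two calls on the same topics can return different annotations, which is one plausible motivation for removing `sample` in the follow-up PR.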

parulsethi added 2 commits June 25, 2017 06:06

add flags for diagnol and annotation

c0dea13

make matrix default

22875ef

parulsethi added 2 commits July 10, 2017 04:21

remove duplication

68eb54e

Merge branch 'develop' into diff_flag_params

a7e710e

parulsethi added 4 commits July 12, 2017 20:18

raise error on diff no. of topics

076ae38

add docstrings

4e7f3c7

Merge branch 'diff_flag_params' of https://github.com/parulsethi/gensim…

29c3403

… into diff_flag_params

Fix flake8

10f35f0

menshikh-iv suggested changes Aug 2, 2017

parulsethi added 5 commits August 2, 2017 20:12

rename annotation matrix variable

e52e4fb

add tests

31731ea

fix merge conflict

8ee0fed

fix indent

4aa8378

flake8 fixes

8ef695e

menshikh-iv approved these changes Aug 3, 2017

menshikh-iv merged commit 3cb8495 into piskvorky:develop Aug 3, 2017

parulsethi deleted the diff_flag_params branch August 3, 2017 10:02
