Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

多次执行lpa算法的结果不一致 #140

Open
yangyi423 opened this issue Mar 10, 2021 · 4 comments
Open

多次执行lpa算法的结果不一致 #140

yangyi423 opened this issue Mar 10, 2021 · 4 comments

Comments

@yangyi423
Copy link

3000万数据 在集群模式下运行 每次的结果都不一致,这是什么问题导致的呀?

@jsycdut
Copy link
Contributor

jsycdut commented Mar 12, 2021

lpa,指的是标签传播算法(Label Propagation Algorithm),在计算过程中,节点会将自己的标签更新为邻居中标签数量最多的标签,如果邻居中有多个标签的数量相同,那么就随机选一个标签作为自己的标签,正是这个随机选择,导致了lpa可能每次计算的结果都不一样。

@jsycdut
Copy link
Contributor

jsycdut commented Mar 12, 2021

并且这个和你的数据量是无关的,算法早已决定了它具有每次计算结果都不一致的特点(或者说缺点)

@actcwlf
Copy link
Contributor

actcwlf commented Mar 12, 2021

另外就是可以查看一下每次根据标签划分的顶点集合差异大不大

@abasi20
Copy link

abasi20 commented Nov 2, 2021

lpa算法是不确定的,可以看一下lpa的相关文献。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants