Skip to content
This repository has been archived by the owner on Sep 30, 2022. It is now read-only.

Topic/v1.10/coll fixes #1262

Merged
merged 11 commits into from
Jul 28, 2016
Merged

Conversation

ggouaillardet
Copy link
Contributor

No description provided.

bosilca and others added 11 commits July 11, 2016 09:11
some of the collective modules. Added a new function
opan_datatype_span, to compute the memory span of
count number of datatype, excluding the gaps in the
beginning and at the end. If a memory allocation is
made using the returned value, the gap (also returned)
should be removed from the allocated pointer.

(back-ported from commit open-mpi/ompi@4d00c59)
Offset temporary buffer when a non zero lower bound datatype is used.

Thanks Hristo Iliev for the report

(back-ported from commit open-mpi/ompi@0e39319)
…duce_intra_basic_linear()

Thanks Yuki Matsumoto for the report

(back-ported from commit open-mpi/ompi@c06fb04)
 - correctly handle non zero lower bound ddt
 - correctly handle ddt with size > extent

Thanks Yuki Matsumoto for the report

(back-ported from commit open-mpi/ompi@488d037)
 - correctly handle non zero lower bound ddt
 - correctly handle ddt with size > extent

Thanks Yuki Matsumoto for the report

(back-ported from commit open-mpi/ompi@3e559a1)
 - correctly handle non commutative operators
 - correctly handle non zero lower bound ddt
 - correctly handle ddt with size > extent
 - revamp NBC_Sched_op so it takes two buffers and matches ompi_op_reduce semantic
 - various fix for inter communicators

Thanks Yuki Matsumoto for the report

(back-ported from commit open-mpi/ompi@678d086)
as reported by Coverity with CIDs 1363349-1363362
(back-ported from commit open-mpi/ompi@7b8094a)
…e_alltoallv_intra_basic_inplace()

(back-ported from commit open-mpi/ompi@a55d574)
…reduce_scatter_inter()

this is now useless since the scatter is done via the local communicator

(back-ported from commit open-mpi/ompi@1462450)
@mellanox-github
Copy link

Test FAILed.
See http://bgate.mellanox.com/jenkins/job/gh-ompi-release-pr/1842/ for details.

@ggouaillardet
Copy link
Contributor Author

:bot:assign: @bosilca
:bot🏷️bug
:bot:milestone:v1.10.4

@bosilca
Copy link
Member

bosilca commented Jul 11, 2016

👍

@jsquyres
Copy link
Member

@jladd-mlnx @artpol84 Looks like Mellanox Jenkins timed out on the thread tests. But I see that the --bind-to none fix was applied there. Are the thread tests relevant to run on the v1.10 branch?

@artpol84
Copy link
Contributor

Yes for the yalla I think.
I noticed that some tests are really slow progressing and the runtime was
close to timeout value. I can increase it for now. With to check the reason
of the slowness when I'll have the time.

понедельник, 11 июля 2016 г. пользователь Jeff Squyres написал:

@jladd-mlnx https://github.com/jladd-mlnx @artpol84
https://github.com/artpol84 Looks like Mellanox Jenkins timed out on
the thread tests. But I see that the --bind-to none fix was applied
there. Are the thread tests relevant to run on the v1.10 branch?


You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
#1262 (comment),
or mute the thread
https://github.com/notifications/unsubscribe/AHL5PmZiP72a7drQ9MDWmrW0LhR65zGNks5qUi7kgaJpZM4JJIeS
.


Best regards, Artem Polyakov
(Mobile mail)

@artpol84
Copy link
Contributor

artpol84 commented Jul 11, 2016

Indeed those are high latencies. Maybe related to non-bounded run:

12:16:11 Size (bytes) Time (us)
12:16:31 0 9906.442500
12:16:50 16 9628.991000
12:17:10 32 9732.075000
12:17:30 48 10289.908500
12:17:51 64 10521.495500
12:18:13 80 10696.196000
12:18:33 96 10311.793000
12:18:54 112 10442.492000
12:19:15 128 10665.487000
12:19:35 144 9607.496000
12:19:54 160 9481.857500
12:20:14 176 10232.132000
12:20:36 192 10754.497500
12:20:56 208 10296.493500
12:21:15 224 9524.908000
12:21:35 240 10106.571500
12:21:57 256 10715.749500
12:22:18 320 10668.747000
12:22:40 384 10953.499000
12:23:02 448 10758.986500
12:23:23 512 10828.990000
12:23:44 576 10571.493500
12:24:06 640 10701.493000
12:24:26 704 10335.995500
12:24:45 768 9506.994000
12:25:07 832 10707.497000
12:25:28 896 10694.498000
12:25:50 960 10679.993000

I'll check ASAP

@artpol84
Copy link
Contributor

Bot:retest

@jsquyres
Copy link
Member

@artpol84 The thread tests might not be relevant for the v1.10 branch. Only in master / v2.x did we do all the correctness and performance improvements for THREAD_MULTIPLE.

@jsquyres
Copy link
Member

bot:assign: @bosilca
bot:label:bug
bot:milestone:v1.10.4

@ompiteam-bot
Copy link

OMPIBot error: Milestone "v1.10.4" does not exist.

@jsquyres jsquyres added this to the 1.10.4 milestone Jul 11, 2016
@mellanox-github
Copy link

Test FAILed.
See http://bgate.mellanox.com/jenkins/job/gh-ompi-release-pr/1843/ for details.

@artpol84
Copy link
Contributor

bot:retest

@mellanox-github
Copy link

Test FAILed.
See http://bgate.mellanox.com/jenkins/job/gh-ompi-release-pr/1884/ for details.

@Di0gen
Copy link

Di0gen commented Jul 17, 2016

bot:retest

@mellanox-github
Copy link

Test FAILed.
See http://bgate.mellanox.com/jenkins/job/gh-ompi-release-pr/1887/ for details.

@artpol84
Copy link
Contributor

bot:retest

@artpol84
Copy link
Contributor

bot:retest

@mellanox-github
Copy link

Test FAILed.
See http://bgate.mellanox.com/jenkins/job/gh-ompi-release-pr/1889/ for details.

@mellanox-github
Copy link

Test PASSed.
See http://bgate.mellanox.com/jenkins/job/gh-ompi-release-pr/1890/ for details.

@bosilca
Copy link
Member

bosilca commented Jul 19, 2016

👍

@jsquyres
Copy link
Member

bot:mellanox:retest

@mellanox-github
Copy link

Test PASSed.
See http://bgate.mellanox.com/jenkins/job/gh-ompi-release-pr/1974/ for details.

@rhc54 rhc54 merged commit 640bcf6 into open-mpi:v1.10 Jul 28, 2016
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

8 participants