-
Notifications
You must be signed in to change notification settings - Fork 860
WeeklyTelcon_20190903
Geoffrey Paulsen edited this page Sep 3, 2019
·
1 revision
- Dialup Info: (Do not post to public mailing list or public wiki)
- Brendan Cunningham (Intel)
- Edgar Gabriel (UH)
- Erik Zeiske
- Geoffrey Paulsen (IBM)
- Harumi Kuno
- Jeff Squyres (Cisco)
- Ralph Castain (Intel)
- Todd Kordenbrock (Sandia)
- Howard Pritchard (LANL)
- Tom Naughton
- Akshay Venkatesh (NVIDIA)
- Harumi Kuno (HPE)
- Matthew Dosanjh (Sandia)
- Michael Heinz (Intel)
- Artem Polyakov (Mellanox)
- Brandon Yates (Intel)
- Josh Hursey (IBM)
- Brian Barrett (AWS)
- David Bernhold (ORNL)
- George Bosilca (UTK)
- Joshua Ladd (Mellanox)
- Mark Allen (IBM)
- Matias Cabral (Intel)
- Nathan Hjelm (Google)
- Noah Evans (Sandia)
- Thomas Naughton (ORNL)
- Xin Zhao (Mellanox)
- mohan (AWS)
- PR6844 - Want to test if this affects containers.
- Worth the question, don't see any reason not to take this.
- Jeff will review and add comments.
- Howard will do some testing and talk to Charlie-cloud.
- No update (Brian on vacation)
- Merged
--recurse-submodules
update intoompi-scripts
Jenkins script as first step. Let's see if that works.
Blockers All Open Blockers
Review v3.0.x Milestones v3.0.4
- No new news
Review v3.1.x Milestones v3.1.4
- PR6556 and PR 6621 should go to the v3.x release branches.
- No new news
Review v4.0.x Milestones v4.0.2
-
Still have some issues; we expect to still have to do an rc2, e.g., https://github.com/open-mpi/ompi/issues/6932.
-
Discuss Issue 6568 - large messages overwhelm put
- This SHOULD stay as a blocker, since it ends in hang.
- We need to look for a workaround.
- Could disable put completely.
- Could use an opal_unlikely check of message-size, and only then kick it back if the message size is too large.
- OB1 tries put / get, and if these don't work, it falls back to send/recv.?
- possibly a flaw in put itself.
- Jeff will ask george what would be viable workaround, and identify.
- Not signing up to implement.
-
PR6942 - ready to merge.
-
https://github.com/open-mpi/ompi/issues/6949 - Geoff (and others please review)
-
MTT failures in Generic Simple unpack on v4.0.x - segfaults, assertions.
- DDT-unpack assertion on v4.0.x
Review Master Master Pull Requests
- IBM's PGI test has NEVER worked. Is it a real issue or local to IBM.
- nVidia bought PGI, perhaps someone there could take a look?
- Akshay said he'd talk to a PGI person at nVidia to see.
- Edgar mentioned that Mark Allen should rebase PR6756 and get that in to resolve an issue another customer is seeing.
- Cray running into problems again. :frown:
- Back on track.
- No discussion this week.
- See older weekday notes for prior items.
- No discussion this week.
- See older weekday notes for prior items.
- No discussion this week.
- See older weekday notes for prior items.
- IBM has to triage some failures on master and v4.0.x and some test build issues. Josh Hursey thought they might be accidentally mixing XLC and PGI compilers. Will investigate.
- Cisco has a build failure to investigate.