AWS ParallelCluster v2.5.1
demartinofra
released this
13 Dec 16:35
·
16 commits
to release-2.5
since this release
We're excited to announce the release of AWS ParallelCluster 2.5.1.
Upgrade
How to upgrade?
sudo pip install --upgrade aws-parallelcluster
Enhancements
- Add
--show-url
flag topcluster dcv connect
command in order to generate a one-time URL that can be used to start a DCV session. This unblocks the usage of DCV when the browser cannot be launched automatically.
Changes
- Upgrade CUDA library to version 10.2.
- Using a Placement Group is not required anymore but highly recommended when enabling EFA.
- Increase default root volume size in Centos 6 AMI to 25GB.
- Increase the retention of CloudWatch logs produced when generating AWS Batch Docker images from 1 to 14 days.
- Increase the total time allowed to build Docker images from 20 minutes to 30 minutes. This is done to better deal with slow networking in China regions.
- Upgrade EFA installer to version 1.7.1:
- Kernel module:
efa-1.4.1
- RDMA core:
rdma-core-25.0
- Libfabric:
libfabric-aws-1.8.1amzn1.3
- Open MPI:
openmpi40-aws-4.0.2
- Kernel module:
Bug Fixes
- Fix installation of NVIDIA drivers on Ubuntu 18.
- Fix installation of CUDA toolkit on Centos 6.
- Fix invalid default value for
spot_price
. - Fix issue that was preventing the cluster from being created in VPCs configured with multiple CIDR blocks.
- Correctly handle failures when retrieving ASG in
pcluster instances
command. - Fix the default mount dir when a single EBS volume is specified through a dedicated ebs configuration section.
- Correctly handle failures when there is an invalid parameter in the
aws
config section. - Fix a bug in
pcluster delete
that was causing the cli to exit with error when the cluster is successfully deleted. - Exit with status code 1 if
pcluster create
fails to create a stack. - Better handle the case of multiple or no network interfaces on FSX filesystems.
- Fix
pcluster configure
to retain default values from old config file. - Fix bug in sqswatcher that was causing the daemon to fail when more than 100 DynamoDB tables are present in the cluster region.
- Fix installation of Munge on Amazon Linux, Centos 6, Centos 7 and Ubuntu 16.
Support
Need help / have a feature request?
AWS Support: https://console.aws.amazon.com/support/home
ParallelCluster Issues tracker on GitHub: https://github.com/aws/aws-parallelcluster
The HPC Forum on the AWS Forums page: https://forums.aws.amazon.com/forum.jspa?forumID=192