Ecosyste.ms: Issues

An open API service that provides issue and pull request metadata for open source projects.

GitHub / aws/aws-k8s-tester issues and pull requests

#511 - update consolidation policy

Pull Request - State: closed - Opened by tzneal 12 days ago

#510 - wait longer for node termination

Pull Request - State: closed - Opened by tzneal 13 days ago

#509 - Update nvidia_static_cluster_nodepool.yaml.template

Pull Request - State: closed - Opened by Issacwww 14 days ago

#508 - Emit node metrics with multiple dimension sets

Pull Request - State: closed - Opened by cartermckinnon 16 days ago

#507 - make the inbound security rule for SSH a no-op

Pull Request - State: closed - Opened by tzneal 17 days ago

#506 - Fix typo

Pull Request - State: closed - Opened by Issacwww 20 days ago

#505 - Add support for nccom test for neuron instances

Pull Request - State: closed - Opened by Pavani-Panakanti 20 days ago - 2 comments

#504 - Add support for nvidia instance type when use static cluster

Pull Request - State: closed - Opened by Issacwww 20 days ago

#503 - Update mult node nccl test

Pull Request - State: closed - Opened by Pavani-Panakanti 27 days ago - 2 comments

#502 - Ensure node for static cluster

Pull Request - State: closed - Opened by Issacwww 27 days ago

#501 - Add osDistro metric dimension

Pull Request - State: closed - Opened by cartermckinnon about 1 month ago

#500 - Add Batch Optimization Scripts for Neuron Instances

Pull Request - State: open - Opened by mattcjo about 1 month ago - 2 comments

#499 - Add support for static cluster

Pull Request - State: closed - Opened by Issacwww about 1 month ago

#498 - Add Batch Optimization Scripts for NVIDIA Instances

Pull Request - State: open - Opened by mattcjo about 2 months ago

#497 - Config MPI4 for EFA

Pull Request - State: closed - Opened by Issacwww about 2 months ago

#496 - Config MPI5 for EFA

Pull Request - State: closed - Opened by Issacwww about 2 months ago

#495 - Fix get_instance_type

Pull Request - State: closed - Opened by Issacwww about 2 months ago

#494 - Fetch instance type with fallback

Pull Request - State: closed - Opened by Issacwww about 2 months ago

#493 - fix codebuild

Pull Request - State: closed - Opened by Issacwww about 2 months ago

#492 - Fix typo

Pull Request - State: closed - Opened by Issacwww about 2 months ago

#491 - Collect node logs after --up, --down phases

Pull Request - State: closed - Opened by cartermckinnon about 2 months ago

#490 - fix: Expose all 32 EFA interfaces on p5 launch template

Pull Request - State: closed - Opened by bryantbiggs about 2 months ago

#489 - Fix Nvidia Image build

Pull Request - State: closed - Opened by Issacwww about 2 months ago - 1 comment

#488 - Fix unit test

Pull Request - State: closed - Opened by Issacwww about 2 months ago

#487 - chore: Update GPU Dockerfile versions

Pull Request - State: closed - Opened by bryantbiggs about 2 months ago

#486 - Add support to emit metric to the target AMP

Pull Request - State: open - Opened by weicongw 2 months ago - 1 comment

#485 - Bump Neuron SDK components versions

Pull Request - State: closed - Opened by nkvetsinski 2 months ago

#484 - Add support to emit metric to the target AMP workspace

Pull Request - State: closed - Opened by weicongw 2 months ago

#483 - Add support to emit metric to the target AMP workspace

Pull Request - State: closed - Opened by weicongw 2 months ago

#482 - Opt in device plugin

Pull Request - State: closed - Opened by Issacwww 2 months ago

#481 - Fix the AZs when creating subnets

Pull Request - State: closed - Opened by weicongw 2 months ago

#480 - capacity-resevation requries efa

Pull Request - State: open - Opened by Issacwww 2 months ago

#479 - Support Certs via Environement Variable

Pull Request - State: open - Opened by Issacwww 3 months ago

#478 - Support additional certificate

Pull Request - State: open - Opened by Issacwww 3 months ago

#477 - Add debug logging for e2e-nvidia setup

Pull Request - State: closed - Opened by cartermckinnon 3 months ago

#476 - Remove unsupported instance types in isolated regions

Pull Request - State: closed - Opened by ndbaker1 3 months ago

#475 - Add default instance types for managed nodegroups

Pull Request - State: closed - Opened by cartermckinnon 3 months ago

#474 - Verify GPU Direct RDMA is used on supported instance.

Pull Request - State: closed - Opened by weicongw 3 months ago

#473 - Verify GPU Direct RDMA is used on supported instance.

Pull Request - State: closed - Opened by weicongw 3 months ago

#472 - Fix volume capacity issue

Pull Request - State: closed - Opened by Issacwww 3 months ago

#471 - Enable EFA set up for bottlerocket

Pull Request - State: closed - Opened by Issacwww 3 months ago

#469 - Add --node-creation-timeout flag

Pull Request - State: closed - Opened by cartermckinnon 3 months ago

#468 - Bump go version in kubetest2 image

Pull Request - State: closed - Opened by ndbaker1 3 months ago - 1 comment

#467 - Add BERT e2e training test

Pull Request - State: open - Opened by mattcjo 4 months ago

#466 - Add bert e2e test for neuron device

Pull Request - State: closed - Opened by weicongw 4 months ago

#465 - Fix GetJobLogs and e2e-neuron binary not exits issue.

Pull Request - State: closed - Opened by weicongw 4 months ago

#463 - replace `wait.WithTimeout(timeout)` with `wait.WithContext(ctx))`

Pull Request - State: closed - Opened by weicongw 4 months ago

#462 - Increase cluster creation time out

Pull Request - State: closed - Opened by Issacwww 4 months ago

#461 - Add bert e2e test for neuron device

Pull Request - State: closed - Opened by weicongw 4 months ago - 1 comment

#460 - Add inference test e2e go binary to Dockerfile.kubetest2

Pull Request - State: closed - Opened by mattcjo 4 months ago

#459 - Add BERT Inference Test

Pull Request - State: closed - Opened by mattcjo 4 months ago

#458 - Use instance type from EC2 API instead of Node label

Pull Request - State: closed - Opened by cartermckinnon 5 months ago

#456 - Add GPU unit test

Pull Request - State: closed - Opened by weicongw 5 months ago

#455 - Add docker image for BERT e2e inference task

Pull Request - State: closed - Opened by mattcjo 5 months ago - 3 comments

#454 - Add docker image for BERT e2e training task

Pull Request - State: closed - Opened by mattcjo 5 months ago - 1 comment

#453 - Add --user-data-file option

Pull Request - State: open - Opened by cartermckinnon 5 months ago

#452 - Add single node Neuron test to the e2e tester

Pull Request - State: closed - Opened by weicongw 5 months ago

#451 - Add node metrics for time to register, ready

Pull Request - State: closed - Opened by cartermckinnon 5 months ago

#450 - Add single node Neuron test to the e2e tester

Pull Request - State: closed - Opened by weicongw 5 months ago - 1 comment

#449 - Determine default instance types based on AMI architecture

Pull Request - State: closed - Opened by cartermckinnon 6 months ago

#448 - Remove InstancesDistribution from unmanaged nodegroup cfn

Pull Request - State: closed - Opened by ndbaker1 6 months ago

#447 - Integrate multi-node nccl testing into the tester package

Pull Request - State: closed - Opened by weicongw 6 months ago

#446 - Update aws-efa-nccl-tests docker file to the latest cuda and nccl version

Pull Request - State: closed - Opened by weicongw 6 months ago - 1 comment

#445 - Add base docker image for EFA NCCL

Pull Request - State: closed - Opened by weicongw 6 months ago

#444 - Add bottlerocket user data format

Pull Request - State: closed - Opened by cartermckinnon 6 months ago

#443 - build(deps): bump golang.org/x/net from 0.22.0 to 0.23.0 in /kubetest2

Pull Request - State: open - Opened by dependabot[bot] 7 months ago
Labels: dependencies

#442 - build(deps): bump golang.org/x/net from 0.22.0 to 0.23.0

Pull Request - State: open - Opened by dependabot[bot] 7 months ago
Labels: dependencies

#441 - build(deps): bump golang.org/x/net from 0.17.0 to 0.23.0 in /e2e2

Pull Request - State: open - Opened by dependabot[bot] 7 months ago
Labels: dependencies

#440 - fix ami-type flag

Pull Request - State: closed - Opened by ndbaker1 8 months ago

#439 - build(deps): bump github.com/sigstore/cosign/v2 from 2.2.3 to 2.2.4 in /kubetest2

Pull Request - State: open - Opened by dependabot[bot] 8 months ago
Labels: dependencies

#438 - accept amitype in kubetest2 managed nodegroup creation

Pull Request - State: closed - Opened by ndbaker1 8 months ago - 1 comment

#435 - Update dependencies

Pull Request - State: closed - Opened by tzneal 8 months ago

#433 - build(deps): bump helm.sh/helm/v3 from 3.9.2 to 3.14.3

Pull Request - State: closed - Opened by dependabot[bot] 8 months ago - 1 comment
Labels: dependencies

#432 - build(deps): bump google.golang.org/protobuf from 1.30.0 to 1.33.0

Pull Request - State: closed - Opened by dependabot[bot] 8 months ago
Labels: dependencies

#431 - build(deps): bump google.golang.org/protobuf from 1.31.0 to 1.33.0 in /kubetest2

Pull Request - State: closed - Opened by dependabot[bot] 8 months ago
Labels: dependencies

#430 - build(deps): bump google.golang.org/protobuf from 1.30.0 to 1.33.0 in /e2e2

Pull Request - State: closed - Opened by dependabot[bot] 8 months ago
Labels: dependencies

#429 - build(deps): bump github.com/go-jose/go-jose/v3 from 3.0.0 to 3.0.3 in /kubetest2

Pull Request - State: closed - Opened by dependabot[bot] 8 months ago
Labels: dependencies

#427 - Add EFA NCCL test case, unmanaged nodegroup template

Pull Request - State: closed - Opened by cartermckinnon 9 months ago

#426 - build(deps): bump helm.sh/helm/v3 from 3.9.2 to 3.14.2

Pull Request - State: closed - Opened by dependabot[bot] 9 months ago - 1 comment
Labels: dependencies

#425 - add ulimit test in Dockerfile.kubetest2

Pull Request - State: closed - Opened by wwvela 9 months ago

#424 - build(deps): bump helm.sh/helm/v3 from 3.9.2 to 3.14.1

Pull Request - State: closed - Opened by dependabot[bot] 10 months ago - 1 comment
Labels: dependencies

#423 - Add m6i.xlarge to default instance types

Pull Request - State: closed - Opened by Issacwww 10 months ago

#422 - Add `--tune-vpc-cni` option to eksapi deployer

Pull Request - State: closed - Opened by cartermckinnon 10 months ago

#421 - Add addon management to eksapi deployer

Pull Request - State: closed - Opened by cartermckinnon 10 months ago

#420 - add e2e tests to check resource limits using ulimit

Pull Request - State: closed - Opened by wwvela 10 months ago

#419 - add node-name-strategy option

Pull Request - State: closed - Opened by Issacwww 10 months ago

#418 - Add `--user-data-format` option, nodeadm support

Pull Request - State: closed - Opened by cartermckinnon 11 months ago

#417 - Filter ENIs based on VPC CNI applied tag

Pull Request - State: closed - Opened by cartermckinnon 11 months ago

#416 - Delete leaked ENIs before cluster

Pull Request - State: closed - Opened by cartermckinnon 11 months ago

#414 - Add required tags to ASG for 1.25-

Pull Request - State: closed - Opened by cartermckinnon 11 months ago

#413 - build(deps): bump github.com/cloudflare/circl from 1.1.0 to 1.3.7 in /kubetest2

Pull Request - State: closed - Opened by dependabot[bot] 11 months ago - 1 comment
Labels: dependencies