Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / aws/aws-k8s-tester issues and pull requests
#512 - chore(efa-device-plugin): Add P5en to EFA and RDMA supported instances
Pull Request -
State: closed - Opened by mattcjo 7 days ago
#511 - update consolidation policy
Pull Request -
State: closed - Opened by tzneal 12 days ago
#510 - wait longer for node termination
Pull Request -
State: closed - Opened by tzneal 13 days ago
#509 - Update nvidia_static_cluster_nodepool.yaml.template
Pull Request -
State: closed - Opened by Issacwww 14 days ago
#508 - Emit node metrics with multiple dimension sets
Pull Request -
State: closed - Opened by cartermckinnon 16 days ago
#507 - make the inbound security rule for SSH a no-op
Pull Request -
State: closed - Opened by tzneal 17 days ago
#506 - Fix typo
Pull Request -
State: closed - Opened by Issacwww 20 days ago
#505 - Add support for nccom test for neuron instances
Pull Request -
State: closed - Opened by Pavani-Panakanti 20 days ago - 2 comments
#504 - Add support for nvidia instance type when use static cluster
Pull Request -
State: closed - Opened by Issacwww 20 days ago
#503 - Update mult node nccl test
Pull Request -
State: closed - Opened by Pavani-Panakanti 27 days ago - 2 comments
#502 - Ensure node for static cluster
Pull Request -
State: closed - Opened by Issacwww 27 days ago
#501 - Add osDistro metric dimension
Pull Request -
State: closed - Opened by cartermckinnon about 1 month ago
#500 - Add Batch Optimization Scripts for Neuron Instances
Pull Request -
State: open - Opened by mattcjo about 1 month ago - 2 comments
#499 - Add support for static cluster
Pull Request -
State: closed - Opened by Issacwww about 1 month ago
#498 - Add Batch Optimization Scripts for NVIDIA Instances
Pull Request -
State: open - Opened by mattcjo about 2 months ago
#497 - Config MPI4 for EFA
Pull Request -
State: closed - Opened by Issacwww about 2 months ago
#496 - Config MPI5 for EFA
Pull Request -
State: closed - Opened by Issacwww about 2 months ago
#495 - Fix get_instance_type
Pull Request -
State: closed - Opened by Issacwww about 2 months ago
#494 - Fetch instance type with fallback
Pull Request -
State: closed - Opened by Issacwww about 2 months ago
#493 - fix codebuild
Pull Request -
State: closed - Opened by Issacwww about 2 months ago
#492 - Fix typo
Pull Request -
State: closed - Opened by Issacwww about 2 months ago
#491 - Collect node logs after --up, --down phases
Pull Request -
State: closed - Opened by cartermckinnon about 2 months ago
#490 - fix: Expose all 32 EFA interfaces on p5 launch template
Pull Request -
State: closed - Opened by bryantbiggs about 2 months ago
#489 - Fix Nvidia Image build
Pull Request -
State: closed - Opened by Issacwww about 2 months ago - 1 comment
#488 - Fix unit test
Pull Request -
State: closed - Opened by Issacwww about 2 months ago
#487 - chore: Update GPU Dockerfile versions
Pull Request -
State: closed - Opened by bryantbiggs about 2 months ago
#486 - Add support to emit metric to the target AMP
Pull Request -
State: open - Opened by weicongw 2 months ago - 1 comment
#485 - Bump Neuron SDK components versions
Pull Request -
State: closed - Opened by nkvetsinski 2 months ago
#484 - Add support to emit metric to the target AMP workspace
Pull Request -
State: closed - Opened by weicongw 2 months ago
#483 - Add support to emit metric to the target AMP workspace
Pull Request -
State: closed - Opened by weicongw 2 months ago
#482 - Opt in device plugin
Pull Request -
State: closed - Opened by Issacwww 2 months ago
#481 - Fix the AZs when creating subnets
Pull Request -
State: closed - Opened by weicongw 2 months ago
#480 - capacity-resevation requries efa
Pull Request -
State: open - Opened by Issacwww 2 months ago
#479 - Support Certs via Environement Variable
Pull Request -
State: open - Opened by Issacwww 3 months ago
#478 - Support additional certificate
Pull Request -
State: open - Opened by Issacwww 3 months ago
#477 - Add debug logging for e2e-nvidia setup
Pull Request -
State: closed - Opened by cartermckinnon 3 months ago
#476 - Remove unsupported instance types in isolated regions
Pull Request -
State: closed - Opened by ndbaker1 3 months ago
#475 - Add default instance types for managed nodegroups
Pull Request -
State: closed - Opened by cartermckinnon 3 months ago
#474 - Verify GPU Direct RDMA is used on supported instance.
Pull Request -
State: closed - Opened by weicongw 3 months ago
#473 - Verify GPU Direct RDMA is used on supported instance.
Pull Request -
State: closed - Opened by weicongw 3 months ago
#472 - Fix volume capacity issue
Pull Request -
State: closed - Opened by Issacwww 3 months ago
#471 - Enable EFA set up for bottlerocket
Pull Request -
State: closed - Opened by Issacwww 3 months ago
#470 - Add hpc benckmark to unit test, and add "capacity-reservation" flag to deployer
Pull Request -
State: closed - Opened by weicongw 3 months ago
#469 - Add --node-creation-timeout flag
Pull Request -
State: closed - Opened by cartermckinnon 3 months ago
#468 - Bump go version in kubetest2 image
Pull Request -
State: closed - Opened by ndbaker1 3 months ago - 1 comment
#467 - Add BERT e2e training test
Pull Request -
State: open - Opened by mattcjo 4 months ago
#466 - Add bert e2e test for neuron device
Pull Request -
State: closed - Opened by weicongw 4 months ago
#465 - Fix GetJobLogs and e2e-neuron binary not exits issue.
Pull Request -
State: closed - Opened by weicongw 4 months ago
#464 - Pull the logs when test finished and remove unnecessary resources requests and limits in the nccl test manifest
Pull Request -
State: closed - Opened by weicongw 4 months ago
#463 - replace `wait.WithTimeout(timeout)` with `wait.WithContext(ctx))`
Pull Request -
State: closed - Opened by weicongw 4 months ago
#462 - Increase cluster creation time out
Pull Request -
State: closed - Opened by Issacwww 4 months ago
#461 - Add bert e2e test for neuron device
Pull Request -
State: closed - Opened by weicongw 4 months ago - 1 comment
#460 - Add inference test e2e go binary to Dockerfile.kubetest2
Pull Request -
State: closed - Opened by mattcjo 4 months ago
#459 - Add BERT Inference Test
Pull Request -
State: closed - Opened by mattcjo 4 months ago
#458 - Use instance type from EC2 API instead of Node label
Pull Request -
State: closed - Opened by cartermckinnon 5 months ago
#457 - Add test case for unit test and delete the duplicated docker file.
Pull Request -
State: closed - Opened by weicongw 5 months ago
#456 - Add GPU unit test
Pull Request -
State: closed - Opened by weicongw 5 months ago
#455 - Add docker image for BERT e2e inference task
Pull Request -
State: closed - Opened by mattcjo 5 months ago - 3 comments
#454 - Add docker image for BERT e2e training task
Pull Request -
State: closed - Opened by mattcjo 5 months ago - 1 comment
#453 - Add --user-data-file option
Pull Request -
State: open - Opened by cartermckinnon 5 months ago
#452 - Add single node Neuron test to the e2e tester
Pull Request -
State: closed - Opened by weicongw 5 months ago
#451 - Add node metrics for time to register, ready
Pull Request -
State: closed - Opened by cartermckinnon 5 months ago
#450 - Add single node Neuron test to the e2e tester
Pull Request -
State: closed - Opened by weicongw 5 months ago - 1 comment
#449 - Determine default instance types based on AMI architecture
Pull Request -
State: closed - Opened by cartermckinnon 6 months ago
#448 - Remove InstancesDistribution from unmanaged nodegroup cfn
Pull Request -
State: closed - Opened by ndbaker1 6 months ago
#447 - Integrate multi-node nccl testing into the tester package
Pull Request -
State: closed - Opened by weicongw 6 months ago
#446 - Update aws-efa-nccl-tests docker file to the latest cuda and nccl version
Pull Request -
State: closed - Opened by weicongw 6 months ago - 1 comment
#445 - Add base docker image for EFA NCCL
Pull Request -
State: closed - Opened by weicongw 6 months ago
#444 - Add bottlerocket user data format
Pull Request -
State: closed - Opened by cartermckinnon 6 months ago
#443 - build(deps): bump golang.org/x/net from 0.22.0 to 0.23.0 in /kubetest2
Pull Request -
State: open - Opened by dependabot[bot] 7 months ago
Labels: dependencies
#442 - build(deps): bump golang.org/x/net from 0.22.0 to 0.23.0
Pull Request -
State: open - Opened by dependabot[bot] 7 months ago
Labels: dependencies
#441 - build(deps): bump golang.org/x/net from 0.17.0 to 0.23.0 in /e2e2
Pull Request -
State: open - Opened by dependabot[bot] 7 months ago
Labels: dependencies
#440 - fix ami-type flag
Pull Request -
State: closed - Opened by ndbaker1 8 months ago
#439 - build(deps): bump github.com/sigstore/cosign/v2 from 2.2.3 to 2.2.4 in /kubetest2
Pull Request -
State: open - Opened by dependabot[bot] 8 months ago
Labels: dependencies
#438 - accept amitype in kubetest2 managed nodegroup creation
Pull Request -
State: closed - Opened by ndbaker1 8 months ago - 1 comment
#437 - build(deps): bump github.com/docker/docker from 25.0.4+incompatible to 25.0.5+incompatible in /kubetest2
Pull Request -
State: open - Opened by dependabot[bot] 8 months ago
Labels: dependencies
#436 - build(deps): bump github.com/docker/docker from 24.0.7+incompatible to 24.0.9+incompatible
Pull Request -
State: open - Opened by dependabot[bot] 8 months ago
Labels: dependencies
#435 - Update dependencies
Pull Request -
State: closed - Opened by tzneal 8 months ago
#434 - build(deps): bump github.com/docker/docker from 20.10.17+incompatible to 20.10.27+incompatible
Pull Request -
State: closed - Opened by dependabot[bot] 8 months ago
Labels: dependencies
#433 - build(deps): bump helm.sh/helm/v3 from 3.9.2 to 3.14.3
Pull Request -
State: closed - Opened by dependabot[bot] 8 months ago - 1 comment
Labels: dependencies
#432 - build(deps): bump google.golang.org/protobuf from 1.30.0 to 1.33.0
Pull Request -
State: closed - Opened by dependabot[bot] 8 months ago
Labels: dependencies
#431 - build(deps): bump google.golang.org/protobuf from 1.31.0 to 1.33.0 in /kubetest2
Pull Request -
State: closed - Opened by dependabot[bot] 8 months ago
Labels: dependencies
#430 - build(deps): bump google.golang.org/protobuf from 1.30.0 to 1.33.0 in /e2e2
Pull Request -
State: closed - Opened by dependabot[bot] 8 months ago
Labels: dependencies
#429 - build(deps): bump github.com/go-jose/go-jose/v3 from 3.0.0 to 3.0.3 in /kubetest2
Pull Request -
State: closed - Opened by dependabot[bot] 8 months ago
Labels: dependencies
#428 - Remove replace directives from go.mod, remove workspace files
Pull Request -
State: closed - Opened by cartermckinnon 9 months ago
#427 - Add EFA NCCL test case, unmanaged nodegroup template
Pull Request -
State: closed - Opened by cartermckinnon 9 months ago
#426 - build(deps): bump helm.sh/helm/v3 from 3.9.2 to 3.14.2
Pull Request -
State: closed - Opened by dependabot[bot] 9 months ago - 1 comment
Labels: dependencies
#425 - add ulimit test in Dockerfile.kubetest2
Pull Request -
State: closed - Opened by wwvela 9 months ago
#424 - build(deps): bump helm.sh/helm/v3 from 3.9.2 to 3.14.1
Pull Request -
State: closed - Opened by dependabot[bot] 10 months ago - 1 comment
Labels: dependencies
#423 - Add m6i.xlarge to default instance types
Pull Request -
State: closed - Opened by Issacwww 10 months ago
#422 - Add `--tune-vpc-cni` option to eksapi deployer
Pull Request -
State: closed - Opened by cartermckinnon 10 months ago
#421 - Add addon management to eksapi deployer
Pull Request -
State: closed - Opened by cartermckinnon 10 months ago
#420 - add e2e tests to check resource limits using ulimit
Pull Request -
State: closed - Opened by wwvela 10 months ago
#419 - add node-name-strategy option
Pull Request -
State: closed - Opened by Issacwww 10 months ago
#418 - Add `--user-data-format` option, nodeadm support
Pull Request -
State: closed - Opened by cartermckinnon 11 months ago
#417 - Filter ENIs based on VPC CNI applied tag
Pull Request -
State: closed - Opened by cartermckinnon 11 months ago
#416 - Delete leaked ENIs before cluster
Pull Request -
State: closed - Opened by cartermckinnon 11 months ago
#415 - Remove AMISSMParameter option from unmanaged nodegroup template
Pull Request -
State: closed - Opened by cartermckinnon 11 months ago
#414 - Add required tags to ASG for 1.25-
Pull Request -
State: closed - Opened by cartermckinnon 11 months ago
#413 - build(deps): bump github.com/cloudflare/circl from 1.1.0 to 1.3.7 in /kubetest2
Pull Request -
State: closed - Opened by dependabot[bot] 11 months ago - 1 comment
Labels: dependencies