Ecosyste.ms: Issues
An open API service that provides issue and pull request metadata for open source projects.
GitHub / awslabs/benchmark-ai issues and pull requests
#1055 - nit change
Pull Request -
State: open - Opened by tejaschumbalkar over 2 years ago
#1054 - Update version of terraform and EKS cluster
Pull Request -
State: closed - Opened by Chancebair over 4 years ago
- 1 comment
#1053 - Updating Versions
Pull Request -
State: closed - Opened by Chancebair over 4 years ago
#1052 - Anubis-setup outputs var
Issue -
State: closed - Opened by ryansteakley over 4 years ago
- 1 comment
#1051 - Misc Housekeeping and bug fixing
Pull Request -
State: open - Opened by gavinmbell over 4 years ago
- 1 comment
#1050 - [sm-metrics] SM metrics using descriptor output.metrics
Pull Request -
State: closed - Opened by ryansteakley over 4 years ago
#1049 - [feature] Automatic dashboard creation
Pull Request -
State: closed - Opened by ryansteakley over 4 years ago
- 1 comment
#1048 - [sm-executor][feature] Merge metrics
Pull Request -
State: closed - Opened by ryansteakley over 4 years ago
#1047 - [feature] Sagemaker training job metrics
Pull Request -
State: closed - Opened by ryansteakley over 4 years ago
- 2 comments
#1046 - Documentation Cleanup
Pull Request -
State: closed - Opened by gavinmbell over 4 years ago
#1045 - removing duplicates
Pull Request -
State: closed - Opened by tejaschumbalkar over 4 years ago
#1044 - [feature] Custom-parameters (hyperparameters)
Pull Request -
State: closed - Opened by ryansteakley over 4 years ago
#1043 - Implementing suggested changes from #1041
Issue -
State: closed - Opened by tejaschumbalkar over 4 years ago
- 1 comment
#1042 - [SM-EXECUTOR][FIX] Pin SageMaker to < v2.0
Pull Request -
State: closed - Opened by ryansteakley over 4 years ago
#1041 - Mpijob migration
Pull Request -
State: closed - Opened by tejaschumbalkar over 4 years ago
- 3 comments
#1040 - Reverting #974
Pull Request -
State: closed - Opened by tejaschumbalkar over 4 years ago
- 2 comments
#1039 - fixing logging TypeError exception
Pull Request -
State: closed - Opened by tejaschumbalkar over 4 years ago
- 1 comment
#1038 - Removed ksonnet. Use kubeflow mpi-operator instead.
Pull Request -
State: closed - Opened by tejaschumbalkar over 4 years ago
#1037 - Test failures for job-status-trigger and sm-executor modules
Issue -
State: closed - Opened by tejaschumbalkar over 4 years ago
- 2 comments
#1036 - Upgrade MPI operator and k8s config templates
Pull Request -
State: closed - Opened by tejaschumbalkar over 4 years ago
- 4 comments
#1035 - Is there a definitive FAIL / SUCCEED event status?
Issue -
State: open - Opened by gavinmbell over 4 years ago
- 1 comment
Labels: enhancement, good first issue, bai-client
#1034 - Removed ksonnet. Use kubeflow mpi-operator instead.
Pull Request -
State: closed - Opened by SergTogul over 4 years ago
- 5 comments
#1033 - [feature] Add --terminate command to allow for smart-exiting of the watcher
Pull Request -
State: closed - Opened by ryansteakley over 4 years ago
- 1 comment
#1032 - [Feature] Get/Set client-id through commandline
Pull Request -
State: closed - Opened by ryansteakley over 4 years ago
- 3 comments
#1031 - [feature] Add command line option to explicitly whitelist CIDR blocks
Pull Request -
State: closed - Opened by ryansteakley over 4 years ago
- 3 comments
#1030 - Performance drop on single node cpu run - vanilla EC2 vs Anubis
Issue -
State: open - Opened by surajkota over 4 years ago
Labels: p0
#1029 - Fix Flake8 update and pin version
Pull Request -
State: closed - Opened by Chancebair over 4 years ago
- 2 comments
#1028 - Fix Flake8 update and pin version
Pull Request -
State: closed - Opened by Chancebair over 4 years ago
- 1 comment
#1027 - [Improvement] Remove --ignore=E902 in python-common.mk
Issue -
State: open - Opened by Chancebair over 4 years ago
Labels: good first issue, clean-up
#1026 - [Bug] Lint Failures on Build Stage
Issue -
State: closed - Opened by Chancebair over 4 years ago
Labels: bug
#1025 - Custom toml file for EKS performance test for TF is not running
Issue -
State: open - Opened by TusharKanekiDey over 4 years ago
- 2 comments
Labels: p0
#1024 - [Bug] [Baictl] Codepipeline Source Verification step
Issue -
State: open - Opened by Chancebair over 4 years ago
Labels: bug
#1023 - In Makefile use PROJECT in all places where feasible
Pull Request -
State: closed - Opened by gavinmbell almost 5 years ago
- 1 comment
#1022 - [Install] [Bug] ksonnet install broken
Issue -
State: open - Opened by Chancebair almost 5 years ago
- 1 comment
Labels: bug, baictl, infrastructure, p0
#1021 - fix: missing script-mode functionality
Pull Request -
State: closed - Opened by surajkota about 5 years ago
#1020 - change: make task_name mandatory for labels
Pull Request -
State: closed - Opened by surajkota about 5 years ago
- 1 comment
#1019 - fix format bugs
Pull Request -
State: closed - Opened by YangFei1990 about 5 years ago
#1018 - Add FSx into the test yamls
Pull Request -
State: closed - Opened by YangFei1990 about 5 years ago
#1017 - Add FSx support for yaml template
Pull Request -
State: closed - Opened by YangFei1990 about 5 years ago
#1016 - [Executor] script mode not working
Issue -
State: open - Opened by surajkota about 5 years ago
- 1 comment
Labels: service:executor, p0
#1015 - Plotting graphs for cron jobs
Issue -
State: open - Opened by surajkota about 5 years ago
- 5 comments
Labels: bug, p0, RM:user-metrics, RM:AWS-CloudWatchExport
#1014 - Use FSx for data in Anubis
Issue -
State: open - Opened by haohanchen-yagao about 5 years ago
Labels: enhancement
#1013 - Make anubis endpoint reachable from Native AWS
Issue -
State: closed - Opened by surajkota about 5 years ago
- 8 comments
Labels: bug, infrastructure, blocked, p0
#1012 - [SM-Executor] Adds support for 'args' for SageMaker benchmarks
Pull Request -
State: closed - Opened by perdasilva about 5 years ago
- 3 comments
#1011 - Race condition for iam perms vs container initialization.
Issue -
State: open - Opened by gavinmbell about 5 years ago
Labels: p2
#1010 - Set maximum of 8 custom labels in descriptor schema
Pull Request -
State: closed - Opened by jlcontreras about 5 years ago
#1009 - Limit size of info.labels to 8
Issue -
State: closed - Opened by jlcontreras about 5 years ago
Labels: bai-bff
#1008 - Fix metrics-pusher support for custom labels
Pull Request -
State: closed - Opened by jlcontreras about 5 years ago
#1007 - Use toml literal strings to declare metric patterns
Pull Request -
State: closed - Opened by jlcontreras about 5 years ago
#1006 - [Executor] Re-enable script mode integration test
Issue -
State: open - Opened by perdasilva about 5 years ago
Labels: bug, service:executor
#1005 - Using IAM roles instead or IAM users for setting up and accessing anubis
Issue -
State: open - Opened by surajkota about 5 years ago
Labels: infrastructure, documentation, p0
#1004 - [Workaround] Puller polls s3 to ensure credentials
Pull Request -
State: closed - Opened by perdasilva about 5 years ago
#1003 - [Feature] Show me all the currently running benchmarks...
Issue -
State: open - Opened by gavinmbell about 5 years ago
- 1 comment
Labels: enhancement, bai-bff, bai-client
#1002 - [bff][bugfix] tiny space bug stopping --sync-data flag from working
Pull Request -
State: closed - Opened by gavinmbell about 5 years ago
#1001 - Add system logs to increase visibility
Issue -
State: open - Opened by surajkota about 5 years ago
- 2 comments
Labels: enhancement
#1000 - [Feature] Enhancement to list-jobs
Issue -
State: open - Opened by surajkota about 5 years ago
Labels: enhancement, bai-bff, p2
#999 - TOML info for metric labels
Issue -
State: open - Opened by gavinmbell about 5 years ago
- 2 comments
Labels: RM:user-metrics
#998 - [SageMaker - exec] To pass more configuration information to SageMaker through TOML.
Issue -
State: open - Opened by gavinmbell about 5 years ago
- 1 comment
Labels: service:executor
#997 - No Limit for ES log query returns
Issue -
State: open - Opened by gavinmbell about 5 years ago
Labels: enhancement, good first issue, bai-bff
#996 - [Improvement] Increase error state visibility to end users
Issue -
State: open - Opened by perdasilva about 5 years ago
- 3 comments
Labels: enhancement, good first issue
#995 - Patch GHSA-844w-j86r-4x2j
Pull Request -
State: closed - Opened by Chancebair about 5 years ago
- 1 comment
#994 - [Metrics] Improve UX with output.metrics regex's
Issue -
State: open - Opened by jlcontreras about 5 years ago
Labels: enhancement, RM:user-metrics
#993 - Convert CloudWatch metrics to High resolution metrics
Pull Request -
State: closed - Opened by jlcontreras about 5 years ago
#992 - [watcher] Add missing dependency
Pull Request -
State: closed - Opened by jlcontreras about 5 years ago
#991 - Update metrics examples and information
Pull Request -
State: closed - Opened by jlcontreras about 5 years ago
- 1 comment
#990 - Update cmd event schema to include target-action-id field
Pull Request -
State: closed - Opened by jlcontreras about 5 years ago
- 1 comment
#989 - [Security] Updates external services to be annotated with the pl cidr ranges
Pull Request -
State: closed - Opened by perdasilva about 5 years ago
- 3 comments
#988 - Cast metric value to float before exporting to CloudWatch
Pull Request -
State: closed - Opened by jlcontreras about 5 years ago
#984 - [Security] BFF/Grafana visible to the outside world
Issue -
State: closed - Opened by perdasilva about 5 years ago
- 2 comments
Labels: p0, security
#981 - User metrics doesn't show up in cloudwatch
Issue -
State: closed - Opened by haohanchen-yagao over 5 years ago
- 2 comments
Labels: p1
#980 - Single instance GPU job failed
Issue -
State: open - Opened by akartsky over 5 years ago
- 3 comments
Labels: p0
#976 - Apache Header All Of The Things!
Pull Request -
State: closed - Opened by gavinmbell over 5 years ago
- 6 comments
#910 - Deadletter Topic and mechanics
Issue -
State: open - Opened by gavinmbell over 5 years ago
Labels: enhancement
#890 - [Customer Request] Trouble Running Script Mode Benchmark
Issue -
State: open - Opened by Chancebair over 5 years ago
- 4 comments
Labels: help wanted
#878 - [Executor] Refactor configuration
Issue -
State: open - Opened by perdasilva over 5 years ago
Labels: enhancement, service:executor
#859 - Undeploy Not Idempotent
Issue -
State: open - Opened by Chancebair over 5 years ago
- 3 comments
Labels: bug, the build, p0
#856 - Blackbox test fail
Issue -
State: closed - Opened by akhilmehra over 5 years ago
- 1 comment
#846 - Blackbox Tests fail
Issue -
State: closed - Opened by akhilmehra over 5 years ago
- 1 comment
Labels: ci
#843 - [Watcher] Init:ImagePullBackOff - bad status
Issue -
State: open - Opened by perdasilva over 5 years ago
- 1 comment
Labels: bug, good first issue, service:watcher, p1
#816 - Fix the badges
Issue -
State: open - Opened by stsukrov over 5 years ago
- 1 comment
Labels: bug, ci
#792 - [CI] Blackbox Tests not utilizing the bff
Issue -
State: open - Opened by Chancebair over 5 years ago
- 1 comment
Labels: ci
#772 - Document Test setup
Issue -
State: open - Opened by gavinmbell over 5 years ago
- 1 comment
Labels: pa-handoff
#771 - Move code out of buildspec
Issue -
State: open - Opened by gavinmbell over 5 years ago
Labels: pa-handoff, clean-up
#735 - Add "Manual Mode" to anubis-setup
Issue -
State: open - Opened by Chancebair over 5 years ago
- 1 comment
Labels: enhancement, ci, pa-handoff
#709 - [ci] Wait before get service bai-bff
Issue -
State: open - Opened by Chancebair over 5 years ago
- 1 comment
#702 - Fetcher deployment fails due to wrong kubectl version
Issue -
State: open - Opened by marcoabreu over 5 years ago
Labels: service:datafetcher, ci, the build
#691 - Investigate if it's possible to make S3 Mock work as a public one.
Issue -
State: open - Opened by stsukrov over 5 years ago
Labels: enhancement, clean-up
#593 - Parametrise metrics pusher backend in templates
Issue -
State: open - Opened by stsukrov over 5 years ago
- 1 comment
Labels: infrastructure, service:executor
#579 - Improve executor/src/transpiler/kubernetes_spec_logic.py
Issue -
State: open - Opened by jlcontreras over 5 years ago
Labels: enhancement, service:executor, clean-up
#555 - Investigate consistensy issues in data-puller
Issue -
State: closed - Opened by stsukrov over 5 years ago
- 2 comments
Labels: bug, service:datafetcher
#538 - Add taints to `bai-worker` nodes
Issue -
State: open - Opened by edisongustavo over 5 years ago
- 2 comments
Labels: enhancement, infrastructure, p0
#453 - make deploy should not depend on make publish anymore
Issue -
State: open - Opened by stsukrov over 5 years ago
Labels: enhancement, good first issue, ci
#401 - Executor randomly chooses AZ, but doesn't check whether it's suitable
Issue -
State: open - Opened by jlcontreras over 5 years ago
- 2 comments
Labels: bug, service:executor, p0, RM:improve-az-selection
#393 - FetcherStatus.FAILED message is empty
Issue -
State: open - Opened by marcoabreu over 5 years ago
- 1 comment
Labels: service:datafetcher, p2
#369 - Cancel jobs which require unavailable instance types
Issue -
State: open - Opened by jlcontreras over 5 years ago
- 1 comment
Labels: bug, service:executor
#344 - Adopt kustomize to manage different configurations (local/devo/prod)
Issue -
State: open - Opened by stsukrov over 5 years ago
- 1 comment
Labels: enhancement, baictl, clean-up
#326 - [BFF] Implement data validation (at ports and adaptors)
Issue -
State: open - Opened by gavinmbell over 5 years ago
- 4 comments
Labels: bai-bff
#320 - Upgrade to terraform 0.12
Issue -
State: open - Opened by edisongustavo over 5 years ago
- 3 comments
Labels: enhancement, infrastructure
#276 - Design bai scriptMode schanges in descriptor
Issue -
State: open - Opened by stsukrov over 5 years ago
Labels: enhancement, descriptor, service:executor
#263 - Readlock from puller
Issue -
State: open - Opened by stsukrov almost 6 years ago
Labels: enhancement, service:datafetcher