Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / NVIDIA/spark-rapids-tools issues and pull requests

#1135 - [FEA] Display full failure messages in failed CSV files

Pull Request - State: closed - Opened by amahussein 3 months ago
Labels: feature request, tools

#1134 - [FEA] Support custom XGBoost model file via user tools CLI

Issue - State: closed - Opened by mattahrens 3 months ago
Labels: feature request, user_tools

#1133 - [DOC] Fix User-tools README with absolute links and valid links

Issue - State: closed - Opened by amahussein 3 months ago
Labels: documentation, user_tools

#1132 - [BUG] Error mesasges in failed_job.csv and failed_stages.csv are not fully displayed

Issue - State: closed - Opened by wjxiz1992 3 months ago - 2 comments
Labels: bug, tools

#1130 - Fix Python runtime error caused by numpy 2.0.0 release

Pull Request - State: closed - Opened by amahussein 4 months ago
Labels: bug, user_tools

#1129 - Bump urllib3 from 1.26.18 to 1.26.19 in /data_validation

Pull Request - State: closed - Opened by dependabot[bot] 4 months ago - 2 comments
Labels: dependencies

#1128 - Fix Python runtime error caused by numpy 2.0.0 release

Pull Request - State: closed - Opened by amahussein 4 months ago - 2 comments
Labels: bug, user_tools

#1127 - [BUG] Python runtime failure due to incompatibe numpy

Issue - State: closed - Opened by amahussein 4 months ago - 1 comment
Labels: bug, user_tools

#1126 - [BUG] python user tools should always display processed apps - even if passed GPU event logs

Issue - State: closed - Opened by tgravescs 4 months ago - 6 comments
Labels: bug, user_tools, usability

#1125 - [FEA] Be able to recommend specific GPU SKU according to SQL nature

Issue - State: open - Opened by wjxiz1992 4 months ago - 2 comments
Labels: feature request, tools

#1124 - Handle different exception thrown by incomplete eventlogs

Pull Request - State: closed - Opened by amahussein 4 months ago
Labels: bug, tools

#1123 - Add an internal CLI to generate instance type descriptions for CSPs

Issue - State: closed - Opened by cindyyuanjiang 4 months ago
Labels: feature request, user_tools

#1122 - [BUG] Handle different exception thrown by incomplete eventlogs

Issue - State: closed - Opened by amahussein 4 months ago
Labels: bug, tools

#1120 - [FEA] Add Benchmarking to evaluate the core tools performance

Issue - State: closed - Opened by amahussein 4 months ago
Labels: feature request, tools

#1119 - Include number of executors per node in cluster information

Pull Request - State: closed - Opened by parthosa 4 months ago - 2 comments
Labels: bug, tools

#1118 - [BUG] Platform should be initialized after parsing the eventlogs

Issue - State: closed - Opened by amahussein 4 months ago
Labels: bug, tools

#1117 - [BUG] Cluster Information: Include number of executors per node

Issue - State: closed - Opened by parthosa 4 months ago - 1 comment
Labels: bug, tools

#1116 - [BUG] Revisit the categorization of unsupported ops in Qual tool output

Issue - State: open - Opened by amahussein 4 months ago
Labels: bug, tools

#1115 - [BUG] Qualification CLI does not generate AutoTuning for onPrem

Issue - State: closed - Opened by amahussein 4 months ago - 2 comments
Labels: bug, user_tools

#1114 - Disable the spark_rapids bootstrap command

Pull Request - State: closed - Opened by amahussein 4 months ago - 2 comments
Labels: feature request, user_tools, usability

#1113 - Fix typo in Profiler class using qual instead of prof

Pull Request - State: closed - Opened by amahussein 4 months ago
Labels: bug, tools

#1111 - Add support to Python 3.12

Pull Request - State: closed - Opened by amahussein 4 months ago
Labels: feature request, user_tools

#1110 - user-tools: Update log messages

Pull Request - State: closed - Opened by nartal1 4 months ago
Labels: bug, user_tools, usability

#1109 - [FEA] Qualification tool should recommend the cluster shape based on the best TCO according to our internal benchmark

Issue - State: open - Opened by viadea 4 months ago - 3 comments
Labels: feature request, user_tools

#1108 - Enable xgboost prediction model by default

Pull Request - State: closed - Opened by amahussein 4 months ago
Labels: feature request, user_tools

#1107 - [FEA] Enable xgboost prediction model by default

Issue - State: closed - Opened by amahussein 4 months ago
Labels: feature request, user_tools

#1106 - [FEA] Qualification tool should print Kryo related recommendations

Issue - State: closed - Opened by viadea 4 months ago
Labels: feature request, tools

#1105 - Add support to Python3.11

Pull Request - State: closed - Opened by amahussein 4 months ago
Labels: bug, user_tools

#1104 - Fix nan label issue in training

Pull Request - State: closed - Opened by leewyang 4 months ago
Labels: bug, user_tools

#1103 - [FEA] Disable the spark_rapids bootstrap command

Issue - State: closed - Opened by tgravescs 4 months ago
Labels: feature request, user_tools, usability

#1102 - Fix qualx app metrics

Pull Request - State: closed - Opened by leewyang 4 months ago
Labels: bug, user_tools

#1101 - [FEA] Skip mvn workflows for python-only changes

Issue - State: open - Opened by parthosa 4 months ago
Labels: feature request, user_tools, build

#1100 - [BUG] Refactor QualX for Linter and Test Compatibility

Issue - State: closed - Opened by parthosa 4 months ago - 2 comments
Labels: bug, user_tools

#1099 - [FEA] Disable grouping applications by name

Issue - State: closed - Opened by amahussein 4 months ago
Labels: feature request, user_tools, usability, affect-output

#1098 - Fix missing assignment to savings_recommendations

Pull Request - State: closed - Opened by amahussein 4 months ago
Labels: bug, user_tools

#1097 - [BUG] savings_recommendations referenced before assignment in user-tools

Issue - State: closed - Opened by amahussein 4 months ago
Labels: bug, user_tools

#1096 - clip appDuration to at least Duration

Pull Request - State: closed - Opened by leewyang 4 months ago - 5 comments
Labels: user_tools

#1095 - Handle QualX behaviour when Qual Tool does not generate any outputs

Pull Request - State: closed - Opened by parthosa 4 months ago
Labels: user_tools

#1094 - [BUG] Handle QualX behaviour when Qual Tool does not generate any outputs.

Issue - State: closed - Opened by parthosa 4 months ago
Labels: bug, user_tools

#1093 - Fix internal predict CLI and remove preprocessed argument

Pull Request - State: closed - Opened by leewyang 4 months ago
Labels: user_tools

#1092 - Fix missing appEndTime in raw_metrics folder

Pull Request - State: closed - Opened by amahussein 4 months ago
Labels: bug, tools

#1091 - [BUG] The raw_metrics CSV file miss App endTime and appDuration

Issue - State: closed - Opened by amahussein 4 months ago
Labels: bug, tools

#1090 - [FEA]Qualification tool to qualify a whole dataproc serverless DAG instead of a single Spark Application

Issue - State: open - Opened by viadea 4 months ago - 1 comment
Labels: feature request, user_tools

#1089 - Update QualX to return default speedups and fix App Duration for incomplete apps

Pull Request - State: closed - Opened by parthosa 4 months ago
Labels: user_tools

#1088 - [BUG] Add ShutdownHook to Core tools

Issue - State: open - Opened by amahussein 4 months ago
Labels: bug, tools

#1087 - [FEA] spark_rapids user tools generates the wrong worker info to pass into java qual tool on databricks aws

Issue - State: closed - Opened by tgravescs 4 months ago - 1 comment
Labels: feature request, ? - Needs Triage

#1086 - [FEA] spark_rapids tool should have --debug option to leave work_dir files around

Issue - State: open - Opened by tgravescs 4 months ago
Labels: feature request, user_tools

#1085 - Fix java Qual tool Autotuner output when GPU device is missing

Pull Request - State: closed - Opened by cindyyuanjiang 4 months ago - 3 comments
Labels: bug, tools, usability

#1084 - fix signature error from overlapping merges

Pull Request - State: closed - Opened by leewyang 4 months ago
Labels: user_tools

#1083 - sync w/ internal repo; update models

Pull Request - State: closed - Opened by leewyang 4 months ago
Labels: user_tools

#1082 - Reduce the maximum number of Java threads in CLI

Pull Request - State: closed - Opened by amahussein 4 months ago
Labels: bug, user_tools

#1081 - [FEA] Profiler should dump all SQL metrics

Issue - State: closed - Opened by tgravescs 4 months ago - 1 comment
Labels: feature request, tools

#1080 - Remove using Profiler metrics for QualX and Heuristics

Pull Request - State: closed - Opened by parthosa 4 months ago
Labels: user_tools

#1079 - [BUG] Reduce the maximum number of Java threads

Issue - State: closed - Opened by amahussein 4 months ago
Labels: bug, user_tools

#1078 - [FEA] Profiling tool support auto tuner on cpu logs

Issue - State: open - Opened by tgravescs 4 months ago - 1 comment
Labels: feature request, tools

#1077 - Bump requests from 2.31.0 to 2.32.2 in /data_validation

Pull Request - State: closed - Opened by dependabot[bot] 4 months ago
Labels: build, dependencies

#1076 - Port QualX repo and add CLI for train

Pull Request - State: closed - Opened by parthosa 4 months ago
Labels: user_tools

#1075 - Include entry points for internal usage

Pull Request - State: closed - Opened by parthosa 4 months ago
Labels: user_tools

#1074 - [FEA]Qualification tool to qualify a Dataproc Cluster instead of a single Spark Application

Issue - State: open - Opened by viadea 4 months ago
Labels: feature request, user_tools

#1073 - [BUG] prediction module loads the profiler output twice

Issue - State: closed - Opened by amahussein 4 months ago - 2 comments
Labels: bug, user_tools

#1072 - [BUG] Fallback to SPEEDUPS when data source information is missing

Issue - State: closed - Opened by kuhushukla 4 months ago - 1 comment
Labels: bug, user_tools

#1071 - [BUG] Support Python versions 3.10-312

Issue - State: closed - Opened by amahussein 4 months ago - 2 comments
Labels: bug, user_tools

#1070 - [BUG] Improve handling of Java cmd exceptions in python user tools

Issue - State: open - Opened by amahussein 4 months ago
Labels: bug, user_tools, usability

#1069 - Update the Qual tool AutoTuner Heuristics against CPU event logs

Pull Request - State: closed - Opened by tgravescs 4 months ago - 2 comments
Labels: feature request, tools

#1068 - [FEA] Enhance qualification tool Auto Tuner CPU event log recommendations

Issue - State: closed - Opened by tgravescs 4 months ago
Labels: feature request, tools

#1067 - [FEA] AutoTuner investigate configuring spark.executor.cores to an optimal ratio per GPU

Issue - State: open - Opened by tgravescs 4 months ago
Labels: feature request, tools

#1066 - Sync tools with plugin newly supported operators

Pull Request - State: closed - Opened by cindyyuanjiang 4 months ago - 1 comment
Labels: feature request, tools

#1065 - Handling FileNotFound exception in AutoTuner

Pull Request - State: closed - Opened by bilalbari 4 months ago
Labels: bug, tools

#1064 - Update tools with latest qualx and add CLI for train

Pull Request - State: closed - Opened by parthosa 4 months ago - 1 comment
Labels: feature request, user_tools

#1063 - [FEA] Identify Delta Live Tables job and disqualify it

Issue - State: open - Opened by amahussein 4 months ago
Labels: feature request, tools

#1062 - [FEA] AutoTuner on GPU eventlog look at decompressed input size to determine max partition bytes

Issue - State: open - Opened by tgravescs 4 months ago
Labels: feature request, tools

#1061 - [FEA] Add option to switch between Databricks configuration profiles in Qual tool

Issue - State: open - Opened by cindyyuanjiang 4 months ago - 4 comments
Labels: feature request, user_tools, usability

#1060 - Port training model code into user tools

Pull Request - State: closed - Opened by parthosa 4 months ago - 1 comment
Labels: user_tools

#1057 - [FEA] Tool should exit gracefully if the python version is not supported

Issue - State: open - Opened by viadea 4 months ago - 1 comment
Labels: feature request, user_tools, usability

#1056 - [FEA] Qualification tool can process a given day's event logs

Issue - State: open - Opened by viadea 4 months ago
Labels: feature request, user_tools

#1055 - [DOC] User doc tools should be clear about --filter_apps and also --filter-criteria

Issue - State: open - Opened by viadea 4 months ago - 2 comments
Labels: documentation, user_tools

#1054 - User tools fallback to default zone/region

Pull Request - State: closed - Opened by nartal1 4 months ago - 1 comment
Labels: user_tools, usability

#1053 - Handle missing pricing info for user qual tool on Databricks platforms

Pull Request - State: closed - Opened by cindyyuanjiang 4 months ago - 1 comment
Labels: feature request, user_tools

#1052 - Handle metric names from legacy spark

Pull Request - State: closed - Opened by amahussein 4 months ago
Labels: bug, tools

#1051 - Skip cost savings if pricing info is missing for Databricks qual tool

Issue - State: closed - Opened by cindyyuanjiang 4 months ago
Labels: feature request, user_tools

#1050 - Split job and stage level aggregated metrics into different files

Pull Request - State: closed - Opened by parthosa 4 months ago
Labels: feature request, user_tools, tools

#1049 - [FEA] Update prediction code with latest changes and new CLI for training

Issue - State: closed - Opened by parthosa 4 months ago
Labels: feature request, user_tools

#1048 - [FEA] Include attemptId in stage level aggregated metrics

Issue - State: open - Opened by parthosa 4 months ago
Labels: feature request, tools

#1047 - Update prediction code to use separate job and stage level aggregates

Pull Request - State: closed - Opened by parthosa 4 months ago
Labels: user_tools

#1046 - [BUG] Qualification does not catch unsupported func

Issue - State: closed - Opened by nvliyuan 4 months ago - 2 comments
Labels: bug, tools

#1045 - [FEA] Sync plugin newly supported expression

Issue - State: closed - Opened by cindyyuanjiang 4 months ago
Labels: feature request, tools

#1044 - Split JobStageAggTaskMetrics file into two different files

Pull Request - State: closed - Opened by parthosa 4 months ago - 2 comments
Labels: user_tools, tools

#1042 - [BUG] MatchError in getDataSourceInfo

Issue - State: closed - Opened by amahussein 4 months ago - 1 comment
Labels: bug, tools

#1041 - [FEA] Generate all CSVs from Profiler in Qualification

Issue - State: closed - Opened by parthosa 4 months ago
Labels: feature request, tools

#1040 - [FEA] Migrate estimation model to Qualification CLI

Issue - State: closed - Opened by parthosa 4 months ago - 2 comments
Labels: feature request

#1039 - Hook up the auto tuner in the qualification tool

Pull Request - State: closed - Opened by tgravescs 4 months ago - 1 comment
Labels: feature request, tools

#1038 - [FEA] Improve Logging in Tools

Issue - State: open - Opened by parthosa 4 months ago - 2 comments
Labels: feature request, good first issue, user_tools, usability

#1037 - [FEA] Profiling GPU event logs - duration percentage for the operation per stage

Issue - State: open - Opened by tgravescs 4 months ago
Labels: feature request, tools

#1036 - [FEA] Profiling GPU event log output topN operators

Issue - State: open - Opened by tgravescs 4 months ago
Labels: feature request, tools