Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / tensorflow/data-validation issues and pull requests

#259 - Add build workflow via docker

Pull Request - State: open - Opened by aktech 8 days ago - 1 comment

#258 - `data-validation` package fails to build/install

Issue - State: open - Opened by peytondmurray about 1 month ago

#250 - Update pyarrow>=14.0.1,<15'

Pull Request - State: open - Opened by serhio-k 10 months ago

#249 - Update pyarrow version range to address vulnerability CVE-2023-47248

Issue - State: open - Opened by serhio-k 10 months ago - 3 comments
Labels: stat:awaiting tensorflower, type:feature

#241 - Update custom_data_validation.md

Pull Request - State: open - Opened by singhniraj08 over 1 year ago - 8 comments

#237 - Fix Install Numpy link

Pull Request - State: open - Opened by singhniraj08 over 1 year ago - 7 comments

#205 - Missing support for M1 Mac

Issue - State: closed - Opened by utkarshagarwal over 2 years ago - 10 comments
Labels: stat:awaiting tensorflower, type:build/install

#190 - TFDV on Dataflow getting OOMs frequently

Issue - State: open - Opened by cyc about 3 years ago - 12 comments
Labels: stat:awaiting tensorflower, type:performance

#100 - Missing wheel for Python 3.7 under Windows

Issue - State: closed - Opened by Ark-kun over 4 years ago - 1 comment
Labels: type:docs, stat:awaiting tensorflower

#98 - The generate_statistics_from_csv very slowly for large dataset in single server

Issue - State: open - Opened by yajunwong over 4 years ago - 4 comments
Labels: stat:awaiting tensorflower, type:performance

#97 - Test Copybara Do Not Merge

Pull Request - State: closed - Opened by dhruvesh09 almost 5 years ago

#96 - Make _WeightedCounter serializable

Pull Request - State: closed - Opened by santosh-d3vpl3x almost 5 years ago - 6 comments
Labels: cla: yes

#95 - We can not use INT with missing values?

Issue - State: closed - Opened by sfujiwara almost 5 years ago - 6 comments
Labels: stat:awaiting response, type:support

#94 - infer_schema(..., infer_feature_shape=True) should parse VarLenFeature to SparseFeature

Issue - State: closed - Opened by schmidt-jake almost 5 years ago - 8 comments
Labels: stat:awaiting response, type:support

#93 - range pin on scikit-learn is not permissive enough to use modern versions of skl

Issue - State: closed - Opened by kwlzn almost 5 years ago - 3 comments
Labels: stat:awaiting response, type:feature

#92 - generate_statistics_from_pyarrow table or parquet

Issue - State: open - Opened by tanguycdls almost 5 years ago - 22 comments
Labels: stat:awaiting tensorflower, type:support

#91 - generate_statistics_from_dataframe fails for large text columns (INT overflow)

Issue - State: closed - Opened by wsuchy almost 5 years ago - 1 comment
Labels: stat:awaiting tensorflower, type:bug

#90 - List datatype (multiple category feature type) not supported

Issue - State: closed - Opened by wsuchy almost 5 years ago - 7 comments
Labels: type:support

#89 - Added missing code to merge cross feature stats

Pull Request - State: closed - Opened by wsuchy almost 5 years ago - 4 comments

#88 - CrossFeature statistics aren't present in the result PB

Issue - State: closed - Opened by wsuchy almost 5 years ago - 1 comment
Labels: stat:awaiting tensorflower, type:bug

#87 - Types missing in documentation / code for tfdv.CombinerStatsGenerator

Issue - State: closed - Opened by wsuchy almost 5 years ago - 3 comments
Labels: stat:awaiting response, type:support

#86 - TFDV fails with other compression types

Issue - State: closed - Opened by Arvinds-ds about 5 years ago - 4 comments
Labels: type:support

#85 - Unable to run "../bazel_bin_ppc64le_0.27.1 run -c opt tensorflow_data_validation:build_pip_package" on the ppc64le

Issue - State: closed - Opened by xauthulei about 5 years ago - 1 comment
Labels: type:build/install

#84 - key "ARROW_HEADER_DIR" not found in dictionary

Issue - State: closed - Opened by xauthulei about 5 years ago - 2 comments
Labels: stat:awaiting tensorflower, type:build/install

#83 - ImportError: libarrow.so.14: cannot open shared object file: No such file or directory when running on Dataflow

Issue - State: closed - Opened by andrewsmartin about 5 years ago - 5 comments
Labels: stat:awaiting response, type:support

#82 - Facets axis labels are not visible in Colab

Issue - State: closed - Opened by ageron about 5 years ago - 5 comments
Labels: stat:awaiting tensorflower, type:support

#81 - How to generate statistics from multiple csv files

Issue - State: closed - Opened by theoqian about 5 years ago - 2 comments
Labels: stat:awaiting tensorflower, type:support

#80 - Fixed DOMException issue faced when embedding tfdv visualization in iframe

Pull Request - State: closed - Opened by ajchili about 5 years ago - 4 comments

#79 - Embedding Data Validation Visualization in iframe Fails due to DOMException

Issue - State: closed - Opened by ajchili about 5 years ago - 1 comment
Labels: stat:awaiting tensorflower, type:support

#78 - TDFV==0.14.0 Wheel Fails Integrity Check

Issue - State: closed - Opened by jhamet93 about 5 years ago - 8 comments
Labels: stat:awaiting response, type:support

#77 - Cannot determine feature type error

Issue - State: closed - Opened by htahir1 about 5 years ago - 5 comments
Labels: stat:awaiting tensorflower, type:support

#76 - tfdv manylinux pypi packages are built/linked on too new of a platform for general compatibility

Issue - State: closed - Opened by kwlzn about 5 years ago - 8 comments
Labels: stat:awaiting tensorflower, type:build/install

#75 - GenerateStatistics API Change

Issue - State: open - Opened by paulgc about 5 years ago
Labels: Announcement

#74 - get_statistics_html should support multiple datasets

Issue - State: closed - Opened by htahir1 about 5 years ago - 3 comments
Labels: stat:awaiting tensorflower, type:feature

#73 - Replacing feature name with feature path in statistics proto

Issue - State: open - Opened by paulgc about 5 years ago
Labels: Announcement

#72 - Newline in CSV quoted string breaks reader

Issue - State: open - Opened by jondot about 5 years ago - 5 comments
Labels: stat:awaiting tensorflower, type:feature

#71 - Is it possible to pinpoint the exact example that caused the anomalies?

Issue - State: closed - Opened by benjamintanweihao over 5 years ago - 3 comments
Labels: type:support

#69 - Support compressed files in CSV/TFR reader

Issue - State: closed - Opened by Arvinds-ds over 5 years ago - 1 comment
Labels: stat:awaiting tensorflower, type:feature

#68 - Add link in readme to paper

Issue - State: closed - Opened by impredicative over 5 years ago - 1 comment
Labels: type:docs, type:feature

#67 - Custom statistics with CombinerStatsGenerator and sequential data

Issue - State: closed - Opened by TimSmole over 5 years ago - 2 comments
Labels: stat:awaiting tensorflower, type:feature

#66 - Add correlations to Facets charts/tables

Issue - State: open - Opened by ianhellstrom over 5 years ago - 6 comments
Labels: stat:awaiting response, type:feature

#65 - pip install fails in python 2.7 due to latest scikit-learn

Issue - State: closed - Opened by paulgc over 5 years ago - 1 comment

#64 - Pin scikit-learn to version which support Py2

Pull Request - State: closed - Opened by brianmartin over 5 years ago - 2 comments

#63 - Documentation about anomalies and what constraints trigger them

Issue - State: closed - Opened by martin-laurent over 5 years ago - 1 comment
Labels: type:docs, stat:awaiting tensorflower, type:feature

#62 - Unclear anomaly_info

Issue - State: closed - Opened by martin-laurent over 5 years ago - 3 comments
Labels: type:docs, stat:awaiting response

#61 - Add Py_XDECREF to prevent memory leak in FastExampleDecoder

Pull Request - State: closed - Opened by cyc over 5 years ago - 6 comments

#60 - Ignoring feature of type datetime64[ns] when generate statistics from dataframe

Issue - State: closed - Opened by gustavorps over 5 years ago - 4 comments
Labels: type:support

#59 - Not able to run "bazel run -c opt tensorflow_data_validation:build_pip_package" on Mac OSX

Issue - State: closed - Opened by aaronlelevier over 5 years ago - 1 comment
Labels: type:support

#58 - TFDV sometimes erroneously sets min_fraction to 1.0

Issue - State: closed - Opened by cyc over 5 years ago - 2 comments
Labels: bug, stat:awaiting tensorflower, type:bug

#57 - Add a tfdv.load_anomalies_text function similar to tfdv.load_schema_text

Issue - State: closed - Opened by loiccordone over 5 years ago - 1 comment
Labels: stat:awaiting tensorflower, type:feature

#56 - Any way to speed up tfdv.TFExampleDecoder?

Issue - State: closed - Opened by cyc over 5 years ago - 5 comments
Labels: stat:awaiting tensorflower, type:support

#55 - is there docs about the statics and math part?

Issue - State: closed - Opened by yupbank over 5 years ago - 2 comments
Labels: type:docs, stat:awaiting tensorflower, type:feature

#54 - planned support for hive table?

Issue - State: open - Opened by zhaiyuyong over 5 years ago - 2 comments
Labels: stat:awaiting tensorflower, type:feature

#53 - Make desired_batch_size argument public in GenerateStatistics API

Issue - State: closed - Opened by AdrianLsk over 5 years ago - 1 comment

#52 - Project import generated by Copybara.

Pull Request - State: closed - Opened by paulgc over 5 years ago

#51 - [Improvement Suggestions] Specific na_values

Issue - State: closed - Opened by fernandofsilva over 5 years ago - 1 comment
Labels: type:support

#50 - Add get_statistics_html function to display_util.py.

Pull Request - State: closed - Opened by cbreuel over 5 years ago

#49 - Project import generated by Copybara.

Pull Request - State: closed - Opened by paulgc over 5 years ago

#48 - overflow encountered in long_scalars

Issue - State: closed - Opened by cbreuel almost 6 years ago - 3 comments
Labels: stat:awaiting tensorflower, type:support

#47 - Project import generated by Copybara.

Pull Request - State: closed - Opened by paulgc almost 6 years ago

#46 - Fix pipeline option variable name

Pull Request - State: closed - Opened by daikeshi almost 6 years ago - 3 comments

#45 - tf.SequenceExample support

Issue - State: closed - Opened by martin-laurent almost 6 years ago - 5 comments
Labels: stat:awaiting response, type:feature

#44 - docs: Fix get_started formatting

Pull Request - State: closed - Opened by brianmartin almost 6 years ago

#43 - Project import generated by Copybara.

Pull Request - State: closed - Opened by paulgc almost 6 years ago

#41 - Inconsistency between statistics and inferred schema

Issue - State: closed - Opened by martin-laurent almost 6 years ago - 2 comments
Labels: stat:awaiting response, type:support

#40 - ImportError: cannot import name pywrap_tensorflow_data_validation

Issue - State: closed - Opened by uguisu almost 6 years ago - 4 comments
Labels: stat:awaiting response, type:support

#39 - Can't find the "Schema change generator" code

Issue - State: closed - Opened by yonromai almost 6 years ago - 4 comments
Labels: stat:awaiting tensorflower

#38 - [DataflowRuntimeException] ImportError: No module named tfdv.statistics.stats_impl

Issue - State: closed - Opened by yonromai almost 6 years ago - 9 comments
Labels: stat:awaiting response, type:support

#37 - schema not matching

Issue - State: closed - Opened by joyjeni almost 6 years ago - 3 comments
Labels: stat:awaiting tensorflower

#36 - Project import generated by Copybara.

Pull Request - State: closed - Opened by paulgc almost 6 years ago

#35 - AttributeError: 'module' object has no attribute '_QuantilesCombinerSpec'

Issue - State: closed - Opened by joyjeni almost 6 years ago - 2 comments
Labels: stat:awaiting response

#34 - AttributeError: 'module' object has no attribute '_QuantilesCombinerSpec'

Issue - State: closed - Opened by peteboothroyd almost 6 years ago - 3 comments
Labels: stat:awaiting response

#33 - [Improvement Suggestions]

Issue - State: open - Opened by vincentteyssier almost 6 years ago - 1 comment
Labels: stat:awaiting tensorflower, type:feature

#32 - Project import generated by Copybara.

Pull Request - State: closed - Opened by paulgc almost 6 years ago

#31 - AttributeError: 'module' object has no attribute '_QuantilesCombinerSpec'

Issue - State: closed - Opened by dpoulopoulos almost 6 years ago - 1 comment

#30 - Missing API for manually setting domain

Issue - State: closed - Opened by AdrianLsk almost 6 years ago - 2 comments
Labels: enhancement

#29 - AttributeError: 'module' object has no attribute 'uint64'

Issue - State: closed - Opened by AndreaPi almost 6 years ago - 2 comments

#28 - tfdv.generate_statistics_from_csv not working anymore with current branch

Issue - State: closed - Opened by AndreaPi almost 6 years ago - 4 comments

#27 - Documentation typo fixes

Pull Request - State: closed - Opened by naveedgol almost 6 years ago

#26 - Slow performance when computing stats for moderately large data set

Issue - State: closed - Opened by AndreaPi almost 6 years ago - 12 comments
Labels: stat:awaiting response, type:bug

#25 - Project import generated by Copybara.

Pull Request - State: closed - Opened by paulgc almost 6 years ago

#24 - Corrected typo: references issue below

Pull Request - State: closed - Opened by AndreaPi almost 6 years ago - 3 comments

#23 - import tensorflow_data_validation error

Issue - State: closed - Opened by eric-tc almost 6 years ago - 3 comments
Labels: stat:awaiting response

#22 - Error in documentation

Issue - State: closed - Opened by AndreaPi almost 6 years ago - 1 comment
Labels: type:docs

#21 - Error in generate statistics due to Python Snappy

Issue - State: closed - Opened by vincentteyssier almost 6 years ago - 5 comments
Labels: type:build/install

#20 - Added a PTransform for decoding TF examples in beam pipeline

Pull Request - State: closed - Opened by terrytangyuan almost 6 years ago - 5 comments

#19 - Project import generated by Copybara.

Pull Request - State: closed - Opened by paulgc almost 6 years ago

#18 - remove space to fix link

Pull Request - State: closed - Opened by phpisciuneri almost 6 years ago

#17 - _pywrap_tensorflow_data_validation.so: undefined symbol: PyInstanceMethod_New

Issue - State: closed - Opened by dengyiping2014 about 6 years ago - 5 comments
Labels: type:build/install

#16 - GCP Dataflow file pattern *.tfrecord

Issue - State: closed - Opened by alelouis about 6 years ago - 1 comment

#15 - Project import generated by Copybara.

Pull Request - State: closed - Opened by paulgc about 6 years ago

#14 - Open multiple .tfrecord

Issue - State: closed - Opened by alelouis about 6 years ago - 2 comments

#13 - Error while using tfdv.generate_statistics_from_csv('path_to_csv_file')

Issue - State: closed - Opened by akki3d76 about 6 years ago - 4 comments
Labels: stat:awaiting response, type:bug

#12 - ImportError: cannot import name types

Issue - State: closed - Opened by bartcode about 6 years ago - 7 comments

#11 - Statistics visualization doesn't work in Firefox

Issue - State: closed - Opened by advancera about 6 years ago - 12 comments
Labels: stat:awaiting response, type:support

#10 - Python 3 Support

Issue - State: closed - Opened by paulgc about 6 years ago - 11 comments
Labels: Announcement

#9 - Project import generated by Copybara.

Pull Request - State: closed - Opened by paulgc about 6 years ago