Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / apache/hudi issues and pull requests

#10579 - [HUDI-7355] Empty commit should be enbale default to avoid Timeout Exception

Pull Request - State: closed - Opened by xuzifu666 8 months ago - 1 comment

#10578 - Bloom improvements

Pull Request - State: open - Opened by the-other-tim-brown 8 months ago - 2 comments

#10577 - [HUDI-6868] Support extracting passwords from credential store for Hive Sync

Pull Request - State: closed - Opened by ad1happy2go 8 months ago - 1 comment

#10575 - [HUDI-7347] Introduce SeekableDataInputStream for random access

Pull Request - State: closed - Opened by yihua 8 months ago - 1 comment
Labels: release-1.0.0

#10574 - [HUDI-7346] Remove usage of org.apache.hadoop.hbase.util.Bytes

Pull Request - State: closed - Opened by yihua 8 months ago - 1 comment
Labels: release-1.0.0

#10573 - [HUDI-7344] Use Java {Input/Output}Stream instead of FSData{Input/Output}Stream when possible

Pull Request - State: closed - Opened by yihua 8 months ago - 1 comment
Labels: release-1.0.0

#10572 - [HUDI-7351] Hive-sync partition pushdown does not work with glue

Pull Request - State: closed - Opened by parisni 8 months ago - 2 comments
Labels: aws-support

#10571 - [HUDI-7345] Remove usage of org.apache.hadoop.util.VersionUtil

Pull Request - State: closed - Opened by yihua 8 months ago - 1 comment
Labels: release-1.0.0

#10570 - [HUDI-7343] Replace Path.SEPARATOR with HoodieLocation.SEPARATOR

Pull Request - State: closed - Opened by yihua 8 months ago - 2 comments
Labels: release-1.0.0

#10568 - [HUDI-7342] Use BaseFileUtils to hide format-specific logic in HoodiePartitionMetadata

Pull Request - State: closed - Opened by yihua 8 months ago - 1 comment
Labels: release-1.0.0

#10567 - [HUDI-7336] Introduce new HoodieStorage abstraction

Pull Request - State: closed - Opened by yihua 8 months ago - 4 comments
Labels: release-1.0.0

#10566 - [SUPPORT] Hudi CLI bundle not working

Issue - State: open - Opened by CTTY 8 months ago - 1 comment
Labels: priority:major, cli, table-service

#10565 - [HUDI-7337] Implement MetricsReporter that reports metrics to M3

Pull Request - State: open - Opened by kbuci 8 months ago - 1 comment
Labels: release-0.14.2

#10564 - [HUDI-7335] Create hudi-hadoop-common for hadoop-specific implementation

Pull Request - State: closed - Opened by yihua 8 months ago - 1 comment
Labels: release-1.0.0

#10563 - added new videos for hudi oss site

Pull Request - State: closed - Opened by nfarah86 8 months ago - 5 comments

#10562 - [MINOR] add logger to CompactionPlanOperator & ClusteringPlanOperator

Pull Request - State: closed - Opened by eric9204 8 months ago - 1 comment

#10561 - Issue with reading the debezium inputs

Issue - State: open - Opened by zyperd 8 months ago - 4 comments
Labels: priority:major, hudistreamer, on-call-triaged

#10560 - new hudi content for 01-2024

Pull Request - State: closed - Opened by nfarah86 8 months ago

#10559 - Hudi behaviour if AWS Glue concurrency is triggered[SUPPORT]

Issue - State: open - Opened by rishabhreply 8 months ago - 10 comments
Labels: priority:minor, on-call-triaged, concurrency-control

#10557 - updated button size so join now is on one line

Pull Request - State: closed - Opened by nfarah86 8 months ago

#10556 - [HUDI-7327] remove meta cols from incoming schema in stream sync

Pull Request - State: closed - Opened by jonvex 8 months ago - 3 comments

#10554 - [MINOR] Fix UT error in HUDI-6941 with stage task numbers

Pull Request - State: closed - Opened by xuzifu666 8 months ago - 1 comment

#10553 - [SUPPORT] Cannot create a hudi table that has a column starting with a digit

Issue - State: closed - Opened by Mourya1319 8 months ago - 9 comments
Labels: schema-and-data-types, priority:minor, spark-sql, on-call-triaged

#10552 - [HUDI-6902] Detect the cause of orphan processes_2

Pull Request - State: closed - Opened by linliu-code 8 months ago - 2 comments

#10551 - [HUDI-7334] Remove EMBEDDED_KV_STORE based FSV usage in tests

Pull Request - State: closed - Opened by linliu-code 8 months ago - 5 comments

#10550 - [MINOR] Splitting UT Spark DataSource into two

Pull Request - State: open - Opened by vinothchandar 8 months ago - 2 comments

#10549 - [HUDI-7323] Use a schema supplier instead of a static value

Pull Request - State: closed - Opened by the-other-tim-brown 8 months ago - 1 comment
Labels: schema-and-data-types, hudistreamer

#10548 - [HUDI-6902] Remove hdfstestservice usages

Pull Request - State: open - Opened by linliu-code 8 months ago - 1 comment

#10547 - [MINOR] Reduce UT spark-datasource test times

Pull Request - State: closed - Opened by vinothchandar 8 months ago - 2 comments

#10545 - [BUG]Data duplication, multiple data primary keys are duplicated

Issue - State: closed - Opened by waywtdcc 8 months ago - 4 comments
Labels: priority:minor, data-duplication

#10544 - [SUPPORT]When I write HUDI with Flink use asynchronous compaction, I use the --service parameter, but I run into a problem

Issue - State: open - Opened by LIKE-HUB 8 months ago - 2 comments
Labels: priority:major, flink, table-service

#10542 - [SUPPORT] Dataloss in FlinkCDC into Hudi without any exception or other infomation

Issue - State: open - Opened by xuzifu666 8 months ago - 10 comments
Labels: flink, data-loss, change-data-capture

#10541 - [HUDI-7317] FlinkTableFactory snatifyCheck should contains index type

Pull Request - State: closed - Opened by xuzifu666 9 months ago - 1 comment

#10539 - [SUPPORT] Flink streaming read MOR table, thrown Unexpected cdc file split infer case: LOG_FILE Exception

Issue - State: closed - Opened by nicholasxu 9 months ago - 7 comments
Labels: priority:major, flink

#10538 - [SUPPORT] Multi Writing unable to acquire lock by using flink with diffrent LockProvider

Issue - State: open - Opened by hanson2021 9 months ago - 2 comments
Labels: priority:major, concurrency-control

#10537 - [HUDI-7315] Disable constructing NOT filter predicate when pushing do…

Pull Request - State: closed - Opened by paul8263 9 months ago - 1 comment

#10536 - [HUDI-7314] Hudi Create table support index type check

Pull Request - State: closed - Opened by xuzifu666 9 months ago - 2 comments

#10535 - initial commit and added applied intuition and penn entertainment to …

Pull Request - State: closed - Opened by nfarah86 9 months ago - 1 comment

#10534 - [HUDI-6902] Take a dump at hadoop-mr-java-client module

Pull Request - State: closed - Opened by linliu-code 9 months ago - 2 comments

#10533 - [SUPPORT] Spark readStream fails with [COLUMN_ALREADY_EXISTS] for streaming tables created with "hoodie.schema.on.read.enable" & "hoodie.datasource.write.reconcile.schema" enabled

Issue - State: open - Opened by imonteroq 9 months ago - 5 comments
Labels: priority:major, on-call-triaged, reader-core, schema-evolution, release-0.14.2

#10532 - [HUDI-7277] fix `hoodie.bulkinsert.shuffle.parallelism` not activated…

Pull Request - State: closed - Opened by KnightChess 9 months ago - 2 comments
Labels: spark

#10531 - [HUDI-7311] Add implicit literal type conversion before filter push down

Pull Request - State: closed - Opened by paul8263 9 months ago - 5 comments
Labels: schema-and-data-types, flink-sql

#10529 - initial commit for web banner

Pull Request - State: closed - Opened by nfarah86 9 months ago - 2 comments

#10526 - [MINOR] Fix flaky testMultiWriterWithAsyncTableServicesWithConflict test

Pull Request - State: open - Opened by Zouxxyy 9 months ago - 1 comment
Labels: test-stability

#10525 - [HUDI-6902] Try container for Azure tests

Pull Request - State: closed - Opened by linliu-code 9 months ago - 4 comments

#10524 - [HUDI-7309] Disable constructing AND & OR filter predicates when filt…

Pull Request - State: closed - Opened by paul8263 9 months ago - 1 comment

#10523 - [HUDI-7308] LockManager::unlock should not call updateLockHeldTimerMetrics if lockDurationTimer has not been started

Pull Request - State: closed - Opened by kbuci 9 months ago - 6 comments
Labels: metrics, stability

#10522 - Update docker_demo.md

Pull Request - State: closed - Opened by DanRoscigno 9 months ago

#10521 - [DOCS] Diagram Change for File Layout Page

Pull Request - State: closed - Opened by dipankarmazumdar 9 months ago

#10520 - [HUDI-6902] Shutdown metric hooks properly

Pull Request - State: closed - Opened by linliu-code 9 months ago - 1 comment

#10519 - [SUPPORT] - FileNotFound Exception while using hoodie.datasource.read.incr.fallback.fulltablescan.enable=true

Issue - State: open - Opened by remeajayi2022 9 months ago - 4 comments
Labels: spark, incremental-etl

#10518 - [HUDI-7305] Fix cast exception for byte/short/float partitioned field

Pull Request - State: closed - Opened by stream2000 9 months ago - 1 comment
Labels: schema-and-data-types, spark

#10517 - [HUDI-7303] Fix date field type unexpectedly convert to Long when usi…

Pull Request - State: closed - Opened by paul8263 9 months ago - 4 comments
Labels: schema-and-data-types, flink

#10515 - [HUDI-7302] Consistent hashing row writer support sorting

Pull Request - State: closed - Opened by stream2000 9 months ago - 5 comments

#10513 - [HUDI-6902] Fix a unit test

Pull Request - State: closed - Opened by linliu-code 9 months ago - 1 comment

#10512 - [HUDI-6902] Containerize the Azure CI

Pull Request - State: open - Opened by linliu-code 9 months ago - 20 comments

#10511 - [SUPPORT] MOR hudi 0.14, Bloom Filters are not being used on query time

Issue - State: open - Opened by bk-mz 9 months ago - 21 comments
Labels: performance, priority:major, index, metadata

#10509 - Fix complex partition generator

Pull Request - State: closed - Opened by parisni 9 months ago - 2 comments

#10508 - [SUPPORT] Migration partitionned table with complex key generator to 0.14.1 leads to duplicates when recordkey length =1

Issue - State: open - Opened by parisni 9 months ago - 6 comments
Labels: priority:blocker, data-consistency, release-0.14.2

#10505 - [HUDI-7299] BucketIndex table should forbit append mode

Pull Request - State: closed - Opened by xuzifu666 9 months ago - 1 comment

#10504 - [SUPPORT] spark task can not finish when doAppend

Issue - State: open - Opened by KnightChess 9 months ago - 2 comments
Labels: spark

#10503 - [Support] An error occurred while calling o1748.load.\n: java.io.FileNotFoundException

Issue - State: open - Opened by gsudhanshu 9 months ago - 18 comments
Labels: priority:major, behavior-unexpected, py-spark

#10501 - [HUDI-7284] Fix cluster stream sync check

Pull Request - State: closed - Opened by jonvex 9 months ago - 1 comment
Labels: table-service, release-0.14.2

#10500 - [HUDI-7298] Write bad records to error table in more cases instead of failing stream

Pull Request - State: closed - Opened by jonvex 9 months ago - 3 comments

#10499 - [SUPPORT] Hudi DeltaStreamer with Flattening Transformer

Issue - State: closed - Opened by soumilshah1995 9 months ago - 5 comments
Labels: priority:major, hudistreamer

#10497 - [HUDI-7297] Fix ambiguous error message when field type defined in sc…

Pull Request - State: closed - Opened by paul8263 9 months ago - 2 comments

#10494 - [HUDI-7270] Support schema evolution by Flink SQL using HoodieCatalog

Pull Request - State: closed - Opened by beyond1920 9 months ago - 2 comments

#10493 - [HUDI-7291] Pushing Down Partition Pruning Conditions to Column Stats Earlier During Data Skipping

Pull Request - State: closed - Opened by majian1998 9 months ago - 1 comment
Labels: performance, spark, data-skipping

#10492 - [HUDI-7296] Reduce CI Time by Minimizing Duplicate Code Coverage in Tests

Pull Request - State: closed - Opened by jonvex 9 months ago - 3 comments
Labels: release-1.0.0

#10489 - [HUDI-7295]solving the problem of disordered output split in incremental read sc…

Pull Request - State: closed - Opened by empcl 9 months ago - 3 comments
Labels: size:XS

#10483 - Hard deletion using deltastreamer

Issue - State: closed - Opened by Kangho-Lee 9 months ago - 5 comments
Labels: priority:major, on-call-triaged, schema-evolution

#10479 - [HUDI-7290] Don't assume ReplaceCommits are always Clustering

Pull Request - State: open - Opened by jonvex 9 months ago - 3 comments

#10468 - multi-writer jobs wait forever to finish it off (Using OPTIMISTIC_CONCURRENCY_CONTROL)

Issue - State: closed - Opened by SamarthRaval 9 months ago - 5 comments
Labels: priority:minor, concurrency-control

#10466 - If Sanitastiion Enabled In HudiStreamer It is taking too much time

Issue - State: open - Opened by Amar1404 9 months ago - 1 comment
Labels: priority:major, hudistreamer

#10465 - [SUPPORT]Flink writes MOR table, both RO table and RT table read nothing by hive

Issue - State: open - Opened by nicholasxu 9 months ago - 15 comments
Labels: priority:major, hive, on-call-triaged

#10464 - [WIP] [HUDI-6902] Create a dummy PR to trigger tests

Pull Request - State: closed - Opened by linliu-code 9 months ago - 2 comments

#10463 - [DOCS] Add parquet merge schema config

Pull Request - State: closed - Opened by rohitmittapalli 9 months ago

#10460 - [MINOR] Add parallel listing of existing partitions

Pull Request - State: open - Opened by VitoMakarevich 9 months ago - 3 comments

#10458 - [SUPPORT] HUDI baseFile is empty String and this causes IllegalArgumentException

Issue - State: open - Opened by nicholasxu 9 months ago - 4 comments
Labels: docs, priority:major, flink, on-call-triaged, release-0.14.2

#10456 - Partitioning data into two keys is taking more time (10x) than partitioning into one key.

Issue - State: open - Opened by maheshguptags 9 months ago - 27 comments
Labels: performance, priority:critical, flink

#10434 - [SUPPORT] Hope Hudi 0.13. 1 can support Flink 1.17+

Issue - State: closed - Opened by lmhongwei 9 months ago - 3 comments
Labels: priority:minor, feature-enquiry, version-compatibility

#10433 - Improve datadog reporter

Pull Request - State: closed - Opened by parisni 9 months ago - 2 comments
Labels: size:S