Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / apache/hudi issues and pull requests
#10580 - [SUPPORT] Querying Hudi tables with Spark+Velox(C++), ObjectSizeCalculator.getObjectSize hangs causing about a 50-second delay in queries
Issue -
State: open - Opened by majian1998 8 months ago
- 6 comments
Labels: performance, priority:major
#10579 - [HUDI-7355] Empty commit should be enbale default to avoid Timeout Exception
Pull Request -
State: closed - Opened by xuzifu666 8 months ago
- 1 comment
#10578 - Bloom improvements
Pull Request -
State: open - Opened by the-other-tim-brown 8 months ago
- 2 comments
#10577 - [HUDI-6868] Support extracting passwords from credential store for Hive Sync
Pull Request -
State: closed - Opened by ad1happy2go 8 months ago
- 1 comment
#10576 - [BUG] Failure Encountered When Reading Hudi with Flink in Batch Runtime Mode and FlinkOptions.READ_AS_STREAMING=false
Issue -
State: open - Opened by ailinzhou 8 months ago
- 3 comments
Labels: priority:major, flink
#10575 - [HUDI-7347] Introduce SeekableDataInputStream for random access
Pull Request -
State: closed - Opened by yihua 8 months ago
- 1 comment
Labels: release-1.0.0
#10574 - [HUDI-7346] Remove usage of org.apache.hadoop.hbase.util.Bytes
Pull Request -
State: closed - Opened by yihua 8 months ago
- 1 comment
Labels: release-1.0.0
#10573 - [HUDI-7344] Use Java {Input/Output}Stream instead of FSData{Input/Output}Stream when possible
Pull Request -
State: closed - Opened by yihua 8 months ago
- 1 comment
Labels: release-1.0.0
#10572 - [HUDI-7351] Hive-sync partition pushdown does not work with glue
Pull Request -
State: closed - Opened by parisni 8 months ago
- 2 comments
Labels: aws-support
#10571 - [HUDI-7345] Remove usage of org.apache.hadoop.util.VersionUtil
Pull Request -
State: closed - Opened by yihua 8 months ago
- 1 comment
Labels: release-1.0.0
#10570 - [HUDI-7343] Replace Path.SEPARATOR with HoodieLocation.SEPARATOR
Pull Request -
State: closed - Opened by yihua 8 months ago
- 2 comments
Labels: release-1.0.0
#10569 - [SUPPORT] feature hive sync partition push down fails with glue
Issue -
State: closed - Opened by parisni 8 months ago
#10568 - [HUDI-7342] Use BaseFileUtils to hide format-specific logic in HoodiePartitionMetadata
Pull Request -
State: closed - Opened by yihua 8 months ago
- 1 comment
Labels: release-1.0.0
#10567 - [HUDI-7336] Introduce new HoodieStorage abstraction
Pull Request -
State: closed - Opened by yihua 8 months ago
- 4 comments
Labels: release-1.0.0
#10566 - [SUPPORT] Hudi CLI bundle not working
Issue -
State: open - Opened by CTTY 8 months ago
- 1 comment
Labels: priority:major, cli, table-service
#10565 - [HUDI-7337] Implement MetricsReporter that reports metrics to M3
Pull Request -
State: open - Opened by kbuci 8 months ago
- 1 comment
Labels: release-0.14.2
#10564 - [HUDI-7335] Create hudi-hadoop-common for hadoop-specific implementation
Pull Request -
State: closed - Opened by yihua 8 months ago
- 1 comment
Labels: release-1.0.0
#10563 - added new videos for hudi oss site
Pull Request -
State: closed - Opened by nfarah86 8 months ago
- 5 comments
#10562 - [MINOR] add logger to CompactionPlanOperator & ClusteringPlanOperator
Pull Request -
State: closed - Opened by eric9204 8 months ago
- 1 comment
#10561 - Issue with reading the debezium inputs
Issue -
State: open - Opened by zyperd 8 months ago
- 4 comments
Labels: priority:major, hudistreamer, on-call-triaged
#10560 - new hudi content for 01-2024
Pull Request -
State: closed - Opened by nfarah86 8 months ago
#10559 - Hudi behaviour if AWS Glue concurrency is triggered[SUPPORT]
Issue -
State: open - Opened by rishabhreply 8 months ago
- 10 comments
Labels: priority:minor, on-call-triaged, concurrency-control
#10558 - [SUPPORT] After upgrading hudi 0.14.1, use Spark SQL merge into to update the matched_action, the case of the column name and the expression name does not match, resulting in an exception.
Issue -
State: open - Opened by yihao-tcf 8 months ago
- 8 comments
Labels: priority:blocker, spark-sql, on-call-triaged, release-0.14.2
#10557 - updated button size so join now is on one line
Pull Request -
State: closed - Opened by nfarah86 8 months ago
#10556 - [HUDI-7327] remove meta cols from incoming schema in stream sync
Pull Request -
State: closed - Opened by jonvex 8 months ago
- 3 comments
#10555 - [SUPPORT] Error Category: UNCLASSIFIED_ERROR; An error occurred while calling o230.save. Parquet/Avro schema mismatch: Avro field 'id' not found
Issue -
State: open - Opened by jayesh2424 8 months ago
- 7 comments
Labels: schema-and-data-types, py-spark
#10554 - [MINOR] Fix UT error in HUDI-6941 with stage task numbers
Pull Request -
State: closed - Opened by xuzifu666 8 months ago
- 1 comment
#10553 - [SUPPORT] Cannot create a hudi table that has a column starting with a digit
Issue -
State: closed - Opened by Mourya1319 8 months ago
- 9 comments
Labels: schema-and-data-types, priority:minor, spark-sql, on-call-triaged
#10552 - [HUDI-6902] Detect the cause of orphan processes_2
Pull Request -
State: closed - Opened by linliu-code 8 months ago
- 2 comments
#10551 - [HUDI-7334] Remove EMBEDDED_KV_STORE based FSV usage in tests
Pull Request -
State: closed - Opened by linliu-code 8 months ago
- 5 comments
#10550 - [MINOR] Splitting UT Spark DataSource into two
Pull Request -
State: open - Opened by vinothchandar 8 months ago
- 2 comments
#10549 - [HUDI-7323] Use a schema supplier instead of a static value
Pull Request -
State: closed - Opened by the-other-tim-brown 8 months ago
- 1 comment
Labels: schema-and-data-types, hudistreamer
#10548 - [HUDI-6902] Remove hdfstestservice usages
Pull Request -
State: open - Opened by linliu-code 8 months ago
- 1 comment
#10547 - [MINOR] Reduce UT spark-datasource test times
Pull Request -
State: closed - Opened by vinothchandar 8 months ago
- 2 comments
#10546 - [HUDI-7318] HoodieTableMetaClient need support a config to user for whether to checkTableValidity
Pull Request -
State: closed - Opened by xuzifu666 8 months ago
- 2 comments
#10545 - [BUG]Data duplication, multiple data primary keys are duplicated
Issue -
State: closed - Opened by waywtdcc 8 months ago
- 4 comments
Labels: priority:minor, data-duplication
#10544 - [SUPPORT]When I write HUDI with Flink use asynchronous compaction, I use the --service parameter, but I run into a problem
Issue -
State: open - Opened by LIKE-HUB 8 months ago
- 2 comments
Labels: priority:major, flink, table-service
#10543 - [SUPPORT]When I write HUDI with Flink use asynchronous compaction, I use the --service parameter, but I run into a problem
Issue -
State: closed - Opened by LIKE-HUB 8 months ago
- 1 comment
#10542 - [SUPPORT] Dataloss in FlinkCDC into Hudi without any exception or other infomation
Issue -
State: open - Opened by xuzifu666 8 months ago
- 10 comments
Labels: flink, data-loss, change-data-capture
#10541 - [HUDI-7317] FlinkTableFactory snatifyCheck should contains index type
Pull Request -
State: closed - Opened by xuzifu666 9 months ago
- 1 comment
#10540 - [HUDI-7316] Update AbstractHoodieLogRecordReader (and implementation) to accept already-constructed HoodieTableMetaClient and update Spark engine callers to pass meta client directly
Pull Request -
State: closed - Opened by kbuci 9 months ago
- 1 comment
Labels: performance, metadata
#10539 - [SUPPORT] Flink streaming read MOR table, thrown Unexpected cdc file split infer case: LOG_FILE Exception
Issue -
State: closed - Opened by nicholasxu 9 months ago
- 7 comments
Labels: priority:major, flink
#10538 - [SUPPORT] Multi Writing unable to acquire lock by using flink with diffrent LockProvider
Issue -
State: open - Opened by hanson2021 9 months ago
- 2 comments
Labels: priority:major, concurrency-control
#10537 - [HUDI-7315] Disable constructing NOT filter predicate when pushing do…
Pull Request -
State: closed - Opened by paul8263 9 months ago
- 1 comment
#10536 - [HUDI-7314] Hudi Create table support index type check
Pull Request -
State: closed - Opened by xuzifu666 9 months ago
- 2 comments
#10535 - initial commit and added applied intuition and penn entertainment to …
Pull Request -
State: closed - Opened by nfarah86 9 months ago
- 1 comment
#10534 - [HUDI-6902] Take a dump at hadoop-mr-java-client module
Pull Request -
State: closed - Opened by linliu-code 9 months ago
- 2 comments
#10533 - [SUPPORT] Spark readStream fails with [COLUMN_ALREADY_EXISTS] for streaming tables created with "hoodie.schema.on.read.enable" & "hoodie.datasource.write.reconcile.schema" enabled
Issue -
State: open - Opened by imonteroq 9 months ago
- 5 comments
Labels: priority:major, on-call-triaged, reader-core, schema-evolution, release-0.14.2
#10532 - [HUDI-7277] fix `hoodie.bulkinsert.shuffle.parallelism` not activated…
Pull Request -
State: closed - Opened by KnightChess 9 months ago
- 2 comments
Labels: spark
#10531 - [HUDI-7311] Add implicit literal type conversion before filter push down
Pull Request -
State: closed - Opened by paul8263 9 months ago
- 5 comments
Labels: schema-and-data-types, flink-sql
#10530 - [HUDI-7312] Spark3ParsePartitionUtil support inferPartitionColumnValue with all unnest type
Pull Request -
State: closed - Opened by xuzifu666 9 months ago
- 1 comment
#10529 - initial commit for web banner
Pull Request -
State: closed - Opened by nfarah86 9 months ago
- 2 comments
#10528 - [HUDI-7310] Optimize Column Stats Partition Pruning for Non-Partition Pruning Queries
Pull Request -
State: closed - Opened by majian1998 9 months ago
- 1 comment
#10527 - [MINOR] Added descriptive exception if column present in required avro schema does not exist in hudi table
Pull Request -
State: closed - Opened by prathit06 9 months ago
- 1 comment
#10526 - [MINOR] Fix flaky testMultiWriterWithAsyncTableServicesWithConflict test
Pull Request -
State: open - Opened by Zouxxyy 9 months ago
- 1 comment
Labels: test-stability
#10525 - [HUDI-6902] Try container for Azure tests
Pull Request -
State: closed - Opened by linliu-code 9 months ago
- 4 comments
#10524 - [HUDI-7309] Disable constructing AND & OR filter predicates when filt…
Pull Request -
State: closed - Opened by paul8263 9 months ago
- 1 comment
#10523 - [HUDI-7308] LockManager::unlock should not call updateLockHeldTimerMetrics if lockDurationTimer has not been started
Pull Request -
State: closed - Opened by kbuci 9 months ago
- 6 comments
Labels: metrics, stability
#10522 - Update docker_demo.md
Pull Request -
State: closed - Opened by DanRoscigno 9 months ago
#10521 - [DOCS] Diagram Change for File Layout Page
Pull Request -
State: closed - Opened by dipankarmazumdar 9 months ago
#10520 - [HUDI-6902] Shutdown metric hooks properly
Pull Request -
State: closed - Opened by linliu-code 9 months ago
- 1 comment
#10519 - [SUPPORT] - FileNotFound Exception while using hoodie.datasource.read.incr.fallback.fulltablescan.enable=true
Issue -
State: open - Opened by remeajayi2022 9 months ago
- 4 comments
Labels: spark, incremental-etl
#10518 - [HUDI-7305] Fix cast exception for byte/short/float partitioned field
Pull Request -
State: closed - Opened by stream2000 9 months ago
- 1 comment
Labels: schema-and-data-types, spark
#10517 - [HUDI-7303] Fix date field type unexpectedly convert to Long when usi…
Pull Request -
State: closed - Opened by paul8263 9 months ago
- 4 comments
Labels: schema-and-data-types, flink
#10516 - [HUDI-7304] Change DataSourceInternalWriterHelper::onDataWriterCommit LOG level avoid large mertric messages
Pull Request -
State: closed - Opened by xuzifu666 9 months ago
- 2 comments
#10515 - [HUDI-7302] Consistent hashing row writer support sorting
Pull Request -
State: closed - Opened by stream2000 9 months ago
- 5 comments
#10514 - [MINOR] Revert "[MINOR] Handle parsing of all zero timestamps with MDT suffixes."
Pull Request -
State: closed - Opened by linliu-code 9 months ago
- 1 comment
#10513 - [HUDI-6902] Fix a unit test
Pull Request -
State: closed - Opened by linliu-code 9 months ago
- 1 comment
#10512 - [HUDI-6902] Containerize the Azure CI
Pull Request -
State: open - Opened by linliu-code 9 months ago
- 20 comments
#10511 - [SUPPORT] MOR hudi 0.14, Bloom Filters are not being used on query time
Issue -
State: open - Opened by bk-mz 9 months ago
- 21 comments
Labels: performance, priority:major, index, metadata
#10509 - Fix complex partition generator
Pull Request -
State: closed - Opened by parisni 9 months ago
- 2 comments
#10508 - [SUPPORT] Migration partitionned table with complex key generator to 0.14.1 leads to duplicates when recordkey length =1
Issue -
State: open - Opened by parisni 9 months ago
- 6 comments
Labels: priority:blocker, data-consistency, release-0.14.2
#10507 - [SUPPORT] Hudi Record Index not working as Expected: gives warning as "WARN SparkMetadataTableRecordIndex: Record index not initialized so falling back to GLOBAL_SIMPLE for tagging records"
Issue -
State: open - Opened by zeeshan-media 9 months ago
- 9 comments
Labels: priority:major, index, metadata, on-call-triaged
#10505 - [HUDI-7299] BucketIndex table should forbit append mode
Pull Request -
State: closed - Opened by xuzifu666 9 months ago
- 1 comment
#10504 - [SUPPORT] spark task can not finish when doAppend
Issue -
State: open - Opened by KnightChess 9 months ago
- 2 comments
Labels: spark
#10503 - [Support] An error occurred while calling o1748.load.\n: java.io.FileNotFoundException
Issue -
State: open - Opened by gsudhanshu 9 months ago
- 18 comments
Labels: priority:major, behavior-unexpected, py-spark
#10502 - [SUPPORT]HoodieIOException:Unable to create /data13/hadoop/yam/local/usercache/D4046B698E12F32FEF9F968718893E9D/appcache/applicatfon_1701998877139_1409559/hudi- BITCASK-8549d240-78a0-43af-a30c-0ef6d6327cd0
Issue -
State: open - Opened by cbg-wx 9 months ago
- 3 comments
#10501 - [HUDI-7284] Fix cluster stream sync check
Pull Request -
State: closed - Opened by jonvex 9 months ago
- 1 comment
Labels: table-service, release-0.14.2
#10500 - [HUDI-7298] Write bad records to error table in more cases instead of failing stream
Pull Request -
State: closed - Opened by jonvex 9 months ago
- 3 comments
#10499 - [SUPPORT] Hudi DeltaStreamer with Flattening Transformer
Issue -
State: closed - Opened by soumilshah1995 9 months ago
- 5 comments
Labels: priority:major, hudistreamer
#10497 - [HUDI-7297] Fix ambiguous error message when field type defined in sc…
Pull Request -
State: closed - Opened by paul8263 9 months ago
- 2 comments
#10494 - [HUDI-7270] Support schema evolution by Flink SQL using HoodieCatalog
Pull Request -
State: closed - Opened by beyond1920 9 months ago
- 2 comments
#10493 - [HUDI-7291] Pushing Down Partition Pruning Conditions to Column Stats Earlier During Data Skipping
Pull Request -
State: closed - Opened by majian1998 9 months ago
- 1 comment
Labels: performance, spark, data-skipping
#10492 - [HUDI-7296] Reduce CI Time by Minimizing Duplicate Code Coverage in Tests
Pull Request -
State: closed - Opened by jonvex 9 months ago
- 3 comments
Labels: release-1.0.0
#10489 - [HUDI-7295]solving the problem of disordered output split in incremental read sc…
Pull Request -
State: closed - Opened by empcl 9 months ago
- 3 comments
Labels: size:XS
#10486 - [SUPPORT] Flink write to COW Hudi table,hive aggregate query results has duplicate data but select * did not
Issue -
State: closed - Opened by CamelliaYjli 9 months ago
- 11 comments
Labels: hive, flink
#10484 - [SUPPORT] ClassCastException when upsert COW table with RECORD_INDEX index type
Issue -
State: closed - Opened by lei-su-awx 9 months ago
- 3 comments
#10483 - Hard deletion using deltastreamer
Issue -
State: closed - Opened by Kangho-Lee 9 months ago
- 5 comments
Labels: priority:major, on-call-triaged, schema-evolution
#10479 - [HUDI-7290] Don't assume ReplaceCommits are always Clustering
Pull Request -
State: open - Opened by jonvex 9 months ago
- 3 comments
#10468 - multi-writer jobs wait forever to finish it off (Using OPTIMISTIC_CONCURRENCY_CONTROL)
Issue -
State: closed - Opened by SamarthRaval 9 months ago
- 5 comments
Labels: priority:minor, concurrency-control
#10466 - If Sanitastiion Enabled In HudiStreamer It is taking too much time
Issue -
State: open - Opened by Amar1404 9 months ago
- 1 comment
Labels: priority:major, hudistreamer
#10465 - [SUPPORT]Flink writes MOR table, both RO table and RT table read nothing by hive
Issue -
State: open - Opened by nicholasxu 9 months ago
- 15 comments
Labels: priority:major, hive, on-call-triaged
#10464 - [WIP] [HUDI-6902] Create a dummy PR to trigger tests
Pull Request -
State: closed - Opened by linliu-code 9 months ago
- 2 comments
#10463 - [DOCS] Add parquet merge schema config
Pull Request -
State: closed - Opened by rohitmittapalli 9 months ago
#10460 - [MINOR] Add parallel listing of existing partitions
Pull Request -
State: open - Opened by VitoMakarevich 9 months ago
- 3 comments
#10458 - [SUPPORT] HUDI baseFile is empty String and this causes IllegalArgumentException
Issue -
State: open - Opened by nicholasxu 9 months ago
- 4 comments
Labels: docs, priority:major, flink, on-call-triaged, release-0.14.2
#10456 - Partitioning data into two keys is taking more time (10x) than partitioning into one key.
Issue -
State: open - Opened by maheshguptags 9 months ago
- 27 comments
Labels: performance, priority:critical, flink
#10449 - [MINOR] Disable org.apache.hudi.TestHoodieSparkSqlWriter#testInsertDatasetWithTimelineTimezoneUTC
Pull Request -
State: closed - Opened by jonvex 9 months ago
- 2 comments
#10434 - [SUPPORT] Hope Hudi 0.13. 1 can support Flink 1.17+
Issue -
State: closed - Opened by lmhongwei 9 months ago
- 3 comments
Labels: priority:minor, feature-enquiry, version-compatibility
#10433 - Improve datadog reporter
Pull Request -
State: closed - Opened by parisni 9 months ago
- 2 comments
Labels: size:S