Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / databricks/spark-xml issues and pull requests

#686 - Error using from_xml with StructType for schema

Issue - State: closed - Opened by ianepreston 4 months ago - 2 comments

#685 - The problem with the case of words for identical names

Issue - State: open - Opened by hipp0gryph 4 months ago - 3 comments

#684 - Found duplicates, but no duplicates into files

Issue - State: closed - Opened by hipp0gryph 5 months ago - 2 comments

#683 - Empty line between tags when writing xml

Issue - State: closed - Opened by sarg90 5 months ago - 1 comment

#682 - Getting Multi Generator issue while flattening the XML file

Issue - State: closed - Opened by a-sameer18 5 months ago - 1 comment

#680 - Update for 0.18.0, move CICD configs to supported Spark versions

Pull Request - State: closed - Opened by srowen 6 months ago

#679 - Fix for xml expression to not parse arbitrary strings

Pull Request - State: closed - Opened by xanderbailey 6 months ago - 6 comments
Labels: bug

#678 - Wrapping elements of nested array

Issue - State: closed - Opened by vitaliyb-adorama 7 months ago - 1 comment

#677 - NumPartitions == num files. Can I choose partitions manually?

Issue - State: closed - Opened by hipp0gryph 7 months ago - 1 comment

#675 - XSDToSchema fails on choice of sequence

Issue - State: closed - Opened by iWantToKeepAnon 9 months ago - 2 comments

#673 - parse XML without the default AttributePrefix "_" in PySpark

Issue - State: closed - Opened by schneifejan 10 months ago - 2 comments

#672 - Splitting XML into single-column rows

Issue - State: closed - Opened by rjrudin 10 months ago - 9 comments

#671 - Incorrect inferring schema if ignoreNamespace is true and namespace = tag

Issue - State: closed - Opened by hipp0gryph 10 months ago - 4 comments

#670 - ignoreSurroundingSpaces not working - Pyspark

Issue - State: closed - Opened by DeemoONeill 10 months ago - 8 comments

#669 - Convert xml to dataframe based on pyspark - using rowValidationXSDPath

Issue - State: closed - Opened by yu-tracy 11 months ago - 4 comments

#668 - Extract multiple tables from the same XML file

Issue - State: closed - Opened by vwiencek 12 months ago - 1 comment

#667 - Reading file with ContentType application/octet-stream

Issue - State: closed - Opened by mahmoud-masmoudi-dev 12 months ago - 3 comments

#666 - Azure Synapse Spark 3.3 Runtime : spark-xml fails on writing xml

Issue - State: closed - Opened by thinh-ngu 12 months ago - 1 comment

#665 - Use defined timezone on write for formats that need TZ info

Pull Request - State: closed - Opened by srowen 12 months ago
Labels: bug

#664 - Generated files does not have .xml extension

Issue - State: closed - Opened by dolfinus 12 months ago - 2 comments

#663 - Cannot write dataframe with custom timestampFormat

Issue - State: closed - Opened by dolfinus 12 months ago - 7 comments
Labels: bug

#662 - Timestamps not matching format are replaced with nulls

Issue - State: closed - Opened by dolfinus 12 months ago - 2 comments

#661 - Failed to find data source: xml.

Issue - State: closed - Opened by luisenriqueramos1977 about 1 year ago - 3 comments

#660 - Shortcut common type inference cases to fail fast, speed up inference

Pull Request - State: closed - Opened by srowen about 1 year ago - 1 comment
Labels: enhancement

#659 - Update to test vs Spark 3.4, and tested Spark/Scala/Java configs

Pull Request - State: closed - Opened by srowen about 1 year ago
Labels: enhancement

#658 - Vulnerabilities from dependencies: CVE-2023-22946

Issue - State: closed - Opened by sasauz about 1 year ago - 1 comment

#656 - Problem with extra line breaks inside tags during writing XML file

Issue - State: closed - Opened by VladIsLuve about 1 year ago - 1 comment

#655 - Problem with reading cp1251 file

Issue - State: closed - Opened by VladIsLuve about 1 year ago - 7 comments

#654 - Note plan to merge spark-xml to Apache Spark 4.0

Pull Request - State: closed - Opened by srowen about 1 year ago

#653 - Document that spark-xml is in maintenance mode

Issue - State: closed - Opened by HyukjinKwon about 1 year ago
Labels: enhancement

#652 - strange tag while writing xml with nullValue

Issue - State: closed - Opened by groneveld about 1 year ago - 8 comments

#650 - spark.sql.session.timeZone not taken into account while reading XML

Issue - State: closed - Opened by BaptistePiron about 1 year ago - 3 comments

#648 - "hidden" _metadata column is not identifying for the XML input file format

Issue - State: closed - Opened by ChackoSmitha over 1 year ago - 3 comments

#647 - Can't import XML file

Issue - State: closed - Opened by sanyam-dev over 1 year ago - 1 comment

#646 - Using spark-xml to parse nested xml structure in jupyter notebook

Issue - State: closed - Opened by Xabitsuki over 1 year ago - 2 comments

#645 - Reader can't read XML file if the rootTag and rowTag are the same

Issue - State: closed - Opened by irajhedayati over 1 year ago - 6 comments

#644 - Disallow strings ending in D or F as doubles when inferring schema

Pull Request - State: closed - Opened by srowen over 1 year ago
Labels: bug

#643 - Schema for stringvalue not inferred correctly

Issue - State: closed - Opened by ShubhamG25 over 1 year ago - 2 comments
Labels: bug

#642 - fs.azure.account.key error when reading files from Azure and OAuth

Issue - State: closed - Opened by DragonEnergy over 1 year ago - 2 comments

#640 - EMRServerless

Issue - State: closed - Opened by akash1302 over 1 year ago - 1 comment

#639 - ignoreCorruptFiles and GZIP corrupted xml files

Issue - State: closed - Opened by slavokx over 1 year ago - 3 comments

#638 - In XSD, handle frac digits for decimal types. Also try to support custom type declarations in the XSD

Pull Request - State: closed - Opened by srowen over 1 year ago
Labels: enhancement

#637 - Restore functionality of ignoreSurroundingSpaces when field is a simple string

Pull Request - State: closed - Opened by srowen over 1 year ago
Labels: bug

#636 - ignoreSurroundingSpaces is not working after upgrading to version 0.16.0

Issue - State: closed - Opened by irajhedayati over 1 year ago - 1 comment
Labels: bug

#635 - Allow for top-level simple type declarations in XSD, referenced by name, when parsing XSD as schema

Issue - State: closed - Opened by srowen over 1 year ago - 1 comment
Labels: enhancement

#634 - Search recursively with xml

Issue - State: open - Opened by DanialP over 1 year ago - 2 comments

#634 - Search recursively with xml

Issue - State: closed - Opened by DanialP over 1 year ago - 2 comments

#633 - rowValidationXSDPath schema does not enforce data types from Int to String

Issue - State: closed - Opened by Yanis77240 over 1 year ago - 4 comments

#632 - Issue with scala: java.lang.NoClassDefFoundError: scala/$less$colon$less

Issue - State: closed - Opened by coperator over 1 year ago - 4 comments

#631 - Parse complexContent with extension element

Pull Request - State: closed - Opened by shuch3ng over 1 year ago
Labels: enhancement

#629 - Initial pass at supports 'paths' data source option with multiple file paths

Pull Request - State: closed - Opened by srowen over 1 year ago - 4 comments
Labels: enhancement

#628 - read multiple XML file. and get file name as metadata

Issue - State: closed - Opened by writetoarun over 1 year ago - 2 comments

#627 - when used along with select("*", "_metadata") error

Issue - State: closed - Opened by writetoarun over 1 year ago - 2 comments

#626 - decimalCannotGreaterThanPrecisionError with DecimalType() and value = 0.0X

Issue - State: closed - Opened by marcuskw over 1 year ago - 1 comment

#624 - Draft change to add custom timestamp format timezone support

Pull Request - State: closed - Opened by srowen almost 2 years ago

#623 - Handle decimals with scale greater than precision

Pull Request - State: closed - Opened by srowen almost 2 years ago - 4 comments
Labels: bug

#622 - DecimalType parsing fails on some values

Issue - State: closed - Opened by agolovenko almost 2 years ago - 2 comments

#621 - Allow custom timestamp with Spark timezone property

Pull Request - State: closed - Opened by JorisTruong almost 2 years ago - 4 comments
Labels: enhancement

#619 - Parse ref attribute in XSD element

Pull Request - State: closed - Opened by shuch3ng almost 2 years ago
Labels: enhancement

#618 - Allow xpath for rowTag

Issue - State: closed - Opened by singlewind almost 2 years ago - 3 comments

#617 - ref attribute in XSDToSchema

Issue - State: closed - Opened by shuch3ng almost 2 years ago - 7 comments

#616 - feat: added timeZone option

Pull Request - State: closed - Opened by JorisTruong almost 2 years ago

#615 - Upgrade SBT

Pull Request - State: closed - Opened by ganeshchand almost 2 years ago

#614 - Text fields with embedded tags

Issue - State: closed - Opened by kornel-at-swoop almost 2 years ago - 9 comments
Labels: bug

#613 - Upgrade SBT version

Issue - State: closed - Opened by ganeshchand almost 2 years ago - 3 comments

#612 - XML Timestamp parsing without timezone

Issue - State: closed - Opened by JorisTruong almost 2 years ago - 1 comment
Labels: enhancement

#611 - Write an xml element value with escape characters as it is in the input text

Issue - State: closed - Opened by DipeshV almost 2 years ago - 5 comments

#610 - schema not respected when reading multiple xml files

Issue - State: closed - Opened by JohnStokes228 almost 2 years ago - 7 comments

#609 - facing error while writing the dataframe to xml file in local.

Issue - State: closed - Opened by ParvezAlam11 almost 2 years ago - 3 comments

#608 - XML parser behaves differently for StringType field when custom schema is used

Issue - State: closed - Opened by atomobianco almost 2 years ago - 1 comment

#605 - spark.read.format("xml").load(path) does not handle URIs with a comma (,)

Issue - State: closed - Opened by embrike about 2 years ago - 4 comments

#604 - DEPRECATED treatEmptyValueAsNulls works/suggested nullValue set to "" does not

Issue - State: closed - Opened by clemj21 about 2 years ago - 3 comments

#603 - Add arrayElementName option

Pull Request - State: closed - Opened by srowen about 2 years ago
Labels: enhancement

#602 - Can "item" of ArrayType be renamed via an option when writing an XML file?

Issue - State: closed - Opened by giuseppeceravolo about 2 years ago - 12 comments

#600 - Task failed while writing rows.

Issue - State: closed - Opened by xifan987 about 2 years ago - 2 comments

#599 - Spark-xml not running on Databricks Runtime 11.0

Issue - State: closed - Opened by TheDataDexter about 2 years ago - 3 comments

#598 - Misc: remove Experimental tags, update build to 0.16.0; add Spark 3.3 CI/CD

Pull Request - State: closed - Opened by srowen about 2 years ago

#597 - rowtag not recognised when using ext_from_xml

Issue - State: closed - Opened by charlottevdscheun about 2 years ago - 2 comments

#596 - [Clean Up] Remove some duplicated code with Spark and use the ones directly from Spark Repo

Pull Request - State: closed - Opened by ericsun95 about 2 years ago
Labels: enhancement

#595 - Rationalize logging of record, exception in error cases

Pull Request - State: closed - Opened by srowen about 2 years ago - 1 comment
Labels: enhancement