Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / databricks/spark-xml issues and pull requests
#686 - Error using from_xml with StructType for schema
Issue -
State: closed - Opened by ianepreston 4 months ago
- 2 comments
#685 - The problem with the case of words for identical names
Issue -
State: open - Opened by hipp0gryph 4 months ago
- 3 comments
#684 - Found duplicates, but no duplicates into files
Issue -
State: closed - Opened by hipp0gryph 5 months ago
- 2 comments
#683 - Empty line between tags when writing xml
Issue -
State: closed - Opened by sarg90 5 months ago
- 1 comment
#682 - Getting Multi Generator issue while flattening the XML file
Issue -
State: closed - Opened by a-sameer18 5 months ago
- 1 comment
#681 - Caused by: java.io.InvalidClassException: com.databricks.spark.xml.XmlOptions; local class incompatible
Issue -
State: open - Opened by gc-avanade 6 months ago
- 2 comments
#680 - Update for 0.18.0, move CICD configs to supported Spark versions
Pull Request -
State: closed - Opened by srowen 6 months ago
#679 - Fix for xml expression to not parse arbitrary strings
Pull Request -
State: closed - Opened by xanderbailey 6 months ago
- 6 comments
Labels: bug
#678 - Wrapping elements of nested array
Issue -
State: closed - Opened by vitaliyb-adorama 7 months ago
- 1 comment
#677 - NumPartitions == num files. Can I choose partitions manually?
Issue -
State: closed - Opened by hipp0gryph 7 months ago
- 1 comment
#676 - Remove New Line coming in between records during spark write dataframe to XML
Issue -
State: closed - Opened by avinashpandu 8 months ago
- 5 comments
#675 - XSDToSchema fails on choice of sequence
Issue -
State: closed - Opened by iWantToKeepAnon 9 months ago
- 2 comments
#674 - Add notes about file extensions and _corrupt_record to documentation
Pull Request -
State: closed - Opened by dolfinus 10 months ago
#673 - parse XML without the default AttributePrefix "_" in PySpark
Issue -
State: closed - Opened by schneifejan 10 months ago
- 2 comments
#672 - Splitting XML into single-column rows
Issue -
State: closed - Opened by rjrudin 10 months ago
- 9 comments
#671 - Incorrect inferring schema if ignoreNamespace is true and namespace = tag
Issue -
State: closed - Opened by hipp0gryph 10 months ago
- 4 comments
#670 - ignoreSurroundingSpaces not working - Pyspark
Issue -
State: closed - Opened by DeemoONeill 10 months ago
- 8 comments
#669 - Convert xml to dataframe based on pyspark - using rowValidationXSDPath
Issue -
State: closed - Opened by yu-tracy 11 months ago
- 4 comments
#668 - Extract multiple tables from the same XML file
Issue -
State: closed - Opened by vwiencek 12 months ago
- 1 comment
#667 - Reading file with ContentType application/octet-stream
Issue -
State: closed - Opened by mahmoud-masmoudi-dev 12 months ago
- 3 comments
#666 - Azure Synapse Spark 3.3 Runtime : spark-xml fails on writing xml
Issue -
State: closed - Opened by thinh-ngu 12 months ago
- 1 comment
#665 - Use defined timezone on write for formats that need TZ info
Pull Request -
State: closed - Opened by srowen 12 months ago
Labels: bug
#664 - Generated files does not have .xml extension
Issue -
State: closed - Opened by dolfinus 12 months ago
- 2 comments
#663 - Cannot write dataframe with custom timestampFormat
Issue -
State: closed - Opened by dolfinus 12 months ago
- 7 comments
Labels: bug
#662 - Timestamps not matching format are replaced with nulls
Issue -
State: closed - Opened by dolfinus 12 months ago
- 2 comments
#661 - Failed to find data source: xml.
Issue -
State: closed - Opened by luisenriqueramos1977 about 1 year ago
- 3 comments
#660 - Shortcut common type inference cases to fail fast, speed up inference
Pull Request -
State: closed - Opened by srowen about 1 year ago
- 1 comment
Labels: enhancement
#659 - Update to test vs Spark 3.4, and tested Spark/Scala/Java configs
Pull Request -
State: closed - Opened by srowen about 1 year ago
Labels: enhancement
#658 - Vulnerabilities from dependencies: CVE-2023-22946
Issue -
State: closed - Opened by sasauz about 1 year ago
- 1 comment
#657 - restrict access in hive meta store tables with Unity Catalog single user cluster
Issue -
State: closed - Opened by ChackoSmitha about 1 year ago
- 2 comments
#656 - Problem with extra line breaks inside tags during writing XML file
Issue -
State: closed - Opened by VladIsLuve about 1 year ago
- 1 comment
#655 - Problem with reading cp1251 file
Issue -
State: closed - Opened by VladIsLuve about 1 year ago
- 7 comments
#654 - Note plan to merge spark-xml to Apache Spark 4.0
Pull Request -
State: closed - Opened by srowen about 1 year ago
#653 - Document that spark-xml is in maintenance mode
Issue -
State: closed - Opened by HyukjinKwon about 1 year ago
Labels: enhancement
#652 - strange tag while writing xml with nullValue
Issue -
State: closed - Opened by groneveld about 1 year ago
- 8 comments
#651 - Attribute values of nested fields are lost if option "attributePrefix" has empty value
Issue -
State: closed - Opened by voban about 1 year ago
- 3 comments
#650 - spark.sql.session.timeZone not taken into account while reading XML
Issue -
State: closed - Opened by BaptistePiron about 1 year ago
- 3 comments
#648 - "hidden" _metadata column is not identifying for the XML input file format
Issue -
State: closed - Opened by ChackoSmitha over 1 year ago
- 3 comments
#647 - Can't import XML file
Issue -
State: closed - Opened by sanyam-dev over 1 year ago
- 1 comment
#646 - Using spark-xml to parse nested xml structure in jupyter notebook
Issue -
State: closed - Opened by Xabitsuki over 1 year ago
- 2 comments
#645 - Reader can't read XML file if the rootTag and rowTag are the same
Issue -
State: closed - Opened by irajhedayati over 1 year ago
- 6 comments
#644 - Disallow strings ending in D or F as doubles when inferring schema
Pull Request -
State: closed - Opened by srowen over 1 year ago
Labels: bug
#643 - Schema for stringvalue not inferred correctly
Issue -
State: closed - Opened by ShubhamG25 over 1 year ago
- 2 comments
Labels: bug
#642 - fs.azure.account.key error when reading files from Azure and OAuth
Issue -
State: closed - Opened by DragonEnergy over 1 year ago
- 2 comments
#640 - EMRServerless
Issue -
State: closed - Opened by akash1302 over 1 year ago
- 1 comment
#639 - ignoreCorruptFiles and GZIP corrupted xml files
Issue -
State: closed - Opened by slavokx over 1 year ago
- 3 comments
#638 - In XSD, handle frac digits for decimal types. Also try to support custom type declarations in the XSD
Pull Request -
State: closed - Opened by srowen over 1 year ago
Labels: enhancement
#637 - Restore functionality of ignoreSurroundingSpaces when field is a simple string
Pull Request -
State: closed - Opened by srowen over 1 year ago
Labels: bug
#636 - ignoreSurroundingSpaces is not working after upgrading to version 0.16.0
Issue -
State: closed - Opened by irajhedayati over 1 year ago
- 1 comment
Labels: bug
#635 - Allow for top-level simple type declarations in XSD, referenced by name, when parsing XSD as schema
Issue -
State: closed - Opened by srowen over 1 year ago
- 1 comment
Labels: enhancement
#634 - Search recursively with xml
Issue -
State: open - Opened by DanialP over 1 year ago
- 2 comments
#634 - Search recursively with xml
Issue -
State: closed - Opened by DanialP over 1 year ago
- 2 comments
#633 - rowValidationXSDPath schema does not enforce data types from Int to String
Issue -
State: closed - Opened by Yanis77240 over 1 year ago
- 4 comments
#632 - Issue with scala: java.lang.NoClassDefFoundError: scala/$less$colon$less
Issue -
State: closed - Opened by coperator over 1 year ago
- 4 comments
#631 - Parse complexContent with extension element
Pull Request -
State: closed - Opened by shuch3ng over 1 year ago
Labels: enhancement
#630 - org.xml.sax.SAXParseException: Current configuration of the parser doesn't allow a maxOccurs attribute value to be set greater than the value 5,000.
Issue -
State: closed - Opened by aditi-kumari-singh over 1 year ago
- 3 comments
#629 - Initial pass at supports 'paths' data source option with multiple file paths
Pull Request -
State: closed - Opened by srowen over 1 year ago
- 4 comments
Labels: enhancement
#628 - read multiple XML file. and get file name as metadata
Issue -
State: closed - Opened by writetoarun over 1 year ago
- 2 comments
#628 - read multiple XML file. and get file name as metadata
Issue -
State: closed - Opened by writetoarun over 1 year ago
- 2 comments
#627 - when used along with select("*", "_metadata") error
Issue -
State: closed - Opened by writetoarun over 1 year ago
- 2 comments
#626 - decimalCannotGreaterThanPrecisionError with DecimalType() and value = 0.0X
Issue -
State: closed - Opened by marcuskw over 1 year ago
- 1 comment
#625 - Simplify handling of complex mixed elements when schema says it is a string; should always just repeat the content as string
Pull Request -
State: closed - Opened by srowen over 1 year ago
Labels: bug
#624 - Draft change to add custom timestamp format timezone support
Pull Request -
State: closed - Opened by srowen almost 2 years ago
#624 - Draft change to add custom timestamp format timezone support
Pull Request -
State: closed - Opened by srowen almost 2 years ago
#623 - Handle decimals with scale greater than precision
Pull Request -
State: closed - Opened by srowen almost 2 years ago
- 4 comments
Labels: bug
#622 - DecimalType parsing fails on some values
Issue -
State: closed - Opened by agolovenko almost 2 years ago
- 2 comments
#621 - Allow custom timestamp with Spark timezone property
Pull Request -
State: closed - Opened by JorisTruong almost 2 years ago
- 4 comments
Labels: enhancement
#621 - Allow custom timestamp with Spark timezone property
Pull Request -
State: closed - Opened by JorisTruong almost 2 years ago
- 4 comments
Labels: enhancement
#620 - Update Spark, dep versions; avoid 2.13 deprecations; suppress INFO logs in test
Pull Request -
State: closed - Opened by srowen almost 2 years ago
#619 - Parse ref attribute in XSD element
Pull Request -
State: closed - Opened by shuch3ng almost 2 years ago
Labels: enhancement
#618 - Allow xpath for rowTag
Issue -
State: closed - Opened by singlewind almost 2 years ago
- 3 comments
#617 - ref attribute in XSDToSchema
Issue -
State: closed - Opened by shuch3ng almost 2 years ago
- 7 comments
#617 - ref attribute in XSDToSchema
Issue -
State: closed - Opened by shuch3ng almost 2 years ago
- 7 comments
#616 - feat: added timeZone option
Pull Request -
State: closed - Opened by JorisTruong almost 2 years ago
#615 - Upgrade SBT
Pull Request -
State: closed - Opened by ganeshchand almost 2 years ago
#614 - Text fields with embedded tags
Issue -
State: closed - Opened by kornel-at-swoop almost 2 years ago
- 9 comments
Labels: bug
#613 - Upgrade SBT version
Issue -
State: closed - Opened by ganeshchand almost 2 years ago
- 3 comments
#612 - XML Timestamp parsing without timezone
Issue -
State: closed - Opened by JorisTruong almost 2 years ago
- 1 comment
Labels: enhancement
#611 - Write an xml element value with escape characters as it is in the input text
Issue -
State: closed - Opened by DipeshV almost 2 years ago
- 5 comments
#611 - Write an xml element value with escape characters as it is in the input text
Issue -
State: closed - Opened by DipeshV almost 2 years ago
- 5 comments
#610 - schema not respected when reading multiple xml files
Issue -
State: closed - Opened by JohnStokes228 almost 2 years ago
- 7 comments
#609 - facing error while writing the dataframe to xml file in local.
Issue -
State: closed - Opened by ParvezAlam11 almost 2 years ago
- 3 comments
#609 - facing error while writing the dataframe to xml file in local.
Issue -
State: closed - Opened by ParvezAlam11 almost 2 years ago
- 3 comments
#608 - XML parser behaves differently for StringType field when custom schema is used
Issue -
State: closed - Opened by atomobianco almost 2 years ago
- 1 comment
#607 - Getting error on latest cluster version (java.lang.NoClassDefFoundError: scala/$less$colon$less)
Issue -
State: closed - Opened by joe-chewning almost 2 years ago
- 2 comments
#607 - Getting error on latest cluster version (java.lang.NoClassDefFoundError: scala/$less$colon$less)
Issue -
State: closed - Opened by joe-chewning almost 2 years ago
- 2 comments
#606 - Failure to Parse xml file with Error "Found duplicate column(s) in the data schema: `_value`"
Issue -
State: closed - Opened by brcopeland about 2 years ago
- 6 comments
#605 - spark.read.format("xml").load(path) does not handle URIs with a comma (,)
Issue -
State: closed - Opened by embrike about 2 years ago
- 4 comments
#604 - DEPRECATED treatEmptyValueAsNulls works/suggested nullValue set to "" does not
Issue -
State: closed - Opened by clemj21 about 2 years ago
- 3 comments
#604 - DEPRECATED treatEmptyValueAsNulls works/suggested nullValue set to "" does not
Issue -
State: closed - Opened by clemj21 about 2 years ago
- 3 comments
#603 - Add arrayElementName option
Pull Request -
State: closed - Opened by srowen about 2 years ago
Labels: enhancement
#603 - Add arrayElementName option
Pull Request -
State: closed - Opened by srowen about 2 years ago
Labels: enhancement
#602 - Can "item" of ArrayType be renamed via an option when writing an XML file?
Issue -
State: closed - Opened by giuseppeceravolo about 2 years ago
- 12 comments
#601 - Package spark-xml in a python wheel so that it can be used with Delta Live Tables
Issue -
State: closed - Opened by edurdevic about 2 years ago
- 1 comment
#600 - Task failed while writing rows.
Issue -
State: closed - Opened by xifan987 about 2 years ago
- 2 comments
#599 - Spark-xml not running on Databricks Runtime 11.0
Issue -
State: closed - Opened by TheDataDexter about 2 years ago
- 3 comments
#598 - Misc: remove Experimental tags, update build to 0.16.0; add Spark 3.3 CI/CD
Pull Request -
State: closed - Opened by srowen about 2 years ago
#597 - rowtag not recognised when using ext_from_xml
Issue -
State: closed - Opened by charlottevdscheun about 2 years ago
- 2 comments
#596 - [Clean Up] Remove some duplicated code with Spark and use the ones directly from Spark Repo
Pull Request -
State: closed - Opened by ericsun95 about 2 years ago
Labels: enhancement
#595 - Rationalize logging of record, exception in error cases
Pull Request -
State: closed - Opened by srowen about 2 years ago
- 1 comment
Labels: enhancement