Revision bfa0d13f3f6b4b662ad0f355a8db00dd1244a698 authored by Herman van Hovell on 06 August 2024, 01:54:09 UTC, committed by Hyukjin Kwon on 06 August 2024, 01:54:09 UTC
### What changes were proposed in this pull request?
We allow the `JsonToStructs` and `XmlToStructs` expressions to use a json schema.

### Why are the changes needed?
A couple of reasons:
- We want to use a reference to the `from_json` and `from_xml` methods in the Column API in order to make unification of the Classic and Connect Scala clients possible.
- Reduce the amount of duplication between the Function API and the SparkConnectPlanner.
- Make DataFrame and SQL API behave the same.

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
Existing tests.

### Was this patch authored or co-authored using generative AI tooling?
No

Closes #47573 from hvanhovell/SPARK-49083.

Authored-by: Herman van Hovell <herman@databricks.com>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
1 parent da5912a
History
File Mode Size
workflows
PULL_REQUEST_TEMPLATE -rw-r--r-- 3.4 KB
labeler.yml -rw-r--r-- 5.0 KB

back to top