Revision - dd69ac6 - [SPARK-19611][SQL][FOLLOWUP] set dataSchema correctly in [...] - origin: https://github.com/apache/spark

visit type:

https://github.com/apache/spark

05 April 2024, 20:24:39 UTC

Revision dd69ac620c5dea38d22ca63488b6fdb430e81da2 authored by Wenchen Fan on 31 October 2017, 10:35:32 UTC, committed by Wenchen Fan on 31 October 2017, 10:36:52 UTC

[SPARK-19611][SQL][FOLLOWUP] set dataSchema correctly in HiveMetastoreCatalog.convertToLogicalRelation

We made a mistake in https://github.com/apache/spark/pull/16944 . In `HiveMetastoreCatalog#inferIfNeeded` we infer the data schema, merge with full schema, and return the new full schema. At caller side we treat the full schema as data schema and set it to `HadoopFsRelation`.

This doesn't cause any problem because both parquet and orc can work with a wrong data schema that has extra columns, but it's better to fix this mistake.

N/A

Author: Wenchen Fan <wenchen@databricks.com>

Closes #19615 from cloud-fan/infer.

(cherry picked from commit 4d9ebf3835dde1abbf9cff29a55675d9f4227620)
Signed-off-by: Wenchen Fan <wenchen@databricks.com>

1 parent 7f8236c

Files
Changes

Permalinks

Tip revision: dd69ac620c5dea38d22ca63488b6fdb430e81da2 authored by Wenchen Fan on 31 October 2017, 10:35:32 UTC
[SPARK-19611][SQL][FOLLOWUP] set dataSchema correctly in HiveMetastoreCatalog.convertToLogicalRelation

Tip revision: dd69ac6

File	Mode	Size
.github
R
assembly
bin
build
common
conf
core
data
dev
docs
examples
external
graphx
launcher
licenses
mllib
mllib-local
project
python
repl
resource-managers
sbin
sql
streaming
tools
.gitattributes	-rw-r--r--	40 bytes
.gitignore	-rw-r--r--	1.2 KB
.travis.yml	-rw-r--r--	1.7 KB
CONTRIBUTING.md	-rw-r--r--	995 bytes
LICENSE	-rw-r--r--	17.5 KB
NOTICE	-rw-r--r--	24.1 KB
README.md	-rw-r--r--	3.7 KB
appveyor.yml	-rw-r--r--	1.9 KB
pom.xml	-rw-r--r--	94.8 KB
scalastyle-config.xml	-rw-r--r--	17.4 KB

Showing with 0 additions and 0 deletions (0 / 0 diffs computed)

Computing file changes ...

https://github.com/apache/spark

[SPARK-19611][SQL][FOLLOWUP] set dataSchema correctly in HiveMetastoreCatalog.convertToLogicalRelation

README.md