https://github.com/apache/spark
Revision 30d5c9fd8ae1944a94ddedae83433368a02e55e6 authored by Dongjoon Hyun on 13 October 2017, 15:09:12 UTC, committed by Wenchen Fan on 13 October 2017, 15:11:50 UTC
Before Hive 2.0, ORC File schema has invalid column names like `_col1` and `_col2`. This is a well-known limitation and there are several Apache Spark issues with `spark.sql.hive.convertMetastoreOrc=true`. This PR ignores ORC File schema and use Spark schema.

Pass the newly added test case.

Author: Dongjoon Hyun <dongjoon@apache.org>

Closes #19470 from dongjoon-hyun/SPARK-18355.

(cherry picked from commit e6e36004afc3f9fc8abea98542248e9de11b4435)
Signed-off-by: Wenchen Fan <wenchen@databricks.com>
1 parent c9187db
History
Tip revision: 30d5c9fd8ae1944a94ddedae83433368a02e55e6 authored by Dongjoon Hyun on 13 October 2017, 15:09:12 UTC
[SPARK-14387][SPARK-16628][SPARK-18355][SQL] Use Spark schema to read ORC table instead of ORC file schema
Tip revision: 30d5c9f

README.md

back to top