https://github.com/apache/spark
Revision 362ef94fa88ec5fc0535e5eabbe915da4c2541c8 authored by Shrikant Prasad on 08 March 2023, 03:33:39 UTC, committed by Kent Yao on 08 March 2023, 03:35:16 UTC
### What changes were proposed in this pull request?

The PR fixes the authentication failure of the proxy user on driver side while accessing kerberized hdfs through spark on k8s job. It follows the similar approach as it was done for Mesos: https://github.com/mesosphere/spark/pull/26

 ### Why are the changes needed?

When we try to access the kerberized HDFS through a proxy user in Spark Job running in cluster deploy mode with Kubernetes resource manager, we encounter AccessControlException. This is because  authentication in driver is done using tokens of the proxy user and since proxy user doesn't have any delegation tokens on driver, auth fails.

Further details:
https://issues.apache.org/jira/browse/SPARK-25355?focusedCommentId=17532063&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17532063

 https://issues.apache.org/jira/browse/SPARK-25355?focusedCommentId=17532135&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17532135

 ### Does this PR introduce _any_ user-facing change?

Yes, user will now be able to use proxy-user to access kerberized hdfs with Spark on K8s.

### How was this patch tested?

The patch was tested by:

1. Running job which accesses kerberized hdfs with proxy user in cluster mode and client mode with kubernetes resource manager.

2. Running job which accesses kerberized hdfs without proxy user in cluster mode and client mode with kubernetes resource manager.

3. Build and run test github action : https://github.com/shrprasa/spark/actions/runs/3051203625

Closes #37880 from shrprasa/proxy_user_fix.

Authored-by: Shrikant Prasad <shrprasa@visa.com>
Signed-off-by: Kent Yao <yao@apache.org>
(cherry picked from commit b3b3557ccbe53e34e0d0dbe3d21f49a230ee621b)
Signed-off-by: Kent Yao <yao@apache.org>
1 parent 954faa4
History
Tip revision: 362ef94fa88ec5fc0535e5eabbe915da4c2541c8 authored by Shrikant Prasad on 08 March 2023, 03:33:39 UTC
[SPARK-39399][CORE][K8S] Fix proxy-user authentication for Spark on k8s in cluster deploy mode
Tip revision: 362ef94

README.md

back to top