[SPARK-18099][YARN] Fail if same files added to distributed cache for --files and --archives
## What changes were proposed in this pull request?

During spark-submit, the YARN distributed cache can be instructed to add the same file under both --files and --archives. This change ensures that the Spark YARN distributed cache behaviour is retained, i.e. it warns and fails if the same file is mentioned in both --files and --archives.

## How was this patch tested?

Manually tested:

1. If the same jar is mentioned in --jars and --files, the job is still submitted.
   - The functionality from [SPARK-14423] #12203 is unchanged.
2. If the same file is mentioned in --files and --archives, the job fails to submit.

Please review https://cwiki.apache.org/confluence/display/SPARK/Contributing+to+Spark before opening a pull request.

… under archives and files

Author: Kishor Patil <kpatil@yahoo-inc.com>

Closes #15627 from kishorvpatil/spark18099.
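The check described above can be illustrated with a small standalone sketch. This is not the code from `Client.scala`; the object `DistCacheCheck` and its methods `parseList`, `duplicates`, and `validate` are hypothetical names introduced here, and the sketch assumes that resources are compared by their normalized URI string and that a duplicate is reported with an `IllegalArgumentException`.

```scala
import java.net.URI

// Hypothetical helper, not part of Spark: illustrates failing fast when the
// same resource is listed under both --files and --archives.
object DistCacheCheck {

  /** Splits a comma-separated list into trimmed, non-empty entries. */
  def parseList(value: String): Seq[String] =
    value.split(",").map(_.trim).filter(_.nonEmpty).toSeq

  /** Normalizes a path so that "dir/../a.txt" and "a.txt" compare equal. */
  private def normalize(path: String): String =
    new URI(path).normalize().toString

  /** Returns the archive entries that also appear in the files list. */
  def duplicates(files: Seq[String], archives: Seq[String]): Seq[String] = {
    val seen = files.map(normalize).toSet
    archives.filter(a => seen.contains(normalize(a)))
  }

  /** Throws IllegalArgumentException if any resource is in both lists. */
  def validate(files: String, archives: String): Unit = {
    val dups = duplicates(parseList(files), parseList(archives))
    require(
      dups.isEmpty,
      s"Resources listed under both --files and --archives: ${dups.mkString(", ")}")
  }
}
```

Under these assumptions, `DistCacheCheck.validate("a.txt,b.zip", "c.zip")` returns normally, while `DistCacheCheck.validate("a.txt,b.zip", "b.zip")` throws. The --jars and --files combination is deliberately not checked here, matching the first manual test case.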
Changed files:
- yarn/src/main/scala/org/apache/spark/deploy/yarn/Client.scala (10 additions, 2 deletions)
- yarn/src/test/scala/org/apache/spark/deploy/yarn/ClientSuite.scala (42 additions, 0 deletions)