-
- Downloads
[SPARK-5173]support python application running on yarn cluster mode
now when we run python application on yarn cluster mode through spark-submit, spark-submit does not support python application on yarn cluster mode. so i modify code of submit and yarn's AM in order to support it. through specifying .py file or primaryResource file via spark-submit, we can make pyspark run in yarn-cluster mode. example:spark-submit --master yarn-master --num-executors 1 --driver-memory 1g --executor-memory 1g xx.py --primaryResource yy.conf this config is same as pyspark on yarn-client mode. firstly,we put local path of .py or primaryResource to yarn's dist.files.that can be distributed on slave nodes.and then in spark-submit we transfer --py-files and --primaryResource to yarn.Client and use "org.apache.spark.deploy.PythonRunner" to user class that can run .py files on ApplicationMaster. in yarn.Client we transfer --py-files and --primaryResource to ApplicationMaster. in ApplicationMaster, user's class is org.apache.spark.deploy.PythonRunner, and user's args is primaryResource and -py-files. so that can make pyspark run on ApplicationMaster. JoshRosen tgravescs sryza Author: lianhuiwang <lianhuiwang09@gmail.com> Author: Wang Lianhui <lianhuiwang09@gmail.com> Closes #3976 from lianhuiwang/SPARK-5173 and squashes the following commits: 28a8a58 [lianhuiwang] fix variable name 67f8cee [lianhuiwang] update with andrewor's comments 0319ae3 [lianhuiwang] address with sryza's comments 2385ef6 [lianhuiwang] address with sryza's comments 03640ab [lianhuiwang] add sparkHome to env 47d2fc3 [lianhuiwang] fix test 2adc8f5 [lianhuiwang] add spark.test.home d60bc60 [lianhuiwang] fix test 5b30064 [lianhuiwang] add test 097a5ec [lianhuiwang] fix line length exceeds 100 905a106 [lianhuiwang] update with sryza and andrewor 's comments f1f55b6 [lianhuiwang] when yarn-cluster, all python files can be non-local 172eec1 [Wang Lianhui] fix a min submit's bug 9c941bc [lianhuiwang] support python application running on yarn cluster mode
Showing
- core/src/main/scala/org/apache/spark/deploy/PythonRunner.scala 1 addition, 1 deletion...src/main/scala/org/apache/spark/deploy/PythonRunner.scala
- core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala 42 additions, 7 deletions.../src/main/scala/org/apache/spark/deploy/SparkSubmit.scala
- core/src/main/scala/org/apache/spark/deploy/SparkSubmitArguments.scala 0 additions, 12 deletions.../scala/org/apache/spark/deploy/SparkSubmitArguments.scala
- yarn/src/main/scala/org/apache/spark/deploy/yarn/ApplicationMaster.scala 9 additions, 5 deletions...cala/org/apache/spark/deploy/yarn/ApplicationMaster.scala
- yarn/src/main/scala/org/apache/spark/deploy/yarn/ApplicationMasterArguments.scala 13 additions, 0 deletions...apache/spark/deploy/yarn/ApplicationMasterArguments.scala
- yarn/src/main/scala/org/apache/spark/deploy/yarn/Client.scala 17 additions, 2 deletions.../src/main/scala/org/apache/spark/deploy/yarn/Client.scala
- yarn/src/main/scala/org/apache/spark/deploy/yarn/ClientArguments.scala 15 additions, 5 deletions.../scala/org/apache/spark/deploy/yarn/ClientArguments.scala
- yarn/src/test/scala/org/apache/spark/deploy/yarn/YarnClusterSuite.scala 44 additions, 0 deletions...scala/org/apache/spark/deploy/yarn/YarnClusterSuite.scala
Please register or sign in to comment