[SPARK-12981] [SQL] extract Python UDF in physical plan
## What changes were proposed in this pull request?

Currently we extract Python UDFs into a special logical plan, EvaluatePython, in the analyzer. But EvaluatePython is not part of Catalyst, so many rules have no knowledge of it, which breaks many things (for example, filter push-down or column pruning).

We should treat Python UDFs as normal expressions until we want to evaluate them in the physical plan; we could extract them at the end of the optimizer or in the physical plan. This PR extracts Python UDFs in the physical plan.

Closes #10935

## How was this patch tested?

Added regression tests.

Author: Davies Liu <davies@databricks.com>

Closes #12127 from davies/py_udf.
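
To illustrate the idea behind this change, here is a toy sketch in plain Python. It is not Spark's code: every class and function name (`Col`, `PyUDF`, `Gt`, `Scan`, `Filter`, `BatchEval`, `extract_udfs`, `execute`) and the `pythonUDF0` column name are hypothetical stand-ins. The sketch assumes a UDF stays an ordinary expression inside a filter condition until a late pass pulls it out into a dedicated evaluation node placed below the filter.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

# Toy expression nodes (hypothetical, not Spark classes).
@dataclass
class Col:
    name: str

@dataclass
class PyUDF:
    name: str
    func: Callable
    arg: object  # assumed to be a plain Col in this sketch

@dataclass
class Gt:
    left: object
    right: int

# Toy physical plan nodes (hypothetical).
@dataclass
class Scan:
    rows: List[dict]

@dataclass
class Filter:
    cond: object
    child: object

@dataclass
class BatchEval:
    udfs: List[Tuple[str, PyUDF]]  # (result column name, UDF)
    child: object

def extract_udfs(plan):
    """Replace UDFs in a Filter condition with column references and
    insert a BatchEval node below the Filter that computes them."""
    if not isinstance(plan, Filter):
        return plan
    found = []

    def rewrite(expr):
        if isinstance(expr, PyUDF):
            out = f"pythonUDF{len(found)}"  # hypothetical naming scheme
            found.append((out, expr))
            return Col(out)
        if isinstance(expr, Gt):
            return Gt(rewrite(expr.left), expr.right)
        return expr

    cond = rewrite(plan.cond)
    child = extract_udfs(plan.child)
    return Filter(cond, BatchEval(found, child)) if found else Filter(cond, child)

def evaluate(expr, row):
    if isinstance(expr, Col):
        return row[expr.name]
    if isinstance(expr, Gt):
        return evaluate(expr.left, row) > expr.right
    raise TypeError("PyUDF must be extracted before evaluation")

def execute(plan):
    if isinstance(plan, Scan):
        return [dict(r) for r in plan.rows]
    if isinstance(plan, BatchEval):
        rows = execute(plan.child)
        for out, udf in plan.udfs:
            for r in rows:
                r[out] = udf.func(evaluate(udf.arg, r))
        return rows
    if isinstance(plan, Filter):
        return [r for r in execute(plan.child) if evaluate(plan.cond, r)]
    raise TypeError(f"unknown plan node: {plan!r}")

# The UDF lives inside the filter condition like any other expression...
plan = Filter(Gt(PyUDF("double", lambda x: x * 2, Col("a")), 4),
              Scan([{"a": 1}, {"a": 3}]))
# ...and is only pulled out right before execution.
physical = extract_udfs(plan)
result = execute(physical)
```

Because the UDF is an ordinary expression until `extract_udfs` runs, any earlier rewrite that understands `Filter` and generic expressions can operate on the plan without special-casing a UDF node. This toy handles only filter conditions and does not prune the extra result column afterwards.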
Changed files:
- python/pyspark/sql/tests.py (9 additions, 0 deletions)
- sql/core/src/main/scala/org/apache/spark/sql/execution/QueryExecution.scala (1 addition, 0 deletions)
- sql/core/src/main/scala/org/apache/spark/sql/execution/SparkStrategies.scala (0 additions, 2 deletions)
- sql/core/src/main/scala/org/apache/spark/sql/execution/python/EvaluatePython.scala (0 additions, 23 deletions)
- sql/core/src/main/scala/org/apache/spark/sql/execution/python/ExtractPythonUDFs.scala (53 additions, 41 deletions)
- sql/core/src/main/scala/org/apache/spark/sql/execution/python/PythonUDF.scala (1 addition, 2 deletions)
- sql/core/src/main/scala/org/apache/spark/sql/internal/SessionState.scala (0 additions, 1 deletion)
- sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveSessionState.scala (0 additions, 1 deletion)