[SPARK-8670] [SQL] Nested columns can't be referenced in pyspark
This bug is caused by an incorrect column-existence check in `__getitem__` of the PySpark DataFrame. `DataFrame.apply` accepts not only top-level column names but also nested column names such as `a.b`, so the check should be removed from `__getitem__`.

Author: Wenchen Fan <cloud0fan@outlook.com>

Closes #8202 from cloud-fan/nested.
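To illustrate the failure mode, here is a minimal pure-Python sketch. It does not use the real pyspark classes: `ToyFrame`, `resolve`, `getitem_with_check`, and `getitem_without_check` are hypothetical names invented for this example, and `resolve` only stands in for the nested-name resolution that the message attributes to `DataFrame.apply`. It shows why a check against top-level column names rejects a valid nested name such as `a.b`.

```python
class ToyFrame:
    """Hypothetical stand-in for a DataFrame; not the real pyspark class."""

    def __init__(self, schema):
        # schema maps a column name to None (leaf) or to a nested dict (struct).
        self.schema = schema
        self.columns = list(schema)  # top-level names only

    def resolve(self, name):
        # Assumed stand-in for the resolver: walks a dotted path through
        # the nested schema and fails if any segment is missing.
        node = self.schema
        for part in name.split("."):
            if not isinstance(node, dict) or part not in node:
                raise KeyError("cannot resolve column: %s" % name)
            node = node[part]
        return name

    def getitem_with_check(self, name):
        # Models the problematic behaviour: only top-level names pass.
        if name not in self.columns:
            raise IndexError("no such column: %s" % name)
        return self.resolve(name)

    def getitem_without_check(self, name):
        # Models the fix: defer entirely to the resolver.
        return self.resolve(name)


df = ToyFrame({"a": {"b": None}, "c": None})

# Top-level names work either way.
assert df.getitem_with_check("c") == "c"

# The nested name is valid, but the top-level check rejects it.
try:
    df.getitem_with_check("a.b")
    rejected = False
except IndexError:
    rejected = True

# Without the check, the resolver handles the nested name.
resolved = df.getitem_without_check("a.b")
```

In this sketch, dropping the check loses no safety, because the resolver still raises for names that do not exist at any nesting level.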
Showing 3 changed files:
- python/pyspark/sql/dataframe.py: 0 additions, 2 deletions
- python/pyspark/sql/tests.py: 3 additions, 1 deletion
- sql/core/src/main/scala/org/apache/spark/sql/DataFrame.scala: 2 additions, 0 deletions