-
- Downloads
[SPARK-18394][SQL] Make an AttributeSet.toSeq output order consistent
## What changes were proposed in this pull request? This pr sorted output attributes on their name and exprId in `AttributeSet.toSeq` to make the order consistent. If the order is different, spark possibly generates different code and then misses cache in `CodeGenerator`, e.g., `GenerateColumnAccessor` generates code depending on an input attribute order. ## How was this patch tested? Added tests in `AttributeSetSuite` and manually checked if the cache worked well in the given query of the JIRA. Author: Takeshi Yamamuro <yamamuro@apache.org> Closes #18959 from maropu/SPARK-18394.
Showing
- sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/AttributeSet.scala 6 additions, 1 deletion.../apache/spark/sql/catalyst/expressions/AttributeSet.scala
- sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/expressions/AttributeSetSuite.scala 40 additions, 0 deletions...he/spark/sql/catalyst/expressions/AttributeSetSuite.scala
- sql/hive/src/test/scala/org/apache/spark/sql/hive/execution/PruningSuite.scala 6 additions, 1 deletion...la/org/apache/spark/sql/hive/execution/PruningSuite.scala
Please register or sign in to comment