-
- Downloads
[SPARK-11393] [SQL] CoGroupedIterator should respect the fact that...
[SPARK-11393] [SQL] CoGroupedIterator should respect the fact that GroupedIterator.hasNext is not idempotent When we cogroup 2 `GroupedIterator`s in `CoGroupedIterator`, if the right side is smaller, we will consume right data and keep the left data unchanged. Then we call `hasNext` which will call `left.hasNext`. This will make `GroupedIterator` generate an extra group as the previous one has not been comsumed yet. Author: Wenchen Fan <wenchen@databricks.com> Closes #9346 from cloud-fan/cogroup and squashes the following commits: 9be67c8 [Wenchen Fan] SPARK-11393
Showing
- sql/core/src/main/scala/org/apache/spark/sql/execution/CoGroupedIterator.scala 8 additions, 6 deletions...la/org/apache/spark/sql/execution/CoGroupedIterator.scala
- sql/core/src/test/scala/org/apache/spark/sql/execution/CoGroupedIteratorSuite.scala 24 additions, 0 deletions...g/apache/spark/sql/execution/CoGroupedIteratorSuite.scala
Please register or sign in to comment