[SPARK-11528] [SQL] Typed aggregations for Datasets
This PR adds the ability to do typed SQL aggregations. We will likely also want to provide an interface to allow users to do aggregations on objects, but this is deferred to another PR.

```scala
val ds = Seq(("a", 10), ("a", 20), ("b", 1), ("b", 2), ("c", 1)).toDS()
ds.groupBy(_._1).agg(sum("_2").as[Int]).collect()

res0: Array(("a", 30), ("b", 3), ("c", 1))
```

Author: Michael Armbrust <michael@databricks.com>

Closes #9499 from marmbrus/dataset-agg.
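For readers who want to check the expected result of the example without a Spark session, the grouped sum can be sketched with plain Scala collections. This is only an illustration of the semantics (group by the first tuple field, sum the second), not the implementation added in this PR; the names `TypedAggSketch` and `sumByKey` are hypothetical and do not appear in Spark.

```scala
// Plain Scala collections, no Spark required: a sketch of the semantics only.
// `TypedAggSketch` and `sumByKey` are illustrative names, not Spark APIs.
object TypedAggSketch {
  // Group (key, value) pairs by key and sum the values within each group.
  // Results are sorted by key so the output order is deterministic.
  def sumByKey(rows: Seq[(String, Int)]): Seq[(String, Int)] =
    rows
      .groupBy(_._1)
      .map { case (key, group) => (key, group.map(_._2).sum) }
      .toSeq
      .sortBy(_._1)

  def main(args: Array[String]): Unit = {
    // Same input data as the Dataset example in the description.
    val rows = Seq(("a", 10), ("a", 20), ("b", 1), ("b", 2), ("c", 1))
    val result = sumByKey(rows)
    assert(result == Seq(("a", 30), ("b", 3), ("c", 1)))
    result.foreach(println)
  }
}
```

Unlike this sketch, the Dataset version gives no ordering guarantee for the collected groups unless a sort is applied explicitly.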
Changed files:
- sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/namedExpressions.scala (4 additions, 0 deletions)
- sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala (1 addition, 1 deletion)
- sql/core/src/main/scala/org/apache/spark/sql/GroupedDataset.scala (91 additions, 2 deletions)
- sql/core/src/test/scala/org/apache/spark/sql/DatasetSuite.scala (36 additions, 0 deletions)