-
- Downloads
[SPARK-12818][SQL] Specialized integral and string types for Count-min Sketch
This PR is a follow-up of #10911. It adds specialized update methods for `CountMinSketch` so that we can avoid doing internal/external row format conversion in `DataFrame.countMinSketch()`. Author: Cheng Lian <lian@databricks.com> Closes #10968 from liancheng/cms-specialized.
Showing
- common/sketch/src/main/java/org/apache/spark/util/sketch/CountMinSketch.java 32 additions, 2 deletions...ain/java/org/apache/spark/util/sketch/CountMinSketch.java
- common/sketch/src/main/java/org/apache/spark/util/sketch/CountMinSketchImpl.java 28 additions, 7 deletions...java/org/apache/spark/util/sketch/CountMinSketchImpl.java
- sql/core/src/main/scala/org/apache/spark/sql/DataFrameStatFunctions.scala 39 additions, 26 deletions...n/scala/org/apache/spark/sql/DataFrameStatFunctions.scala
Loading
Please register or sign in to comment