Skip to content
  • Cheng Lian's avatar
    ce38a35b
    [SPARK-12935][SQL] DataFrame API for Count-Min Sketch · ce38a35b
    Cheng Lian authored
    This PR integrates Count-Min Sketch from spark-sketch into DataFrame. This version resorts to `RDD.aggregate` for building the sketch. A more performant UDAF version can be built in future follow-up PRs.
    
    Author: Cheng Lian <lian@databricks.com>
    
    Closes #10911 from liancheng/cms-df-api.
    ce38a35b
    [SPARK-12935][SQL] DataFrame API for Count-Min Sketch
    Cheng Lian authored
    This PR integrates Count-Min Sketch from spark-sketch into DataFrame. This version resorts to `RDD.aggregate` for building the sketch. A more performant UDAF version can be built in future follow-up PRs.
    
    Author: Cheng Lian <lian@databricks.com>
    
    Closes #10911 from liancheng/cms-df-api.
Loading