[SPARK-12936][SQL] Initial bloom filter implementation
This PR adds an initial implementation of a bloom filter in the newly added sketch module. The implementation is based on the [`BloomFilter` class in guava](https://code.google.com/p/guava-libraries/source/browse/guava/src/com/google/common/hash/BloomFilter.java).

Some differences from the design doc:
* expose `bitSize` instead of `sizeInBytes` to the user.
* always require the `expectedInsertions` parameter when creating a bloom filter.

Author: Wenchen Fan <wenchen@databricks.com>

Closes #10883 from cloud-fan/bloom-filter.
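For readers unfamiliar with how `expectedInsertions` and `bitSize` relate, here is a minimal sketch of a bloom filter over `long` keys. It is not the code from this PR or from guava: the class name `SimpleBloomFilter`, the `fpp` (target false-positive probability) parameter, the `mix64` hash, and the use of `java.util.BitSet` in place of the PR's `BitArray` are all assumptions made for illustration. The sizing uses the standard formulas m = -n ln(p) / (ln 2)^2 for the bit count and k = (m / n) ln 2 for the number of hash functions.

```java
import java.util.BitSet;

/** Hypothetical, simplified bloom filter for long keys; not the Spark or guava implementation. */
final class SimpleBloomFilter {
    private final BitSet bits;
    private final int bitSize;
    private final int numHashFunctions;

    SimpleBloomFilter(long expectedInsertions, double fpp) {
        if (expectedInsertions <= 0) {
            throw new IllegalArgumentException("expectedInsertions must be positive");
        }
        if (fpp <= 0.0 || fpp >= 1.0) {
            throw new IllegalArgumentException("fpp must be in (0, 1)");
        }
        this.bitSize = optimalNumOfBits(expectedInsertions, fpp);
        this.numHashFunctions = optimalNumOfHashFunctions(expectedInsertions, bitSize);
        this.bits = new BitSet(bitSize);
    }

    /** Standard sizing: m = -n * ln(p) / (ln 2)^2, rounded up and capped to an int. */
    static int optimalNumOfBits(long n, double p) {
        double m = -n * Math.log(p) / (Math.log(2) * Math.log(2));
        return (int) Math.min(Integer.MAX_VALUE, Math.max(1, Math.ceil(m)));
    }

    /** Standard choice: k = (m / n) * ln 2, at least 1. */
    static int optimalNumOfHashFunctions(long n, long m) {
        return Math.max(1, (int) Math.round((double) m / n * Math.log(2)));
    }

    int bitSize() { return bitSize; }

    int numHashFunctions() { return numHashFunctions; }

    /** 64-bit mixing function (splitmix64-style), chosen only for this illustration. */
    private static long mix64(long z) {
        z += 0x9e3779b97f4a7c15L;
        z = (z ^ (z >>> 30)) * 0xbf58476d1ce4e5b9L;
        z = (z ^ (z >>> 27)) * 0x94d049bb133111ebL;
        return z ^ (z >>> 31);
    }

    /** Double hashing: the i-th index is (h1 + i * h2) mod bitSize, kept non-negative. */
    private int index(long hash, int i) {
        int h1 = (int) hash;
        int h2 = (int) (hash >>> 32);
        return Math.floorMod(h1 + i * h2, bitSize);
    }

    /** Sets the key's bits; returns true if at least one bit changed. */
    boolean put(long key) {
        long hash = mix64(key);
        boolean changed = false;
        for (int i = 1; i <= numHashFunctions; i++) {
            int idx = index(hash, i);
            changed |= !bits.get(idx);
            bits.set(idx);
        }
        return changed;
    }

    /** Returns false only if the key was definitely never inserted. */
    boolean mightContain(long key) {
        long hash = mix64(key);
        for (int i = 1; i <= numHashFunctions; i++) {
            if (!bits.get(index(hash, i))) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        SimpleBloomFilter filter = new SimpleBloomFilter(1000, 0.03);
        filter.put(42L);
        System.out.println("bitSize = " + filter.bitSize());
        System.out.println("mightContain(42) = " + filter.mightContain(42L));
    }
}
```

Requiring `expectedInsertions` up front, as the PR does, makes this sizing possible at construction time: the bit array cannot be resized later without rehashing every inserted item, which a bloom filter does not retain.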
Showing
- common/sketch/src/main/java/org/apache/spark/util/sketch/BitArray.java (94 additions, 0 deletions)
- common/sketch/src/main/java/org/apache/spark/util/sketch/BloomFilter.java (153 additions, 0 deletions)
- common/sketch/src/main/java/org/apache/spark/util/sketch/BloomFilterImpl.java (164 additions, 0 deletions)
- common/sketch/src/test/scala/org/apache/spark/util/sketch/BitArraySuite.scala (77 additions, 0 deletions)
- common/sketch/src/test/scala/org/apache/spark/util/sketch/BloomFilterSuite.scala (114 additions, 0 deletions)