[SPARK-13963][ML] Adding binary toggle param to HashingTF
## What changes were proposed in this pull request?

Adds a binary toggle parameter to ml.feature.HashingTF, and to mllib.feature.HashingTF as well, since the former wraps the latter's functionality. When set to true, the parameter sets every non-zero term count to 1, turning term-count features into binary values that are well suited to discrete probability models.

## How was this patch tested?

Added unit tests for ML and MLlib.

Author: Bryan Cutler <cutlerb@gmail.com>

Closes #11832 from BryanCutler/binary-param-HashingTF-SPARK-13963.
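To illustrate what the binary toggle described in this commit message does, here is a minimal, Spark-free sketch in plain Scala. It is not the code from this patch: the object name `HashingTFSketch`, the `transform` signature, and the use of `String.hashCode` as the hash function are all assumptions made for illustration only.

```scala
// Hypothetical stand-alone sketch of hashing term frequencies with a binary toggle.
// This is NOT the Spark implementation; names and the hash function are assumptions.
object HashingTFSketch {

  // Map any hash code to a bucket index in [0, mod).
  def nonNegativeMod(x: Int, mod: Int): Int = {
    val raw = x % mod
    if (raw < 0) raw + mod else raw
  }

  // Returns a sparse representation: bucket index -> feature value.
  // With binary = false the value is the term count for that bucket;
  // with binary = true every non-zero count is set to 1.0.
  def transform(
      terms: Seq[String],
      numFeatures: Int,
      binary: Boolean): Map[Int, Double] = {
    val features = scala.collection.mutable.HashMap.empty[Int, Double]
    terms.foreach { term =>
      val index = nonNegativeMod(term.hashCode, numFeatures)
      val value = if (binary) 1.0 else features.getOrElse(index, 0.0) + 1.0
      features(index) = value
    }
    features.toMap
  }
}
```

For the document `Seq("a", "a", "b")`, calling `HashingTFSketch.transform` with `binary = false` gives the bucket for "a" a value of 2.0, while `binary = true` gives every occupied bucket a value of 1.0. Presence-only features of this kind are what discrete models such as Bernoulli naive Bayes expect.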
Showing 4 changed files
- mllib/src/main/scala/org/apache/spark/ml/feature/HashingTF.scala: 20 additions, 3 deletions
- mllib/src/main/scala/org/apache/spark/mllib/feature/HashingTF.scala: 14 additions, 1 deletion
- mllib/src/test/scala/org/apache/spark/ml/feature/HashingTFSuite.scala: 23 additions, 1 deletion
- mllib/src/test/scala/org/apache/spark/mllib/feature/HashingTFSuite.scala: 12 additions, 0 deletions