Skip to content
  • Yong Tang's avatar
    bc748b7b
    [SPARK-14238][ML][MLLIB][PYSPARK] Add binary toggle Param to PySpark HashingTF in ML & MLlib · bc748b7b
    Yong Tang authored
    ## What changes were proposed in this pull request?
    
    This fix tries to add binary toggle Param to PySpark HashingTF in ML & MLlib. If this toggle is set, then all non-zero counts will be set to 1.
    
    Note: This fix (SPARK-14238) is extended from SPARK-13963 where Scala implementation was done.
    
    ## How was this patch tested?
    
    This fix adds two tests to cover the code changes. One for HashingTF in PySpark's ML and one for HashingTF in PySpark's MLLib.
    
    Author: Yong Tang <yong.tang.github@outlook.com>
    
    Closes #12079 from yongtang/SPARK-14238.
    bc748b7b
    [SPARK-14238][ML][MLLIB][PYSPARK] Add binary toggle Param to PySpark HashingTF in ML & MLlib
    Yong Tang authored
    ## What changes were proposed in this pull request?
    
    This fix tries to add binary toggle Param to PySpark HashingTF in ML & MLlib. If this toggle is set, then all non-zero counts will be set to 1.
    
    Note: This fix (SPARK-14238) is extended from SPARK-13963 where Scala implementation was done.
    
    ## How was this patch tested?
    
    This fix adds two tests to cover the code changes. One for HashingTF in PySpark's ML and one for HashingTF in PySpark's MLLib.
    
    Author: Yong Tang <yong.tang.github@outlook.com>
    
    Closes #12079 from yongtang/SPARK-14238.
Loading