Skip to content
Snippets Groups Projects
  • Burak Köse's avatar
    e20cd9f4
    [SPARK-14050][ML] Add multiple languages support and additional methods for Stop Words Remover · e20cd9f4
    Burak Köse authored
    ## What changes were proposed in this pull request?
    
    This PR continues the work from #11871 with the following changes:
    * load English stopwords as default
    * covert stopwords to list in Python
    * update some tests and doc
    
    ## How was this patch tested?
    
    Unit tests.
    
    Closes #11871
    
    cc: burakkose srowen
    
    Author: Burak Köse <burakks41@gmail.com>
    Author: Xiangrui Meng <meng@databricks.com>
    Author: Burak KOSE <burakks41@gmail.com>
    
    Closes #12843 from mengxr/SPARK-14050.
    e20cd9f4
    History
    [SPARK-14050][ML] Add multiple languages support and additional methods for Stop Words Remover
    Burak Köse authored
    ## What changes were proposed in this pull request?
    
    This PR continues the work from #11871 with the following changes:
    * load English stopwords as default
    * covert stopwords to list in Python
    * update some tests and doc
    
    ## How was this patch tested?
    
    Unit tests.
    
    Closes #11871
    
    cc: burakkose srowen
    
    Author: Burak Köse <burakks41@gmail.com>
    Author: Xiangrui Meng <meng@databricks.com>
    Author: Burak KOSE <burakks41@gmail.com>
    
    Closes #12843 from mengxr/SPARK-14050.
LICENSE-postgresql.txt 1.17 KiB