Skip to content
Snippets Groups Projects
  • Liquan Pei's avatar
    098c7344
    [SPARK-3486][MLlib][PySpark] PySpark support for Word2Vec · 098c7344
    Liquan Pei authored
    mengxr
    Added PySpark support for Word2Vec
    Change list
    (1) PySpark support for Word2Vec
    (2) SerDe support of string sequence both on python side and JVM side
    (3) Test for SerDe of string sequence on JVM side
    
    Author: Liquan Pei <liquanpei@gmail.com>
    
    Closes #2356 from Ishiihara/Word2Vec-python and squashes the following commits:
    
    476ea34 [Liquan Pei] style fixes
    b13a0b9 [Liquan Pei] resolve merge conflicts and minor fixes
    8671eba [Liquan Pei] Merge remote-tracking branch 'upstream/master' into Word2Vec-python
    daf88a6 [Liquan Pei] modification according to feedback
    a73fa19 [Liquan Pei] clean up
    3d8007b [Liquan Pei] fix findSynonyms for vector
    1bdcd2e [Liquan Pei] minor fixes
    cdef9f4 [Liquan Pei] add missing comments
    b7447eb [Liquan Pei] modify according to feedback
    b9a7383 [Liquan Pei] cache words RDD in fit
    89490bf [Liquan Pei] add tests and Word2VecModelWrapper
    78bbb53 [Liquan Pei] use pickle for seq string SerDe
    a264b08 [Liquan Pei] Merge remote-tracking branch 'upstream/master' into Word2Vec-python
    ca1e5ff [Liquan Pei] fix test
    68e7276 [Liquan Pei] minor style fixes
    48d5e72 [Liquan Pei] Functionality improvement
    0ad3ac1 [Liquan Pei] minor fix
    c867fdf [Liquan Pei] add Word2Vec to pyspark
    098c7344
    History
    [SPARK-3486][MLlib][PySpark] PySpark support for Word2Vec
    Liquan Pei authored
    mengxr
    Added PySpark support for Word2Vec
    Change list
    (1) PySpark support for Word2Vec
    (2) SerDe support of string sequence both on python side and JVM side
    (3) Test for SerDe of string sequence on JVM side
    
    Author: Liquan Pei <liquanpei@gmail.com>
    
    Closes #2356 from Ishiihara/Word2Vec-python and squashes the following commits:
    
    476ea34 [Liquan Pei] style fixes
    b13a0b9 [Liquan Pei] resolve merge conflicts and minor fixes
    8671eba [Liquan Pei] Merge remote-tracking branch 'upstream/master' into Word2Vec-python
    daf88a6 [Liquan Pei] modification according to feedback
    a73fa19 [Liquan Pei] clean up
    3d8007b [Liquan Pei] fix findSynonyms for vector
    1bdcd2e [Liquan Pei] minor fixes
    cdef9f4 [Liquan Pei] add missing comments
    b7447eb [Liquan Pei] modify according to feedback
    b9a7383 [Liquan Pei] cache words RDD in fit
    89490bf [Liquan Pei] add tests and Word2VecModelWrapper
    78bbb53 [Liquan Pei] use pickle for seq string SerDe
    a264b08 [Liquan Pei] Merge remote-tracking branch 'upstream/master' into Word2Vec-python
    ca1e5ff [Liquan Pei] fix test
    68e7276 [Liquan Pei] minor style fixes
    48d5e72 [Liquan Pei] Functionality improvement
    0ad3ac1 [Liquan Pei] minor fix
    c867fdf [Liquan Pei] add Word2Vec to pyspark
feature.py 6.02 KiB