[SPARK-6255] [MLLIB] Support multiclass classification in Python API
Python API parity check for classification and multiclass classification support. The following major disparities need to be added for Python:

```scala
LogisticRegressionWithLBFGS
    setNumClasses
    setValidateData
LogisticRegressionModel
    getThreshold
    numClasses
    numFeatures
SVMWithSGD
    setValidateData
SVMModel
    getThreshold
```

For users, the greatest benefit of this PR is that the Python API now supports multiclass classification. Users can train a multiclass classification model and use it for prediction in PySpark.

Author: Yanbo Liang <ybliang8@gmail.com>

Closes #5137 from yanboliang/spark-6255 and squashes the following commits:

0bd531e [Yanbo Liang] address comments
444d5e2 [Yanbo Liang] LogisticRegressionModel.predict() optimization
fc7990b [Yanbo Liang] address comments
b0d9c63 [Yanbo Liang] Support Mulinomial LR model predict in Python API
ded847c [Yanbo Liang] Python API parity check for classification (support multiclass classification)
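To illustrate what multiclass prediction involves, here is a minimal pure-Python sketch of a multinomial logistic regression predict step. It is not the code from this PR. It assumes the common pivot-class layout, in which class 0 has a fixed margin of 0 and each remaining class has its own weight row; the function name `predict_multiclass` and its parameters are hypothetical and chosen only for this example.

```python
def predict_multiclass(x, weights_matrix, intercepts=None):
    """Return the predicted class label in 0..K-1 for feature vector x.

    Assumption: weights_matrix has K-1 rows (one per non-pivot class),
    each with len(x) entries. Class 0 is the pivot with margin 0.
    intercepts, if given, holds one bias term per non-pivot class.
    """
    best_class = 0
    max_margin = 0.0  # margin of the pivot class 0
    for i, row in enumerate(weights_matrix):
        # Dot product of the class weights with the features.
        margin = sum(w * xi for w, xi in zip(row, x))
        if intercepts is not None:
            margin += intercepts[i]
        # Row i corresponds to class i + 1, since class 0 is the pivot.
        if margin > max_margin:
            max_margin = margin
            best_class = i + 1
    return best_class


# Hypothetical model: 3 classes, 2 features, so 2 weight rows.
weights = [[1.0, -1.0], [-1.0, 1.0]]
label_a = predict_multiclass([2.0, 0.0], weights)
label_b = predict_multiclass([0.0, 2.0], weights)
label_c = predict_multiclass([0.0, 0.0], weights)
```

With these made-up weights, a point where no non-pivot margin exceeds 0 falls back to class 0, which is why only K-1 weight rows are needed for K classes.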
Showing
- mllib/src/main/scala/org/apache/spark/mllib/api/python/PythonMLLibAPI.scala (18 additions, 4 deletions)
- python/pyspark/mllib/classification.py (108 additions, 26 deletions)
- python/pyspark/mllib/regression.py (8 additions, 2 deletions)