-
- Downloads
[SPARK-11920][ML][DOC] ML LinearRegression should use correct dataset in...
[SPARK-11920][ML][DOC] ML LinearRegression should use correct dataset in examples and user guide doc ML ```LinearRegression``` use ```data/mllib/sample_libsvm_data.txt``` as dataset in examples and user guide doc, but it's actually classification dataset rather than regression dataset. We should use ```data/mllib/sample_linear_regression_data.txt``` instead. The deeper causes is that ```LinearRegression``` with "normal" solver can not solve this dataset correctly, may be due to the ill condition and unreasonable label. This issue has been reported at [SPARK-11918](https://issues.apache.org/jira/browse/SPARK-11918). It will confuse users if they run the example code but get exception, so we should make this change which can clearly illustrate the usage of ```LinearRegression``` algorithm. Author: Yanbo Liang <ybliang8@gmail.com> Closes #9905 from yanboliang/spark-11920.
Showing
- examples/src/main/java/org/apache/spark/examples/ml/JavaLinearRegressionWithElasticNetExample.java 1 addition, 1 deletion...xamples/ml/JavaLinearRegressionWithElasticNetExample.java
- examples/src/main/python/ml/linear_regression_with_elastic_net.py 2 additions, 1 deletion.../src/main/python/ml/linear_regression_with_elastic_net.py
- examples/src/main/scala/org/apache/spark/examples/ml/LinearRegressionWithElasticNetExample.scala 2 additions, 1 deletion...k/examples/ml/LinearRegressionWithElasticNetExample.scala
Please register or sign in to comment