-
- Downloads
[SPARK-13322][ML] AFTSurvivalRegression supports feature standardization
## What changes were proposed in this pull request? AFTSurvivalRegression should support feature standardization, it will improve the convergence rate. Test the convergence rate on the [Ovarian](https://stat.ethz.ch/R-manual/R-devel/library/survival/html/ovarian.html) data which is standard data comes with Survival library in R, * without standardization(before this PR) -> 74 iterations. * with standardization(after this PR) -> 38 iterations. But after this fix, with or without ```standardization``` will converge to the same solution. It means that ```standardization = false``` will run the same code route as ```standardization = true```. Because if the features are not standardized at all, it will result convergency issue when the features have very different scales. This behavior is the same as ML [```LinearRegression``` and ```LogisticRegression```](https://issues.apache.org/jira/browse/SPARK-8522). See more discussion about this topic at #11247. cc mengxr ## How was this patch tested? unit test. Author: Yanbo Liang <ybliang8@gmail.com> Closes #11365 from yanboliang/spark-13322.
Showing
- mllib/src/main/scala/org/apache/spark/ml/regression/AFTSurvivalRegression.scala 71 additions, 34 deletions...rg/apache/spark/ml/regression/AFTSurvivalRegression.scala
- mllib/src/test/scala/org/apache/spark/ml/regression/AFTSurvivalRegressionSuite.scala 22 additions, 0 deletions...ache/spark/ml/regression/AFTSurvivalRegressionSuite.scala
Please register or sign in to comment