-
- Downloads
[SPARK-17090][ML] Make tree aggregation level in linear/logistic regression configurable
## What changes were proposed in this pull request? Linear/logistic regression use treeAggregate with default depth (always = 2) for collecting coefficient gradient updates to the driver. For high dimensional problems, this can cause OOM error on the driver. This patch makes it configurable to avoid this problem if users' input data has many features. It adds a HasTreeDepth API in `sharedParams.scala`, and extends it to both Linear regression and logistic regression in .ml Author: hqzizania <hqzizania@gmail.com> Closes #14717 from hqzizania/SPARK-17090.
Showing
- mllib/src/main/scala/org/apache/spark/ml/classification/LogisticRegression.scala 18 additions, 6 deletions...g/apache/spark/ml/classification/LogisticRegression.scala
- mllib/src/main/scala/org/apache/spark/ml/classification/MultinomialLogisticRegression.scala 14 additions, 2 deletions...ark/ml/classification/MultinomialLogisticRegression.scala
- mllib/src/main/scala/org/apache/spark/ml/param/shared/SharedParamsCodeGen.scala 3 additions, 1 deletion...rg/apache/spark/ml/param/shared/SharedParamsCodeGen.scala
- mllib/src/main/scala/org/apache/spark/ml/param/shared/sharedParams.scala 21 additions, 4 deletions...scala/org/apache/spark/ml/param/shared/sharedParams.scala
- mllib/src/main/scala/org/apache/spark/ml/regression/LinearRegression.scala 18 additions, 4 deletions...ala/org/apache/spark/ml/regression/LinearRegression.scala
Please register or sign in to comment