-
- Downloads
[SPARK-6113] [ML] Tree ensembles for Pipelines API
This is a continuation of [https://github.com/apache/spark/pull/5530] (which was for Decision Trees), but for ensembles: Random Forests and Gradient-Boosted Trees. Please refer to the JIRA [https://issues.apache.org/jira/browse/SPARK-6113], the design doc linked from the JIRA, and the previous PR linked above for design discussions. This PR follows the example set by the previous PR for Decision Trees. It includes a few cleanups to Decision Trees. Note: There is one issue which will be addressed in a separate PR: Ensembles' component Models have no parent or fittingParamMap. I plan to submit a separate PR which makes those values in Model be Options. It does not matter much which PR gets merged first. CC: mengxr manishamde codedeft chouqin Author: Joseph K. Bradley <joseph@databricks.com> Closes #5626 from jkbradley/dt-api-ensembles and squashes the following commits: 729167a [Joseph K. Bradley] small cleanups based on code review bbae2a2 [Joseph K. Bradley] Updated per all comments in code review 855aa9a [Joseph K. Bradley] scala style fix ea3d901 [Joseph K. Bradley] Added GBT to spark.ml, with tests and examples c0f30c1 [Joseph K. Bradley] Added random forests and test suites to spark.ml. Not tested yet. Need to add example as well d045ebd [Joseph K. Bradley] some more updates, but far from done ee1a10b [Joseph K. Bradley] Added files from old PR and did some initial updates.
Showing
- examples/src/main/scala/org/apache/spark/examples/ml/DecisionTreeExample.scala 81 additions, 58 deletions...la/org/apache/spark/examples/ml/DecisionTreeExample.scala
- examples/src/main/scala/org/apache/spark/examples/ml/GBTExample.scala 238 additions, 0 deletions.../main/scala/org/apache/spark/examples/ml/GBTExample.scala
- examples/src/main/scala/org/apache/spark/examples/ml/RandomForestExample.scala 248 additions, 0 deletions...la/org/apache/spark/examples/ml/RandomForestExample.scala
- examples/src/main/scala/org/apache/spark/examples/mllib/GradientBoostedTreesRunner.scala 1 addition, 0 deletions...che/spark/examples/mllib/GradientBoostedTreesRunner.scala
- mllib/src/main/scala/org/apache/spark/ml/Model.scala 2 additions, 0 deletionsmllib/src/main/scala/org/apache/spark/ml/Model.scala
- mllib/src/main/scala/org/apache/spark/ml/classification/DecisionTreeClassifier.scala 10 additions, 14 deletions...ache/spark/ml/classification/DecisionTreeClassifier.scala
- mllib/src/main/scala/org/apache/spark/ml/classification/GBTClassifier.scala 228 additions, 0 deletions...la/org/apache/spark/ml/classification/GBTClassifier.scala
- mllib/src/main/scala/org/apache/spark/ml/classification/RandomForestClassifier.scala 185 additions, 0 deletions...ache/spark/ml/classification/RandomForestClassifier.scala
- mllib/src/main/scala/org/apache/spark/ml/impl/tree/treeParams.scala 210 additions, 39 deletions...main/scala/org/apache/spark/ml/impl/tree/treeParams.scala
- mllib/src/main/scala/org/apache/spark/ml/param/shared/SharedParamsCodeGen.scala 3 additions, 1 deletion...rg/apache/spark/ml/param/shared/SharedParamsCodeGen.scala
- mllib/src/main/scala/org/apache/spark/ml/param/shared/sharedParams.scala 20 additions, 0 deletions...scala/org/apache/spark/ml/param/shared/sharedParams.scala
- mllib/src/main/scala/org/apache/spark/ml/regression/DecisionTreeRegressor.scala 5 additions, 9 deletions...rg/apache/spark/ml/regression/DecisionTreeRegressor.scala
- mllib/src/main/scala/org/apache/spark/ml/regression/GBTRegressor.scala 218 additions, 0 deletions...n/scala/org/apache/spark/ml/regression/GBTRegressor.scala
- mllib/src/main/scala/org/apache/spark/ml/regression/RandomForestRegressor.scala 167 additions, 0 deletions...rg/apache/spark/ml/regression/RandomForestRegressor.scala
- mllib/src/main/scala/org/apache/spark/ml/tree/Node.scala 3 additions, 3 deletionsmllib/src/main/scala/org/apache/spark/ml/tree/Node.scala
- mllib/src/main/scala/org/apache/spark/ml/tree/Split.scala 11 additions, 11 deletionsmllib/src/main/scala/org/apache/spark/ml/tree/Split.scala
- mllib/src/main/scala/org/apache/spark/ml/tree/treeModels.scala 39 additions, 7 deletions.../src/main/scala/org/apache/spark/ml/tree/treeModels.scala
- mllib/src/test/java/org/apache/spark/ml/classification/JavaDecisionTreeClassifierSuite.java 4 additions, 6 deletions...rk/ml/classification/JavaDecisionTreeClassifierSuite.java
- mllib/src/test/java/org/apache/spark/ml/classification/JavaGBTClassifierSuite.java 100 additions, 0 deletions...pache/spark/ml/classification/JavaGBTClassifierSuite.java
- mllib/src/test/java/org/apache/spark/ml/classification/JavaRandomForestClassifierSuite.java 103 additions, 0 deletions...rk/ml/classification/JavaRandomForestClassifierSuite.java
Loading
Please register or sign in to comment