[SPARK-5597][MLLIB] save/load for decision trees and ensembles
This is based on #4444 from jkbradley with the following changes:

1. Node schema updated to

~~~
treeId: int
nodeId: Int
predict/
  |- predict: Double
  |- prob: Double
impurity: Double
isLeaf: Boolean
split/
  |- feature: Int
  |- threshold: Double
  |- featureType: Int
  |- categories: Array[Double]
leftNodeId: Integer
rightNodeId: Integer
infoGain: Double
~~~

2. Some refactor of the implementation.

Closes #4444.

Author: Joseph K. Bradley <joseph@databricks.com>
Author: Xiangrui Meng <meng@databricks.com>

Closes #4493 from mengxr/SPARK-5597 and squashes the following commits:

75e3bb6 [Xiangrui Meng] fix style
2b0033d [Xiangrui Meng] update tree export schema and refactor the implementation
45873a2 [Joseph K. Bradley] org imports
1d4c264 [Joseph K. Bradley] Added save/load for tree ensembles
dcdbf85 [Joseph K. Bradley] added save/load for decision tree but need to generalize it to ensembles
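
For readers who want to picture the node schema listed in the commit message, here is a minimal Scala sketch of flat rows with that field layout. It is an illustration only: the names `PredictData`, `SplitData`, `NodeData`, and `childrenOf` are hypothetical and are not claimed to match the code in this commit. The use of `Option` for the split, child ids, and info gain is an assumption, suggested by the nullable `Integer` type of `leftNodeId` and `rightNodeId` in the schema.

```scala
// Hypothetical case classes that only mirror the field layout from the
// commit message; they are not taken from DecisionTreeModel.scala.
case class PredictData(predict: Double, prob: Double)

case class SplitData(
    feature: Int,
    threshold: Double,
    featureType: Int,
    categories: Seq[Double])

case class NodeData(
    treeId: Int,
    nodeId: Int,
    predict: PredictData,
    impurity: Double,
    isLeaf: Boolean,
    split: Option[SplitData],   // assumption: absent for leaf nodes
    leftNodeId: Option[Int],    // assumption: absent for leaf nodes
    rightNodeId: Option[Int],   // assumption: absent for leaf nodes
    infoGain: Option[Double])   // assumption: absent for leaf nodes

object NodeSchemaSketch {
  // Build a parent -> (left child, right child) lookup from the flat rows
  // of a single tree. Leaf rows contribute no entry.
  def childrenOf(rows: Seq[NodeData]): Map[Int, (Int, Int)] =
    rows.collect {
      case NodeData(_, id, _, _, false, _, Some(l), Some(r), _) => (id, (l, r))
    }.toMap

  def main(args: Array[String]): Unit = {
    // Made-up example values: one internal node with two leaf children.
    val rows = Seq(
      NodeData(0, 1, PredictData(0.0, 0.6), 0.48, false,
        Some(SplitData(0, 0.5, 0, Seq.empty)), Some(2), Some(3), Some(0.2)),
      NodeData(0, 2, PredictData(0.0, 1.0), 0.0, true, None, None, None, None),
      NodeData(0, 3, PredictData(1.0, 1.0), 0.0, true, None, None, None, None))
    println(childrenOf(rows))
  }
}
```

Storing one row per node, keyed by `treeId` and `nodeId`, lets a single flat table hold several trees, which is presumably why the schema carries a `treeId` column for ensembles.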
Showing
- mllib/src/main/scala/org/apache/spark/mllib/tree/model/DecisionTreeModel.scala 196 additions, 1 deletion
- mllib/src/main/scala/org/apache/spark/mllib/tree/model/InformationGainStats.scala 3 additions, 1 deletion
- mllib/src/main/scala/org/apache/spark/mllib/tree/model/Node.scala 5 additions, 0 deletions
- mllib/src/main/scala/org/apache/spark/mllib/tree/model/Predict.scala 7 additions, 0 deletions
- mllib/src/main/scala/org/apache/spark/mllib/tree/model/treeEnsembleModels.scala 155 additions, 2 deletions
- mllib/src/test/scala/org/apache/spark/mllib/tree/DecisionTreeSuite.scala 118 additions, 2 deletions
- mllib/src/test/scala/org/apache/spark/mllib/tree/GradientBoostedTreesSuite.scala 52 additions, 29 deletions
- mllib/src/test/scala/org/apache/spark/mllib/tree/RandomForestSuite.scala 25 additions, 3 deletions