    [SPARK-4575] [mllib] [docs] spark.ml pipelines doc + bug fixes · 469a6e5f
    Joseph K. Bradley authored
    Documentation:
    * Added ml-guide.md, linked from mllib-guide.md
    * Updated mllib-guide.md with small section pointing to ml-guide.md
    
    Examples:
    * CrossValidatorExample
    * SimpleParamsExample
    * (I copied these + the SimpleTextClassificationPipeline example into the ml-guide.md)
    
    Bug fixes:
    * PipelineModel: did not use ParamMaps correctly
    * UnaryTransformer: issues with TypeTag serialization (Thanks to mengxr for that fix!)
    
    CC: mengxr shivaram etrain
    
    Documentation for Pipelines: I know the docs are not complete, but the goal is to provide enough for interested people to get started with spark.ml, and to add more docs once the package is more established/complete.
    
    Author: Joseph K. Bradley <joseph@databricks.com>
    Author: jkbradley <joseph.kurata.bradley@gmail.com>
    Author: Xiangrui Meng <meng@databricks.com>
    
    Closes #3588 from jkbradley/ml-package-docs and squashes the following commits:
    
    d393b5c [Joseph K. Bradley] fixed bug in Pipeline (typo from last commit).  updated examples for CV and Params for spark.ml
    c38469c [Joseph K. Bradley] Updated ml-guide with CV examples
    99f88c2 [Joseph K. Bradley] Fixed bug in PipelineModel.transform* with usage of params.  Updated CrossValidatorExample to use more training examples so it is less likely to get a 0-size fold.
    ea34dc6 [jkbradley] Merge pull request #4 from mengxr/ml-package-docs
    3b83ec0 [Xiangrui Meng] replace TypeTag with explicit datatype
    41ad9b1 [Joseph K. Bradley] Added examples for spark.ml: SimpleParamsExample + Java version, CrossValidatorExample + Java version.  CrossValidatorExample not working yet.  Added programming guide for spark.ml, but need to add CrossValidatorExample to it once CrossValidatorExample works.