    [SPARK-10595] [ML] [MLLIB] [DOCS] Various ML guide cleanups · b921fe4d
    Joseph K. Bradley authored
    Various ML guide cleanups.
    
    * ml-guide.md: Make it easier to access the algorithm-specific guides.
    * LDA user guide: EM often begins with useless topics, but running longer generally improves them dramatically. E.g., 10 iterations on a Wikipedia dataset produce useless topics, but 50 iterations produce very meaningful topics.
    * mllib-feature-extraction.html#elementwiseproduct: “w” parameter should be “scalingVec”
    * Clean up Binarizer user guide a little.
    * Document in Pipeline that users should not put the same instance into the Pipeline in more than one place.
    * spark.ml Word2Vec user guide: clean up grammar/writing
    * Chi Sq Feature Selector docs: Improve text in doc.
    
    CC: mengxr feynmanliang
    
    Author: Joseph K. Bradley <joseph@databricks.com>
    
    Closes #8752 from jkbradley/mlguide-fixes-1.5.