[SPARK-7198] [MLLIB] VectorAssembler should output ML attributes
`VectorAssembler` should carry over ML attributes. For unknown attributes, we assume numeric values. This PR handles the following cases:

1. DoubleType with ML attribute: carry over
2. DoubleType without ML attribute: numeric value
3. Scalar type: numeric value
4. VectorType with all ML attributes: carry over and update names
5. VectorType with number of ML attributes: assume all numeric
6. VectorType without ML attributes: check the first row and get the number of attributes

jkbradley

Author: Xiangrui Meng <meng@databricks.com>

Closes #6452 from mengxr/SPARK-7198 and squashes the following commits:

a9d2469 [Xiangrui Meng] add space
facdb1f [Xiangrui Meng] VectorAssembler should output ML attributes
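The six cases listed in the commit message can be sketched as a single pattern match. This is a simplified, self-contained model and not the code from this commit: the types `Attr`, `NumericAttr`, `NominalAttr`, `InputCol` and the function `resolve` are hypothetical stand-ins for Spark's attribute and column classes, and the `column_attribute` naming used for case 4 is an assumption about what "update names" means.

```scala
object AssemblerAttributesSketch {
  // Hypothetical stand-ins for ML attributes; not Spark's actual classes.
  sealed trait Attr {
    def name: Option[String]
    def withName(n: String): Attr
  }
  final case class NumericAttr(name: Option[String] = None) extends Attr {
    def withName(n: String): Attr = copy(name = Some(n))
  }
  final case class NominalAttr(name: Option[String], values: Seq[String]) extends Attr {
    def withName(n: String): Attr = copy(name = Some(n))
  }

  // Hypothetical description of one input column of the assembler.
  sealed trait InputCol { def colName: String }
  // DoubleType column, with or without an ML attribute (cases 1 and 2).
  final case class DoubleCol(colName: String, attr: Option[Attr]) extends InputCol
  // Any other scalar column, such as an integer (case 3).
  final case class ScalarCol(colName: String) extends InputCol
  // Vector column: all attributes known (case 4), only their number known
  // (case 5), or nothing known, so the first row must be inspected (case 6).
  final case class VectorCol(
      colName: String,
      attrs: Option[Seq[Attr]],
      numAttrs: Option[Int],
      firstRowSize: () => Int) extends InputCol

  // Output attributes contributed by one input column.
  def resolve(col: InputCol): Seq[Attr] = col match {
    case DoubleCol(_, Some(attr)) => Seq(attr)                    // case 1: carry over
    case DoubleCol(c, None)       => Seq(NumericAttr(Some(c)))    // case 2: numeric
    case ScalarCol(c)             => Seq(NumericAttr(Some(c)))    // case 3: numeric
    case VectorCol(c, Some(attrs), _, _) =>                       // case 4: carry over,
      // prefix existing names with the column name (assumed naming scheme)
      attrs.map(a => a.name.fold(a)(n => a.withName(c + "_" + n)))
    case VectorCol(_, None, Some(n), _) =>                        // case 5: all numeric
      Seq.fill(n)(NumericAttr())
    case VectorCol(_, None, None, firstRowSize) =>                // case 6: size from
      Seq.fill(firstRowSize())(NumericAttr())                     // the first row
  }

  def main(args: Array[String]): Unit = {
    val inputs: Seq[InputCol] = Seq(
      DoubleCol("gender", Some(NominalAttr(Some("gender"), Seq("male", "female")))),
      DoubleCol("height", None),
      ScalarCol("age"),
      VectorCol("ad", Some(Seq(NumericAttr(Some("clicks")), NumericAttr())), None, () => 2),
      VectorCol("embedding", None, Some(3), () => 3),
      VectorCol("raw", None, None, () => 2)
    )
    // The assembled vector's attributes are the per-column results, concatenated in order.
    inputs.flatMap(resolve).zipWithIndex.foreach { case (a, i) => println(s"$i: $a") }
  }
}
```

Case 6 is the only one that needs to read data rather than schema metadata, which is why the sketch models the first-row lookup as a function that is called only when neither the attributes nor their count are available.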
Showing 2 changed files:
- mllib/src/main/scala/org/apache/spark/ml/feature/VectorAssembler.scala (46 additions, 5 deletions)
- mllib/src/test/scala/org/apache/spark/ml/feature/VectorAssemblerSuite.scala (37 additions, 0 deletions)