-
- Downloads
[SPARK-5477] refactor stat.py
There is only a single `stat.py` file for the `mllib.stat` package. We recently added `MultivariateGaussian` under `mllib.stat.distribution` in Scala/Java. It would be nice to refactor `stat.py` and make it easy to expand. Note that `ChiSqTestResult` is moved from `mllib.stat` to `mllib.stat.test`. The latter is used in Scala/Java. It is only used in the return value of `Statistics.chiSqTest`, so this should be an okay change. davies Author: Xiangrui Meng <meng@databricks.com> Closes #4266 from mengxr/py-stat-refactor and squashes the following commits: 1a5e1db [Xiangrui Meng] refactor stat.py
Showing
- mllib/pom.xml 1 addition, 0 deletionsmllib/pom.xml
- python/pyspark/mllib/stat/__init__.py 24 additions, 0 deletionspython/pyspark/mllib/stat/__init__.py
- python/pyspark/mllib/stat/_statistics.py 2 additions, 53 deletionspython/pyspark/mllib/stat/_statistics.py
- python/pyspark/mllib/stat/test.py 69 additions, 0 deletionspython/pyspark/mllib/stat/test.py
- python/run-tests 1 addition, 1 deletionpython/run-tests
python/pyspark/mllib/stat/__init__.py
0 → 100644
python/pyspark/mllib/stat/test.py
0 → 100644
Please register or sign in to comment