[SPARK-7240][SQL] Single pass covariance calculation for dataframes
Added the calculation of covariance between two columns to DataFrames.

cc mengxr rxin

Author: Burak Yavuz <brkyvz@gmail.com>

Closes #5825 from brkyvz/df-cov and squashes the following commits:

cb18046 [Burak Yavuz] changed to sample covariance
f2e862b [Burak Yavuz] fixed failed test
51e39b8 [Burak Yavuz] moved implementation
0c6a759 [Burak Yavuz] addressed math comments
8456eca [Burak Yavuz] fix pyStyle3
aa2ad29 [Burak Yavuz] fix pyStyle2
4e97a50 [Burak Yavuz] Merge branch 'master' of github.com:apache/spark into df-cov
e3b0b85 [Burak Yavuz] addressed comments v0.1
a7115f1 [Burak Yavuz] fix python style
7dc6dbc [Burak Yavuz] reorder imports
408cb77 [Burak Yavuz] initial commit
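
For readers unfamiliar with the term in the title, here is a minimal sketch of a single-pass sample covariance in plain Python. It is not the code from this commit (the Scala implementation lives in StatFunctions.scala and is not reproduced here); the function name `sample_covariance` and the running co-moment update are assumptions chosen for illustration.

```python
from typing import Iterable, Tuple


def sample_covariance(pairs: Iterable[Tuple[float, float]]) -> float:
    """Sample covariance of (x, y) pairs, reading the data exactly once.

    Hypothetical helper for illustration only, not Spark's implementation.
    """
    n = 0
    mean_x = 0.0
    mean_y = 0.0
    co_moment = 0.0  # running sum of (x - mean_x) * (y - mean_y)

    for x, y in pairs:
        n += 1
        dx = x - mean_x             # deviation from the previous mean of x
        mean_x += dx / n            # update running mean of x
        mean_y += (y - mean_y) / n  # update running mean of y
        co_moment += dx * (y - mean_y)  # uses the updated mean of y

    if n < 2:
        raise ValueError("sample covariance needs at least two pairs")

    # Divide by n - 1 for the sample (not population) covariance.
    return co_moment / (n - 1)
```

Updating the means and the co-moment together avoids a second scan over the data and avoids subtracting two large sums, which is the usual motivation for a single-pass formulation.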
Showing 7 changed files:
- python/pyspark/sql/__init__.py (3 additions, 1 deletion)
- python/pyspark/sql/dataframe.py (35 additions, 1 deletion)
- python/pyspark/sql/tests.py (5 additions, 0 deletions)
- sql/core/src/main/scala/org/apache/spark/sql/DataFrameStatFunctions.scala (11 additions, 1 deletion)
- sql/core/src/main/scala/org/apache/spark/sql/execution/stat/StatFunctions.scala (80 additions, 0 deletions)
- sql/core/src/test/java/test/org/apache/spark/sql/JavaDataFrameSuite.java (7 additions, 0 deletions)
- sql/core/src/test/scala/org/apache/spark/sql/DataFrameStatSuite.scala (16 additions, 2 deletions)