Skip to content
Snippets Groups Projects
  1. Aug 25, 2016
  2. Aug 24, 2016
    • gatorsmile's avatar
      [SPARK-17190][SQL] Removal of HiveSharedState · 4d0706d6
      gatorsmile authored
      ### What changes were proposed in this pull request?
      Since `HiveClient` is used to interact with the Hive metastore, it should be hidden in `HiveExternalCatalog`. After moving `HiveClient` into `HiveExternalCatalog`, `HiveSharedState` becomes a wrapper of `HiveExternalCatalog`. Thus, removal of `HiveSharedState` becomes straightforward. After removal of `HiveSharedState`, the reflection logic is directly applied on the choice of `ExternalCatalog` types, based on the configuration of `CATALOG_IMPLEMENTATION`.
      
      ~~`HiveClient` is also used/invoked by the other entities besides HiveExternalCatalog, we defines the following two APIs: getClient and getNewClient~~
      
      ### How was this patch tested?
      The existing test cases
      
      Author: gatorsmile <gatorsmile@gmail.com>
      
      Closes #14757 from gatorsmile/removeHiveClient.
      4d0706d6
    • Sameer Agarwal's avatar
      [SPARK-17228][SQL] Not infer/propagate non-deterministic constraints · ac27557e
      Sameer Agarwal authored
      ## What changes were proposed in this pull request?
      
      Given that filters based on non-deterministic constraints shouldn't be pushed down in the query plan, unnecessarily inferring them is confusing and a source of potential bugs. This patch simplifies the inferring logic by simply ignoring them.
      
      ## How was this patch tested?
      
      Added a new test in `ConstraintPropagationSuite`.
      
      Author: Sameer Agarwal <sameerag@cs.berkeley.edu>
      
      Closes #14795 from sameeragarwal/deterministic-constraints.
      ac27557e
    • Junyang Qian's avatar
      [SPARKR][MINOR] Add installation message for remote master mode and improve other messages · 3a60be4b
      Junyang Qian authored
      ## What changes were proposed in this pull request?
      
      This PR gives informative message to users when they try to connect to a remote master but don't have Spark package in their local machine.
      
      As a clarification, for now, automatic installation will only happen if they start SparkR in R console (rather than from sparkr-shell) and connect to local master. In the remote master mode, local Spark package is still needed, but we will not trigger the install.spark function because the versions have to match those on the cluster, which involves more user input. Instead, we here try to provide detailed message that may help the users.
      
      Some of the other messages have also been slightly changed.
      
      ## How was this patch tested?
      
      Manual test.
      
      Author: Junyang Qian <junyangq@databricks.com>
      
      Closes #14761 from junyangq/SPARK-16579-V1.
      3a60be4b
    • Junyang Qian's avatar
      [SPARKR][MINOR] Add more examples to window function docs · 18708f76
      Junyang Qian authored
      ## What changes were proposed in this pull request?
      
      This PR adds more examples to window function docs to make them more accessible to the users.
      
      It also fixes default value issues for `lag` and `lead`.
      
      ## How was this patch tested?
      
      Manual test, R unit test.
      
      Author: Junyang Qian <junyangq@databricks.com>
      
      Closes #14779 from junyangq/SPARKR-FixWindowFunctionDocs.
      18708f76
    • Felix Cheung's avatar
      [MINOR][SPARKR] fix R MLlib parameter documentation · 945c04bc
      Felix Cheung authored
      ## What changes were proposed in this pull request?
      
      Fixed several misplaced param tag - they should be on the spark.* method generics
      
      ## How was this patch tested?
      
      run knitr
      junyangq
      
      Author: Felix Cheung <felixcheung_m@hotmail.com>
      
      Closes #14792 from felixcheung/rdocmllib.
      945c04bc
    • hyukjinkwon's avatar
      [SPARK-16216][SQL] Read/write timestamps and dates in ISO 8601 and... · 29952ed0
      hyukjinkwon authored
      [SPARK-16216][SQL] Read/write timestamps and dates in ISO 8601 and dateFormat/timestampFormat option for CSV and JSON
      
      ## What changes were proposed in this pull request?
      
      ### Default - ISO 8601
      
      Currently, CSV datasource is writing `Timestamp` and `Date` as numeric form and JSON datasource is writing both as below:
      
      - CSV
        ```
        // TimestampType
        1414459800000000
        // DateType
        16673
        ```
      
      - Json
      
        ```
        // TimestampType
        1970-01-01 11:46:40.0
        // DateType
        1970-01-01
        ```
      
      So, for CSV we can't read back what we write and for JSON it becomes ambiguous because the timezone is being missed.
      
      So, this PR make both **write** `Timestamp` and `Date` in ISO 8601 formatted string (please refer the [ISO 8601 specification](https://www.w3.org/TR/NOTE-datetime)).
      
      - For `Timestamp` it becomes as below: (`yyyy-MM-dd'T'HH:mm:ss.SSSZZ`)
      
        ```
        1970-01-01T02:00:01.000-01:00
        ```
      
      - For `Date` it becomes as below (`yyyy-MM-dd`)
      
        ```
        1970-01-01
        ```
      
      ### Custom date format option - `dateFormat`
      
      This PR also adds the support to write and read dates and timestamps in a formatted string as below:
      
      - **DateType**
      
        - With `dateFormat` option (e.g. `yyyy/MM/dd`)
      
          ```
          +----------+
          |      date|
          +----------+
          |2015/08/26|
          |2014/10/27|
          |2016/01/28|
          +----------+
          ```
      
      ### Custom date format option - `timestampFormat`
      
      - **TimestampType**
      
        - With `dateFormat` option (e.g. `dd/MM/yyyy HH:mm`)
      
          ```
          +----------------+
          |            date|
          +----------------+
          |2015/08/26 18:00|
          |2014/10/27 18:30|
          |2016/01/28 20:00|
          +----------------+
          ```
      
      ## How was this patch tested?
      
      Unit tests were added in `CSVSuite` and `JsonSuite`. For JSON, existing tests cover the default cases.
      
      Author: hyukjinkwon <gurwls223@gmail.com>
      
      Closes #14279 from HyukjinKwon/SPARK-16216-json-csv.
      29952ed0
    • Alex Bozarth's avatar
      [SPARK-15083][WEB UI] History Server can OOM due to unlimited TaskUIData · 891ac2b9
      Alex Bozarth authored
      ## What changes were proposed in this pull request?
      
      Based on #12990 by tankkyo
      
      Since the History Server currently loads all application's data it can OOM if too many applications have a significant task count. `spark.ui.trimTasks` (default: false) can be set to true to trim tasks by `spark.ui.retainedTasks` (default: 10000)
      
      (This is a "quick fix" to help those running into the problem until a update of how the history server loads app data can be done)
      
      ## How was this patch tested?
      
      Manual testing and dev/run-tests
      
      ![spark-15083](https://cloud.githubusercontent.com/assets/13952758/17713694/fe82d246-63b0-11e6-9697-b87ea75ff4ef.png)
      
      Author: Alex Bozarth <ajbozart@us.ibm.com>
      
      Closes #14673 from ajbozarth/spark15083.
      891ac2b9
    • Dongjoon Hyun's avatar
      [SPARK-16983][SQL] Add `prettyName` for row_number, dense_rank, percent_rank, cume_dist · 40b30fcf
      Dongjoon Hyun authored
      ## What changes were proposed in this pull request?
      
      Currently, two-word window functions like `row_number`, `dense_rank`, `percent_rank`, and `cume_dist` are expressed without `_` in error messages. We had better show the correct names.
      
      **Before**
      ```scala
      scala> sql("select row_number()").show
      java.lang.UnsupportedOperationException: Cannot evaluate expression: rownumber()
      ```
      
      **After**
      ```scala
      scala> sql("select row_number()").show
      java.lang.UnsupportedOperationException: Cannot evaluate expression: row_number()
      ```
      
      ## How was this patch tested?
      
      Pass the Jenkins and manual.
      
      Author: Dongjoon Hyun <dongjoon@apache.org>
      
      Closes #14571 from dongjoon-hyun/SPARK-16983.
      40b30fcf
    • Sean Owen's avatar
      [SPARK-16781][PYSPARK] java launched by PySpark as gateway may not be the same... · 0b3a4be9
      Sean Owen authored
      [SPARK-16781][PYSPARK] java launched by PySpark as gateway may not be the same java used in the spark environment
      
      ## What changes were proposed in this pull request?
      
      Update to py4j 0.10.3 to enable JAVA_HOME support
      
      ## How was this patch tested?
      
      Pyspark tests
      
      Author: Sean Owen <sowen@cloudera.com>
      
      Closes #14748 from srowen/SPARK-16781.
      0b3a4be9
    • Xin Ren's avatar
      [SPARK-16445][MLLIB][SPARKR] Multilayer Perceptron Classifier wrapper in SparkR · 2fbdb606
      Xin Ren authored
      https://issues.apache.org/jira/browse/SPARK-16445
      
      ## What changes were proposed in this pull request?
      
      Create Multilayer Perceptron Classifier wrapper in SparkR
      
      ## How was this patch tested?
      
      Tested manually on local machine
      
      Author: Xin Ren <iamshrek@126.com>
      
      Closes #14447 from keypointt/SPARK-16445.
      2fbdb606
    • Junyang Qian's avatar
      [SPARKR][MINOR] Fix doc for show method · d2932a0e
      Junyang Qian authored
      ## What changes were proposed in this pull request?
      
      The original doc of `show` put methods for multiple classes together but the text only talks about `SparkDataFrame`. This PR tries to fix this problem.
      
      ## How was this patch tested?
      
      Manual test.
      
      Author: Junyang Qian <junyangq@databricks.com>
      
      Closes #14776 from junyangq/SPARK-FixShowDoc.
      d2932a0e
    • Yanbo Liang's avatar
      [MINOR][DOC] Fix wrong ml.feature.Normalizer document. · 45b786ac
      Yanbo Liang authored
      ## What changes were proposed in this pull request?
      The ```ml.feature.Normalizer``` examples illustrate L1 norm rather than L2, we should correct corresponding document.
      ![image](https://cloud.githubusercontent.com/assets/1962026/17928637/85aec284-69b0-11e6-9b13-d465ee560581.png)
      
      ## How was this patch tested?
      Doc change, no test.
      
      Author: Yanbo Liang <ybliang8@gmail.com>
      
      Closes #14787 from yanboliang/normalizer.
      45b786ac
    • VinceShieh's avatar
      [SPARK-17086][ML] Fix InvalidArgumentException issue in QuantileDiscretizer... · 92c0eaf3
      VinceShieh authored
      [SPARK-17086][ML] Fix InvalidArgumentException issue in QuantileDiscretizer when some quantiles are duplicated
      
      ## What changes were proposed in this pull request?
      
      In cases when QuantileDiscretizerSuite is called upon a numeric array with duplicated elements,  we will  take the unique elements generated from approxQuantiles as input for Bucketizer.
      
      ## How was this patch tested?
      
      An unit test is added in QuantileDiscretizerSuite
      
      QuantileDiscretizer.fit will throw an illegal exception when calling setSplits on a list of splits
      with duplicated elements. Bucketizer.setSplits should only accept either a numeric vector of two
      or more unique cut points, although that may produce less number of buckets than requested.
      
      Signed-off-by: VinceShieh <vincent.xieintel.com>
      
      Author: VinceShieh <vincent.xie@intel.com>
      
      Closes #14747 from VinceShieh/SPARK-17086.
      92c0eaf3
    • Weiqing Yang's avatar
      [MINOR][BUILD] Fix Java CheckStyle Error · 673a80d2
      Weiqing Yang authored
      ## What changes were proposed in this pull request?
      As Spark 2.0.1 will be released soon (mentioned in the spark dev mailing list), besides the critical bugs, it's better to fix the code style errors before the release.
      
      Before:
      ```
      ./dev/lint-java
      Checkstyle checks failed at following occurrences:
      [ERROR] src/main/java/org/apache/spark/util/collection/unsafe/sort/UnsafeExternalSorter.java:[525] (sizes) LineLength: Line is longer than 100 characters (found 119).
      [ERROR] src/main/java/org/apache/spark/examples/sql/streaming/JavaStructuredNetworkWordCount.java:[64] (sizes) LineLength: Line is longer than 100 characters (found 103).
      ```
      After:
      ```
      ./dev/lint-java
      Using `mvn` from path: /usr/local/bin/mvn
      Checkstyle checks passed.
      ```
      ## How was this patch tested?
      Manual.
      
      Author: Weiqing Yang <yangweiqing001@gmail.com>
      
      Closes #14768 from Sherry302/fixjavastyle.
      673a80d2
    • Wenchen Fan's avatar
      [SPARK-17186][SQL] remove catalog table type INDEX · 52fa45d6
      Wenchen Fan authored
      ## What changes were proposed in this pull request?
      
      Actually Spark SQL doesn't support index, the catalog table type `INDEX` is from Hive. However, most operations in Spark SQL can't handle index table, e.g. create table, alter table, etc.
      
      Logically index table should be invisible to end users, and Hive also generates special table name for index table to avoid users accessing it directly. Hive has special SQL syntax to create/show/drop index tables.
      
      At Spark SQL side, although we can describe index table directly, but the result is unreadable, we should use the dedicated SQL syntax to do it(e.g. `SHOW INDEX ON tbl`). Spark SQL can also read index table directly, but the result is always empty.(Can hive read index table directly?)
      
      This PR remove the table type `INDEX`, to make it clear that Spark SQL doesn't support index currently.
      
      ## How was this patch tested?
      
      existing tests.
      
      Author: Wenchen Fan <wenchen@databricks.com>
      
      Closes #14752 from cloud-fan/minor2.
      52fa45d6
    • Weiqing Yang's avatar
      [MINOR][SQL] Remove implemented functions from comments of 'HiveSessionCatalog.scala' · b9994ad0
      Weiqing Yang authored
      ## What changes were proposed in this pull request?
      This PR removes implemented functions from comments of `HiveSessionCatalog.scala`: `java_method`, `posexplode`, `str_to_map`.
      
      ## How was this patch tested?
      Manual.
      
      Author: Weiqing Yang <yangweiqing001@gmail.com>
      
      Closes #14769 from Sherry302/cleanComment.
      b9994ad0
  3. Aug 23, 2016
    • Tejas Patil's avatar
      [SPARK-16862] Configurable buffer size in `UnsafeSorterSpillReader` · c1937dd1
      Tejas Patil authored
      ## What changes were proposed in this pull request?
      
      Jira: https://issues.apache.org/jira/browse/SPARK-16862
      
      `BufferedInputStream` used in `UnsafeSorterSpillReader` uses the default 8k buffer to read data off disk. This PR makes it configurable to improve on disk reads. I have made the default value to be 1 MB as with that value I observed improved performance.
      
      ## How was this patch tested?
      
      I am relying on the existing unit tests.
      
      ## Performance
      
      After deploying this change to prod and setting the config to 1 mb, there was a 12% reduction in the CPU time and 19.5% reduction in CPU reservation time.
      
      Author: Tejas Patil <tejasp@fb.com>
      
      Closes #14726 from tejasapatil/spill_buffer_2.
      c1937dd1
    • Josh Rosen's avatar
      [SPARK-17194] Use single quotes when generating SQL for string literals · bf8ff833
      Josh Rosen authored
      When Spark emits SQL for a string literal, it should wrap the string in single quotes, not double quotes. Databases which adhere more strictly to the ANSI SQL standards, such as Postgres, allow only single-quotes to be used for denoting string literals (see http://stackoverflow.com/a/1992331/590203).
      
      Author: Josh Rosen <joshrosen@databricks.com>
      
      Closes #14763 from JoshRosen/SPARK-17194.
      bf8ff833
    • Zheng RuiFeng's avatar
      [TRIVIAL] Typo Fix · 6555ef0c
      Zheng RuiFeng authored
      ## What changes were proposed in this pull request?
      Fix a typo
      
      ## How was this patch tested?
      no tests
      
      Author: Zheng RuiFeng <ruifengz@foxmail.com>
      
      Closes #14772 from zhengruifeng/minor_numClasses.
      6555ef0c
    • hyukjinkwon's avatar
      [MINOR][DOC] Use standard quotes instead of "curly quote" marks from Mac in... · 58855991
      hyukjinkwon authored
      [MINOR][DOC] Use standard quotes instead of "curly quote" marks from Mac in structured streaming programming guides
      
      ## What changes were proposed in this pull request?
      
      This PR fixes curly quotes (`“` and `”` ) to standard quotes (`"`).
      
      This will be a actual problem when users copy and paste the examples. This would not work.
      
      This seems only happening in `structured-streaming-programming-guide.md`.
      
      ## How was this patch tested?
      
      Manually built.
      
      This will change some examples to be correctly marked down as below:
      
      ![2016-08-23 3 24 13](https://cloud.githubusercontent.com/assets/6477701/17882878/2a38332e-694a-11e6-8e84-76bdb89151e0.png)
      
      to
      
      ![2016-08-23 3 26 06](https://cloud.githubusercontent.com/assets/6477701/17882888/376eaa28-694a-11e6-8b88-32ea83997037.png)
      
      Author: hyukjinkwon <gurwls223@gmail.com>
      
      Closes #14770 from HyukjinKwon/minor-quotes.
      58855991
    • Junyang Qian's avatar
      [SPARKR][MINOR] Remove reference link for common Windows environment variables · 8fd63e80
      Junyang Qian authored
      ## What changes were proposed in this pull request?
      
      The PR removes reference link in the doc for environment variables for common Windows folders. The cran check gave code 503: service unavailable on the original link.
      
      ## How was this patch tested?
      
      Manual check.
      
      Author: Junyang Qian <junyangq@databricks.com>
      
      Closes #14767 from junyangq/SPARKR-RemoveLink.
      8fd63e80
    • Davies Liu's avatar
      [SPARK-13286] [SQL] add the next expression of SQLException as cause · 9afdfc94
      Davies Liu authored
      ## What changes were proposed in this pull request?
      
      Some JDBC driver (for example PostgreSQL) does not use the underlying exception as cause, but have another APIs (getNextException) to access that, so it it's included in the error logging, making us hard to find the root cause, especially in batch mode.
      
      This PR will pull out the next exception and add it as cause (if it's different) or suppressed (if there is another different cause).
      
      ## How was this patch tested?
      
      Can't reproduce this on the default JDBC driver, so did not add a regression test.
      
      Author: Davies Liu <davies@databricks.com>
      
      Closes #14722 from davies/keep_cause.
      9afdfc94
    • Jagadeesan's avatar
      [SPARK-17095] [Documentation] [Latex and Scala doc do not play nicely] · 97d461b7
      Jagadeesan authored
      ## What changes were proposed in this pull request?
      
      In Latex, it is common to find "}}}" when closing several expressions at once. [SPARK-16822](https://issues.apache.org/jira/browse/SPARK-16822) added Mathjax to render Latex equations in scaladoc. However, when scala doc sees "}}}" or "{{{" it treats it as a special character for code block. This results in some very strange output.
      
      Author: Jagadeesan <as2@us.ibm.com>
      
      Closes #14688 from jagadeesanas2/SPARK-17095.
      97d461b7
    • Jacek Laskowski's avatar
      [SPARK-17199] Use CatalystConf.resolver for case-sensitivity comparison · 9d376ad7
      Jacek Laskowski authored
      ## What changes were proposed in this pull request?
      
      Use `CatalystConf.resolver` consistently for case-sensitivity comparison (removed dups).
      
      ## How was this patch tested?
      
      Local build. Waiting for Jenkins to ensure clean build and test.
      
      Author: Jacek Laskowski <jacek@japila.pl>
      
      Closes #14771 from jaceklaskowski/17199-catalystconf-resolver.
      9d376ad7
    • Sean Zhong's avatar
      [SPARK-17188][SQL] Moves class QuantileSummaries to project catalyst for... · cc33460a
      Sean Zhong authored
      [SPARK-17188][SQL] Moves class QuantileSummaries to project catalyst for implementing percentile_approx
      
      ## What changes were proposed in this pull request?
      
      This is a sub-task of [SPARK-16283](https://issues.apache.org/jira/browse/SPARK-16283) (Implement percentile_approx SQL function), which moves class QuantileSummaries to project catalyst so that it can be reused when implementing aggregation function `percentile_approx`.
      
      ## How was this patch tested?
      
      This PR only does class relocation, class implementation is not changed.
      
      Author: Sean Zhong <seanzhong@databricks.com>
      
      Closes #14754 from clockfly/move_QuantileSummaries_to_catalyst.
      cc33460a
  4. Aug 22, 2016
    • Felix Cheung's avatar
      [SPARKR][MINOR] Update R DESCRIPTION file · d2b3d3e6
      Felix Cheung authored
      ## What changes were proposed in this pull request?
      
      Update DESCRIPTION
      
      ## How was this patch tested?
      
      Run install and CRAN tests
      
      Author: Felix Cheung <felixcheung_m@hotmail.com>
      
      Closes #14764 from felixcheung/rpackagedescription.
      d2b3d3e6
    • Cheng Lian's avatar
      [SPARK-17182][SQL] Mark Collect as non-deterministic · 2cdd92a7
      Cheng Lian authored
      ## What changes were proposed in this pull request?
      
      This PR marks the abstract class `Collect` as non-deterministic since the results of `CollectList` and `CollectSet` depend on the actual order of input rows.
      
      ## How was this patch tested?
      
      Existing test cases should be enough.
      
      Author: Cheng Lian <lian@databricks.com>
      
      Closes #14749 from liancheng/spark-17182-non-deterministic-collect.
      2cdd92a7
    • Shivaram Venkataraman's avatar
      [SPARK-16577][SPARKR] Add CRAN documentation checks to run-tests.sh · 920806ab
      Shivaram Venkataraman authored
      ## What changes were proposed in this pull request?
      
      (Please fill in changes proposed in this fix)
      
      ## How was this patch tested?
      
      This change adds CRAN documentation checks to be run as a part of `R/run-tests.sh` . As this script is also used by Jenkins this means that we will get documentation checks on every PR going forward.
      
      (If this patch involves UI changes, please attach a screenshot; otherwise, remove this)
      
      Author: Shivaram Venkataraman <shivaram@cs.berkeley.edu>
      
      Closes #14759 from shivaram/sparkr-cran-jenkins.
      920806ab
    • hqzizania's avatar
      [SPARK-17090][FOLLOW-UP][ML] Add expert param support to SharedParamsCodeGen · 37f0ab70
      hqzizania authored
      ## What changes were proposed in this pull request?
      
      Add expert param support to SharedParamsCodeGen where aggregationDepth a expert param is added.
      
      Author: hqzizania <hqzizania@gmail.com>
      
      Closes #14738 from hqzizania/SPARK-17090-minor.
      37f0ab70
    • gatorsmile's avatar
      [SPARK-17144][SQL] Removal of useless CreateHiveTableAsSelectLogicalPlan · 6d93f9e0
      gatorsmile authored
      ## What changes were proposed in this pull request?
      `CreateHiveTableAsSelectLogicalPlan` is a dead code after refactoring.
      
      ## How was this patch tested?
      N/A
      
      Author: gatorsmile <gatorsmile@gmail.com>
      
      Closes #14707 from gatorsmile/removeCreateHiveTable.
      6d93f9e0
    • Eric Liang's avatar
      [SPARK-16550][SPARK-17042][CORE] Certain classes fail to deserialize in block manager replication · 8e223ea6
      Eric Liang authored
      ## What changes were proposed in this pull request?
      
      This is a straightforward clone of JoshRosen 's original patch. I have follow-up changes to fix block replication for repl-defined classes as well, but those appear to be flaking tests so I'm going to leave that for SPARK-17042
      
      ## How was this patch tested?
      
      End-to-end test in ReplSuite (also more tests in DistributedSuite from the original patch).
      
      Author: Eric Liang <ekl@databricks.com>
      
      Closes #14311 from ericl/spark-16550.
      8e223ea6
    • Felix Cheung's avatar
      [SPARK-16508][SPARKR] doc updates and more CRAN check fixes · 71afeeea
      Felix Cheung authored
      ## What changes were proposed in this pull request?
      
      replace ``` ` ``` in code doc with `\code{thing}`
      remove added `...` for drop(DataFrame)
      fix remaining CRAN check warnings
      
      ## How was this patch tested?
      
      create doc with knitr
      
      junyangq
      
      Author: Felix Cheung <felixcheung_m@hotmail.com>
      
      Closes #14734 from felixcheung/rdoccleanup.
      71afeeea
    • Eric Liang's avatar
      [SPARK-17162] Range does not support SQL generation · 84770b59
      Eric Liang authored
      ## What changes were proposed in this pull request?
      
      The range operator previously didn't support SQL generation, which made it not possible to use in views.
      
      ## How was this patch tested?
      
      Unit tests.
      
      cc hvanhovell
      
      Author: Eric Liang <ekl@databricks.com>
      
      Closes #14724 from ericl/spark-17162.
      84770b59
    • Sean Zhong's avatar
      [MINOR][SQL] Fix some typos in comments and test hints · 929cb8be
      Sean Zhong authored
      ## What changes were proposed in this pull request?
      
      Fix some typos in comments and test hints
      
      ## How was this patch tested?
      
      N/A.
      
      Author: Sean Zhong <seanzhong@databricks.com>
      
      Closes #14755 from clockfly/fix_minor_typo.
      929cb8be
    • Shivaram Venkataraman's avatar
      [SPARKR][MINOR] Add Xiangrui and Felix to maintainers · 6f3cd36f
      Shivaram Venkataraman authored
      ## What changes were proposed in this pull request?
      
      This change adds Xiangrui Meng and Felix Cheung to the maintainers field in the package description.
      
      ## How was this patch tested?
      
      (Please explain how this patch was tested. E.g. unit tests, integration tests, manual tests)
      
      (If this patch involves UI changes, please attach a screenshot; otherwise, remove this)
      
      Author: Shivaram Venkataraman <shivaram@cs.berkeley.edu>
      
      Closes #14758 from shivaram/sparkr-maintainers.
      6f3cd36f
    • Felix Cheung's avatar
      [SPARK-17173][SPARKR] R MLlib refactor, cleanup, reformat, fix deprecation in test · 0583ecda
      Felix Cheung authored
      ## What changes were proposed in this pull request?
      
      refactor, cleanup, reformat, fix deprecation in test
      
      ## How was this patch tested?
      
      unit tests, manual tests
      
      Author: Felix Cheung <felixcheung_m@hotmail.com>
      
      Closes #14735 from felixcheung/rmllibutil.
      0583ecda
    • Sean Owen's avatar
      [SPARK-16320][DOC] Document G1 heap region's effect on spark 2.0 vs 1.6 · 342278c0
      Sean Owen authored
      ## What changes were proposed in this pull request?
      
      Collect GC discussion in one section, and documenting findings about G1 GC heap region size.
      
      ## How was this patch tested?
      
      Jekyll doc build
      
      Author: Sean Owen <sowen@cloudera.com>
      
      Closes #14732 from srowen/SPARK-16320.
      342278c0
    • Junyang Qian's avatar
      [SPARKR][MINOR] Fix Cache Folder Path in Windows · 209e1b3c
      Junyang Qian authored
      ## What changes were proposed in this pull request?
      
      This PR tries to fix the scheme of local cache folder in Windows. The name of the environment variable should be `LOCALAPPDATA` rather than `%LOCALAPPDATA%`.
      
      ## How was this patch tested?
      
      Manual test in Windows 7.
      
      Author: Junyang Qian <junyangq@databricks.com>
      
      Closes #14743 from junyangq/SPARKR-FixWindowsInstall.
      209e1b3c
Loading