  1. Apr 25, 2017
  2. Apr 14, 2017
  3. Mar 28, 2017
  4. Mar 21, 2017
  5. Jan 12, 2017
  6. Dec 22, 2016
  7. Dec 15, 2016
  8. Dec 08, 2016
  9. Nov 28, 2016
  10. Nov 07, 2016
    • [SPARK-18086] Add support for Hive session vars. · 29f59c73
      Ryan Blue authored
      
      ## What changes were proposed in this pull request?
      
      This adds support for Hive variables:
      
      * Makes values set via `spark-sql --hivevar name=value` accessible
      * Adds `getHiveVar` and `setHiveVar` to the `HiveClient` interface
      * Adds a SessionVariables trait for sessions like Hive that support variables (including Hive vars)
      * Adds SessionVariables support to variable substitution
      * Adds SessionVariables support to the SET command
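
A minimal sketch of the substitution idea, in Python rather than Spark's own Scala. It assumes references are written as `${hivevar:name}` and that unknown variables are left untouched; the function name `substitute_hive_vars` and the sample query are hypothetical and not part of this patch.

```python
import re

# Matches references of the form ${hivevar:name}.
_HIVEVAR_REF = re.compile(r"\$\{hivevar:([A-Za-z_][A-Za-z0-9_]*)\}")


def substitute_hive_vars(sql: str, hive_vars: dict) -> str:
    """Replace ${hivevar:name} references with values from hive_vars.

    Unknown variables are left as-is (an assumption made for this sketch).
    """
    def replace(match):
        name = match.group(1)
        return hive_vars.get(name, match.group(0))

    return _HIVEVAR_REF.sub(replace, sql)


# As if the session had been started with: spark-sql --hivevar region=emea
hive_vars = {"region": "emea"}
query = substitute_hive_vars(
    "SELECT * FROM sales WHERE region = '${hivevar:region}'", hive_vars
)
```

Here `query` carries the literal value in place of the reference, which is the behavior the bullets above describe at the session level.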
      
      ## How was this patch tested?
      
      * Adds a test to all supported Hive versions for accessing Hive variables
      * Adds HiveVariableSubstitutionSuite
      
      Author: Ryan Blue <blue@apache.org>
      
      Closes #15738 from rdblue/SPARK-18086-add-hivevar-support.
      
      (cherry picked from commit 9b0593d5)
      Signed-off-by: Reynold Xin <rxin@databricks.com>
  11. Nov 01, 2016
    • [SPARK-17350][SQL] Disable default use of KryoSerializer in Thrift Server · 6e629815
      Josh Rosen authored
      In SPARK-4761 / #3621 (December 2014) we enabled Kryo serialization by default in the Spark Thrift Server. However, I don't think that the original rationale for doing this still holds, now that most Spark SQL serialization is performed via encoders and our UnsafeRow format.
      
      In addition, the use of Kryo as the default serializer can introduce performance problems because the creation of new KryoSerializer instances is expensive and we haven't performed instance-reuse optimizations in several code paths (including DirectTaskResult deserialization).
      
      Given all of this, I propose to revert to using JavaSerializer as the default serializer in the Thrift Server.
      
      /cc liancheng
      
      Author: Josh Rosen <joshrosen@databricks.com>
      
      Closes #14906 from JoshRosen/disable-kryo-in-thriftserver.
  12. Oct 16, 2016
    • [SPARK-17819][SQL] Support default database in connection URIs for Spark Thrift Server · 59e3eb5a
      Dongjoon Hyun authored
      ## What changes were proposed in this pull request?
      
      Currently, Spark Thrift Server ignores the default database in the connection URI. This PR adds support for it, as in the following example.
      
      ```sql
      $ bin/beeline -u jdbc:hive2://localhost:10000 -e "create database testdb"
      $ bin/beeline -u jdbc:hive2://localhost:10000/testdb -e "create table t(a int)"
      $ bin/beeline -u jdbc:hive2://localhost:10000/testdb -e "show tables"
      ...
      +------------+--------------+--+
      | tableName  | isTemporary  |
      +------------+--------------+--+
      | t          | false        |
      +------------+--------------+--+
      1 row selected (0.347 seconds)
      $ bin/beeline -u jdbc:hive2://localhost:10000 -e "show tables"
      ...
      +------------+--------------+--+
      | tableName  | isTemporary  |
      +------------+--------------+--+
      +------------+--------------+--+
      No rows selected (0.098 seconds)
      ```
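
The database name in the examples above is the path component of the JDBC URL. A rough Python sketch of how a client-side helper might extract it follows; `default_database` and the `"default"` fallback are assumptions for illustration, and this is not the Thrift Server's own parsing code.

```python
from urllib.parse import urlparse


def default_database(jdbc_url: str, fallback: str = "default") -> str:
    """Return the database named in a jdbc:hive2:// URL, or fallback if absent."""
    prefix = "jdbc:"
    if not jdbc_url.startswith(prefix):
        raise ValueError("not a JDBC URL: " + jdbc_url)
    # Drop the "jdbc:" prefix so the rest parses like an ordinary URL.
    parsed = urlparse(jdbc_url[len(prefix):])
    # The path looks like "/testdb"; anything after ';' is session settings.
    database = parsed.path.lstrip("/").split(";", 1)[0]
    return database or fallback


with_db = default_database("jdbc:hive2://localhost:10000/testdb")
without_db = default_database("jdbc:hive2://localhost:10000")
```

With the URLs from the session above, `with_db` is `testdb`, while `without_db` falls back to the assumed default name.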
      
      ## How was this patch tested?
      
      Manual.
      
      Note: I tried to add a test case for this, but I could not find a suitable test suite. I'll add a test case if someone can advise where it belongs.
      
      Author: Dongjoon Hyun <dongjoon@apache.org>
      
      Closes #15399 from dongjoon-hyun/SPARK-17819.
  13. Oct 07, 2016
    • [SPARK-17707][WEBUI] Web UI prevents spark-submit application to be finished · cff56075
      Sean Owen authored
      ## What changes were proposed in this pull request?
      
      This expands calls to Jetty's simple `ServerConnector` constructor to explicitly specify a `ScheduledExecutorScheduler` that makes daemon threads. It should otherwise result in exactly the same configuration, because the other args are copied from the constructor that is currently called.
      
      (I'm not sure we should change the Hive Thriftserver impl, but I did anyway.)
      
      This also adds `sc.stop()` to the quick start guide example.
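
The reason daemon threads matter can be illustrated outside Jetty. The sketch below is a Python analogy only, not the Jetty or Spark code: `start_scheduler` is a hypothetical helper, and the point is that a background thread created with `daemon=True` does not keep the process alive once the main thread finishes.

```python
import threading


def start_scheduler(stop: threading.Event, daemon: bool = True) -> threading.Thread:
    """Start a background 'scheduler' thread (a stand-in for a server's scheduler)."""
    def run() -> None:
        # Wake up periodically until asked to stop.
        while not stop.wait(timeout=0.05):
            pass

    thread = threading.Thread(target=run, name="scheduler", daemon=daemon)
    thread.start()
    return thread


stop = threading.Event()
scheduler = start_scheduler(stop)  # daemon=True: cannot block interpreter exit
stop.set()                         # explicit shutdown, comparable to calling sc.stop()
scheduler.join()
```

With `daemon=False` and no call to `stop.set()`, the same thread would keep the process running after the main thread returned, which is the symptom described in the issue title.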
      
      ## How was this patch tested?
      
      Existing tests; _pending_ at least manual verification of the fix.
      
      Author: Sean Owen <sowen@cloudera.com>
      
      Closes #15381 from srowen/SPARK-17707.
  14. Oct 03, 2016
    • [SPARK-17112][SQL] "select null" via JDBC triggers IllegalArgumentException in Thriftserver · c571cfb2
      Dongjoon Hyun authored
      ## What changes were proposed in this pull request?
      
      Currently, Spark Thrift Server raises `IllegalArgumentException` for queries whose column types are `NullType`, e.g., `SELECT null` or `SELECT if(true,null,null)`. This PR fixes that by returning `void` like Hive 1.2.
      
      **Before**
      ```sql
      $ bin/beeline -u jdbc:hive2://localhost:10000 -e "select null"
      Connecting to jdbc:hive2://localhost:10000
      Connected to: Spark SQL (version 2.1.0-SNAPSHOT)
      Driver: Hive JDBC (version 1.2.1.spark2)
      Transaction isolation: TRANSACTION_REPEATABLE_READ
      Error: java.lang.IllegalArgumentException: Unrecognized type name: null (state=,code=0)
      Closing: 0: jdbc:hive2://localhost:10000
      
      $ bin/beeline -u jdbc:hive2://localhost:10000 -e "select if(true,null,null)"
      Connecting to jdbc:hive2://localhost:10000
      Connected to: Spark SQL (version 2.1.0-SNAPSHOT)
      Driver: Hive JDBC (version 1.2.1.spark2)
      Transaction isolation: TRANSACTION_REPEATABLE_READ
      Error: java.lang.IllegalArgumentException: Unrecognized type name: null (state=,code=0)
      Closing: 0: jdbc:hive2://localhost:10000
      ```
      
      **After**
      ```sql
      $ bin/beeline -u jdbc:hive2://localhost:10000 -e "select null"
      Connecting to jdbc:hive2://localhost:10000
      Connected to: Spark SQL (version 2.1.0-SNAPSHOT)
      Driver: Hive JDBC (version 1.2.1.spark2)
      Transaction isolation: TRANSACTION_REPEATABLE_READ
      +-------+--+
      | NULL  |
      +-------+--+
      | NULL  |
      +-------+--+
      1 row selected (3.242 seconds)
      Beeline version 1.2.1.spark2 by Apache Hive
      Closing: 0: jdbc:hive2://localhost:10000
      
      $ bin/beeline -u jdbc:hive2://localhost:10000 -e "select if(true,null,null)"
      Connecting to jdbc:hive2://localhost:10000
      Connected to: Spark SQL (version 2.1.0-SNAPSHOT)
      Driver: Hive JDBC (version 1.2.1.spark2)
      Transaction isolation: TRANSACTION_REPEATABLE_READ
      +-------------------------+--+
      | (IF(true, NULL, NULL))  |
      +-------------------------+--+
      | NULL                    |
      +-------------------------+--+
      1 row selected (0.201 seconds)
      Beeline version 1.2.1.spark2 by Apache Hive
      Closing: 0: jdbc:hive2://localhost:10000
      ```
      
      ## How was this patch tested?
      
      * Pass the Jenkins test with a new testsuite.
      * Also manually: after starting Spark Thrift Server, run the following commands.
      ```sql
      $ bin/beeline -u jdbc:hive2://localhost:10000 -e "select null"
      $ bin/beeline -u jdbc:hive2://localhost:10000 -e "select if(true,null,null)"
      ```
      
      **Hive 1.2**
      ```sql
      hive> create table null_table as select null;
      hive> desc null_table;
      OK
      _c0                     void
      ```
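
The idea behind the fix can be sketched as a small type-name mapping. This is a Python illustration under stated assumptions, not the Scala change itself: `HIVE_TYPE_NAMES` is a hypothetical, partial stand-in for the names the Hive side recognizes, and both function names are invented for the example.

```python
# Hypothetical, partial stand-in for the type names the Hive side accepts.
HIVE_TYPE_NAMES = {"int", "string", "boolean", "double", "void"}


def to_hive_type_name(spark_type_name: str) -> str:
    """Map a Spark SQL type name to the name reported to Hive clients."""
    # Spark prints NullType as "null"; Hive 1.2 calls the same type "void".
    return "void" if spark_type_name == "null" else spark_type_name


def check_recognized(hive_type_name: str) -> str:
    """Mimic the lookup that rejected "null" before the fix."""
    if hive_type_name not in HIVE_TYPE_NAMES:
        raise ValueError("Unrecognized type name: " + hive_type_name)
    return hive_type_name


reported = check_recognized(to_hive_type_name("null"))
```

Passing `"null"` straight to `check_recognized` raises, mirroring the "Before" output; translating it first yields `void`, matching what Hive 1.2 reports for `null_table`.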
      
      Author: Dongjoon Hyun <dongjoon@apache.org>
      
      Closes #15325 from dongjoon-hyun/SPARK-17112.
  15. Aug 24, 2016
    • [SPARK-17190][SQL] Removal of HiveSharedState · 4d0706d6
      gatorsmile authored
      ### What changes were proposed in this pull request?
      Since `HiveClient` is used to interact with the Hive metastore, it should be hidden in `HiveExternalCatalog`. After moving `HiveClient` into `HiveExternalCatalog`, `HiveSharedState` becomes a wrapper of `HiveExternalCatalog`. Thus, removal of `HiveSharedState` becomes straightforward. After removal of `HiveSharedState`, the reflection logic is directly applied on the choice of `ExternalCatalog` types, based on the configuration of `CATALOG_IMPLEMENTATION`.
      
      ~~`HiveClient` is also used/invoked by the other entities besides HiveExternalCatalog, we defines the following two APIs: getClient and getNewClient~~
      
      ### How was this patch tested?
      The existing test cases
      
      Author: gatorsmile <gatorsmile@gmail.com>
      
      Closes #14757 from gatorsmile/removeHiveClient.
  16. Aug 11, 2016
  17. Aug 08, 2016
    • [SPARK-16563][SQL] fix spark sql thrift server FetchResults bug · e17a76ef
      Alice authored
      ## What changes were proposed in this pull request?
      
      Add a constant iterator that points to the head of the result. This head iterator is used to reset the working iterator when results are fetched from the first row repeatedly.
      JIRA ticket https://issues.apache.org/jira/browse/SPARK-16563
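
A minimal Python sketch of the reset-to-head idea. It assumes the full result is held in memory, and `ResultCursor` with its `from_first` flag is a hypothetical name for illustration; the actual patch works on the Thrift Server's own result iterator.

```python
from itertools import islice


class ResultCursor:
    """Serve rows in batches, with the option to restart from the first row."""

    def __init__(self, rows):
        self._rows = list(rows)         # plays the role of the constant "head"
        self._iter = iter(self._rows)   # working iterator consumed by fetches

    def fetch(self, max_rows: int, from_first: bool = False) -> list:
        if from_first:
            # Reset the working iterator to the head of the result.
            self._iter = iter(self._rows)
        return list(islice(self._iter, max_rows))


cursor = ResultCursor(["a", "b", "c"])
first_batch = cursor.fetch(2)
rest = cursor.fetch(2)
again = cursor.fetch(2, from_first=True)
```

Without the reset, a second fetch from the first row would see an exhausted iterator, which is why results could be fetched only once.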
      
      ## How was this patch tested?
      
      This bug was found when connecting Cloudera HUE to the Spark SQL Thrift Server: currently, the result of a SQL statement can be fetched only once. The fix was tested manually with Cloudera HUE. With this fix, HUE can fetch Spark SQL results repeatedly through the Thrift Server.
      
      Author: Alice <alice.gugu@gmail.com>
      Author: Alice <guhq@garena.com>
      
      Closes #14218 from alicegugu/SparkSQLFetchResultsBug.
  18. Jul 19, 2016
  19. Jul 11, 2016
    • [SPARK-16477] Bump master version to 2.1.0-SNAPSHOT · ffcb6e05
      Reynold Xin authored
      ## What changes were proposed in this pull request?
      After SPARK-16476 (committed earlier today as #14128), we can finally bump the version number.
      
      ## How was this patch tested?
      N/A
      
      Author: Reynold Xin <rxin@databricks.com>
      
      Closes #14130 from rxin/SPARK-16477.
  20. Jul 05, 2016
    • [SPARK-15730][SQL] Respect the --hiveconf in the spark-sql command line · 920cb5fe
      Cheng Hao authored
      ## What changes were proposed in this pull request?
      This PR makes spark-sql (backed by SparkSQLCLIDriver) respect confs set via --hiveconf, as previous versions did. The change is that when we start SparkSQLCLIDriver, we explicitly copy the confs set through --hiveconf into SQLContext's conf (basically treating those confs as Spark SQL confs).
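
The flow can be sketched roughly as follows, in Python for brevity. Both helper names and the plain dict standing in for the SQL conf are assumptions made for this illustration; the real change lives in SparkSQLCLIDriver.

```python
def parse_hiveconf(argv):
    """Collect key=value pairs passed as: --hiveconf key=value."""
    overrides = {}
    args = iter(argv)
    for arg in args:
        if arg == "--hiveconf":
            pair = next(args, None)
            if pair is None or "=" not in pair:
                raise ValueError("--hiveconf expects key=value")
            key, value = pair.split("=", 1)  # values may themselves contain '='
            overrides[key] = value
    return overrides


def apply_to_sql_conf(sql_conf, overrides):
    """Treat each --hiveconf override as a SQL conf entry."""
    sql_conf.update(overrides)
    return sql_conf


argv = ["--hiveconf", "hive.exec.dynamic.partition=true", "-e", "select 1"]
sql_conf = apply_to_sql_conf({}, parse_hiveconf(argv))
```

After this step, a lookup of the key in `sql_conf` returns the command-line value, which is the behavior the new CliSuite test is described as covering.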
      
      ## How was this patch tested?
      A new test in CliSuite.
      
      Closes #13542
      
      Author: Cheng Hao <hao.cheng@intel.com>
      Author: Yin Huai <yhuai@databricks.com>
      
      Closes #14058 from yhuai/hiveConfThriftServer.
  21. Jun 24, 2016
  22. Jun 15, 2016
  23. Jun 14, 2016
    • doc fix of HiveThriftServer · 53bb0308
      Jeff Zhang authored
      ## What changes were proposed in this pull request?
      
      Just minor doc fix.
      
      /cc yhuai
      
      Author: Jeff Zhang <zjffdu@apache.org>
      
      Closes #13659 from zjffdu/doc_fix.
  24. May 27, 2016
  25. May 26, 2016
    • [SPARK-15552][SQL] Remove unnecessary private[sql] methods in SparkSession · 0f61d6ef
      Reynold Xin authored
      ## What changes were proposed in this pull request?
      SparkSession has a list of unnecessary private[sql] methods. These methods cause some trouble because private[sql] doesn't apply in Java. In the cases that they are easy to remove, we can simply remove them. This patch does that.
      
      As part of this pull request, I also replaced a bunch of protected[sql] with private[sql], to tighten up visibility.
      
      ## How was this patch tested?
      Updated test cases to reflect the changes.
      
      Author: Reynold Xin <rxin@databricks.com>
      
      Closes #13319 from rxin/SPARK-15552.
  26. May 25, 2016
    • [MINOR][MLLIB][STREAMING][SQL] Fix typos · 02c8072e
      lfzCarlosC authored
      Fixed typos in the source code of the [mllib], [streaming], and [SQL] components.
      
      None and obvious.
      
      Author: lfzCarlosC <lfz.carlos@gmail.com>
      
      Closes #13298 from lfzCarlosC/master.
  27. May 23, 2016