Skip to content
Snippets Groups Projects
  1. Jan 04, 2016
    • Josh Rosen's avatar
      [SPARK-10359][PROJECT-INFRA] Use more random number in... · 9fd7a2f0
      Josh Rosen authored
      [SPARK-10359][PROJECT-INFRA] Use more random number in dev/test-dependencies.sh; fix version switching
      
      This patch aims to fix another potential source of flakiness in the `dev/test-dependencies.sh` script.
      
      pwendell's original patch and my version used `$(date +%s | tail -c6)` to generate a suffix to use when installing temporary Spark versions into the local Maven cache, but this value only changes once per second and thus is highly collision-prone when concurrent builds launch on AMPLab Jenkins. In order to reduce the potential for conflicts, this patch updates the script to call Python's random number generator instead.
      
      I also fixed a bug in how we captured the original project version; the bug was causing the exit handler code to fail.
      
      Author: Josh Rosen <joshrosen@databricks.com>
      
      Closes #10558 from JoshRosen/build-dep-tests-round-3.
      9fd7a2f0
    • Josh Rosen's avatar
      [SPARK-12612][PROJECT-INFRA] Add missing Hadoop profiles to dev/run-tests-*.py scripts and dev/deps · 0d165ec2
      Josh Rosen authored
      There are a couple of places in the `dev/run-tests-*.py` scripts which deal with Hadoop profiles, but the set of profiles that they handle does not include all Hadoop profiles defined in our POM. Similarly, the `hadoop-2.2` and `hadoop-2.6` profiles were missing from `dev/deps`.
      
      This patch updates these scripts to include all four Hadoop profiles defined in our POM.
      
      Author: Josh Rosen <joshrosen@databricks.com>
      
      Closes #10565 from JoshRosen/add-missing-hadoop-profiles-in-test-scripts.
      0d165ec2
  2. Jan 03, 2016
  3. Jan 02, 2016
    • Sean Owen's avatar
      [SPARK-12481][CORE][STREAMING][SQL] Remove usage of Hadoop deprecated APIs and... · 15bd7362
      Sean Owen authored
      [SPARK-12481][CORE][STREAMING][SQL] Remove usage of Hadoop deprecated APIs and reflection that supported 1.x
      
      Remove use of deprecated Hadoop APIs now that 2.2+ is required
      
      Author: Sean Owen <sowen@cloudera.com>
      
      Closes #10446 from srowen/SPARK-12481.
      15bd7362
    • hyukjinkwon's avatar
      [SPARK-10180][SQL] JDBC datasource are not processing EqualNullSafe filter · 94f7a12b
      hyukjinkwon authored
      This PR is followed by https://github.com/apache/spark/pull/8391.
      Previous PR fixes JDBCRDD to support null-safe equality comparison for JDBC datasource. This PR fixes the problem that it can actually return null as a result of the comparison resulting error as using the value of that comparison.
      
      Author: hyukjinkwon <gurwls223@gmail.com>
      Author: HyukjinKwon <gurwls223@gmail.com>
      
      Closes #8743 from HyukjinKwon/SPARK-10180.
      94f7a12b
    • Herman van Hovell's avatar
      [SPARK-12362][SQL][WIP] Inline Hive Parser · 970635a9
      Herman van Hovell authored
      This PR inlines the Hive SQL parser in Spark SQL.
      
      The previous (merged) incarnation of this PR passed all tests, but had and still has problems with the build. These problems are caused by a the fact that - for some reason - in some cases the ANTLR generated code is not included in the compilation fase.
      
      This PR is a WIP and should not be merged until we have sorted out the build issues.
      
      Author: Herman van Hovell <hvanhovell@questtec.nl>
      Author: Nong Li <nong@databricks.com>
      Author: Nong Li <nongli@gmail.com>
      
      Closes #10525 from hvanhovell/SPARK-12362.
      970635a9
  4. Jan 01, 2016
  5. Dec 31, 2015
    • Josh Rosen's avatar
      [SPARK-10359][PROJECT-INFRA] Multiple fixes to dev/test-dependencies.sh script · 5adec63a
      Josh Rosen authored
      This patch includes multiple fixes for the `dev/test-dependencies.sh` script (which was introduced in #10461):
      
      - Use `build/mvn --force` instead of `mvn` in one additional place.
      - Explicitly set a zero exit code on success.
      - Set `LC_ALL=C` to make `sort` results agree across machines (see https://stackoverflow.com/questions/28881/).
      - Set `should_run_build_tests=True` for `build` module (this somehow got lost).
      
      Author: Josh Rosen <joshrosen@databricks.com>
      
      Closes #10543 from JoshRosen/dep-script-fixes.
      5adec63a
    • Marcelo Vanzin's avatar
      [SPARK-3873][STREAMING] Import order fixes for streaming. · efb10cc9
      Marcelo Vanzin authored
      Also included a few miscelaneous other modules that had very few violations.
      
      Author: Marcelo Vanzin <vanzin@cloudera.com>
      
      Closes #10532 from vanzin/SPARK-3873-streaming.
      efb10cc9
    • Yin Huai's avatar
      [SPARK-12039][SQL] Re-enable HiveSparkSubmitSuite's SPARK-9757 Persist Parquet... · 5cdecb18
      Yin Huai authored
      [SPARK-12039][SQL] Re-enable HiveSparkSubmitSuite's SPARK-9757 Persist Parquet relation with decimal column
      
      https://issues.apache.org/jira/browse/SPARK-12039
      
      since we do not support hadoop1, we can re-enable this test in master.
      
      Author: Yin Huai <yhuai@databricks.com>
      
      Closes #10533 from yhuai/SPARK-12039-enable.
      5cdecb18
    • Shixiong Zhu's avatar
      [SPARK-7995][SPARK-6280][CORE] Remove AkkaRpcEnv and remove systemName from setupEndpointRef · 4f5a24d7
      Shixiong Zhu authored
      ### Remove AkkaRpcEnv
      
      Keep `SparkEnv.actorSystem` because Streaming still uses it. Will remove it and AkkaUtils after refactoring Streaming actorStream API.
      
      ### Remove systemName
      There are 2 places using `systemName`:
      * `RpcEnvConfig.name`. Actually, although it's used as `systemName` in `AkkaRpcEnv`, `NettyRpcEnv` uses it as the service name to output the log `Successfully started service *** on port ***`. Since the service name in log is useful, I keep `RpcEnvConfig.name`.
      * `def setupEndpointRef(systemName: String, address: RpcAddress, endpointName: String)`. Each `ActorSystem` has a `systemName`. Akka requires `systemName` in its URI and will refuse a connection if `systemName` is not matched. However, `NettyRpcEnv` doesn't use it. So we can remove `systemName` from `setupEndpointRef` since we are removing `AkkaRpcEnv`.
      
      ### Remove RpcEnv.uriOf
      
      `uriOf` exists because Akka uses different URI formats for with and without authentication, e.g., `akka.ssl.tcp...` and `akka.tcp://...`. But `NettyRpcEnv` uses the same format. So it's not necessary after removing `AkkaRpcEnv`.
      
      Author: Shixiong Zhu <shixiong@databricks.com>
      
      Closes #10459 from zsxwing/remove-akka-rpc-env.
      4f5a24d7
    • Davies Liu's avatar
      [SPARK-12585] [SQL] move numFields to constructor of UnsafeRow · e6c77874
      Davies Liu authored
      Right now, numFields will be passed in by pointTo(), then bitSetWidthInBytes is calculated, making pointTo() a little bit heavy.
      
      It should be part of constructor of UnsafeRow.
      
      Author: Davies Liu <davies@databricks.com>
      
      Closes #10528 from davies/numFields.
      e6c77874
  6. Dec 30, 2015
Loading