  1. Feb 16, 2016
  2. Feb 13, 2016
  3. Feb 11, 2016
    • Tathagata Das's avatar
      [STREAMING][TEST] Fix flaky streaming.FailureSuite · 219a74a7
      Tathagata Das authored
      Under some corner cases, the test suite failed to shut down the SparkContext, causing cascading failures. This fix does two things:
      - Makes sure no SparkContext is active after every test
      - Makes sure the StreamingContext is always shut down (prevents leaking of StreamingContexts as well, just in case)
      
      Author: Tathagata Das <tathagata.das1565@gmail.com>
      
      Closes #11166 from tdas/fix-failuresuite.
      219a74a7
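
The cleanup guarantee described in this commit can be illustrated with a small, self-contained Java sketch. It is not the code from the patch: `FakeContext`, `ACTIVE`, and `runTest` are hypothetical names, and the sketch only assumes that a context registers itself when created and unregisters when stopped. The point is that the stop happens even when the test body throws.

```java
import java.util.concurrent.atomic.AtomicInteger;

final class AlwaysStopSketch {
    // Number of contexts currently alive (hypothetical bookkeeping).
    static final AtomicInteger ACTIVE = new AtomicInteger();

    // Hypothetical stand-in for a context that must be stopped after use.
    static final class FakeContext implements AutoCloseable {
        FakeContext() {
            ACTIVE.incrementAndGet();
        }

        @Override
        public void close() {
            ACTIVE.decrementAndGet();
        }
    }

    // Runs a test body and stops the context even when the body throws.
    static void runTest(Runnable body) {
        try (FakeContext ctx = new FakeContext()) {
            body.run();
        }
    }

    public static void main(String[] args) {
        try {
            runTest(() -> {
                throw new IllegalStateException("simulated test failure");
            });
        } catch (IllegalStateException expected) {
            // The failure still propagates; only the cleanup is guaranteed.
        }
        System.out.println(ACTIVE.get()); // prints 0
    }
}
```

A check after every test that the active count is zero would catch any leak before it cascades into later tests.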
  4. Feb 09, 2016
    • Sean Owen's avatar
      [SPARK-13170][STREAMING] Investigate replacing SynchronizedQueue as it is deprecated · 68ed3632
      Sean Owen authored
      Replace SynchronizedQueue with synchronized access to a Queue
      
      Author: Sean Owen <sowen@cloudera.com>
      
      Closes #11111 from srowen/SPARK-13170.
      68ed3632
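
As a rough illustration of "synchronized access to a Queue", the sketch below guards a plain queue with an explicit lock instead of relying on a self-synchronizing collection. The actual change is in Scala; `GuardedQueue` and its method names are hypothetical and exist only for this example.

```java
import java.util.ArrayDeque;
import java.util.Queue;

// Hypothetical illustration: every access to the plain queue goes through
// an explicit synchronized block on the queue itself.
final class GuardedQueue<T> {
    private final Queue<T> queue = new ArrayDeque<>();

    void enqueue(T item) {
        synchronized (queue) {
            queue.add(item);
        }
    }

    // Returns the head of the queue, or null when the queue is empty.
    T dequeueOrNull() {
        synchronized (queue) {
            return queue.poll();
        }
    }

    int size() {
        synchronized (queue) {
            return queue.size();
        }
    }

    public static void main(String[] args) throws InterruptedException {
        GuardedQueue<Integer> q = new GuardedQueue<>();
        Thread[] producers = new Thread[4];
        for (int t = 0; t < producers.length; t++) {
            producers[t] = new Thread(() -> {
                for (int i = 0; i < 1000; i++) {
                    q.enqueue(i);
                }
            });
            producers[t].start();
        }
        for (Thread p : producers) {
            p.join();
        }
        System.out.println(q.size()); // prints 4000
    }
}
```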
    • Holden Karau's avatar
      [SPARK-13165][STREAMING] Replace deprecated synchronizedBuffer in streaming · 159198ef
      Holden Karau authored
      Building with Scala 2.11 results in the warning "trait SynchronizedBuffer in package mutable is deprecated: Synchronization via traits is deprecated as it is inherently unreliable. Consider java.util.concurrent.ConcurrentLinkedQueue as an alternative". We already use ConcurrentLinkedQueue elsewhere, so let's replace it.
      
      Some notes for reviewers about how the behaviour differs:
      The Seq from an implicitly converted SynchronizedBuffer would continue to receive updates; however, when we do the same conversion explicitly on the ConcurrentLinkedQueue, this isn't the case. Hence some of the (internal & test) APIs were changed to pass an Iterable. toSeq is safe to use if there are no more updates.
      
      Author: Holden Karau <holden@us.ibm.com>
      Author: tedyu <yuzhihong@gmail.com>
      
      Closes #11067 from holdenk/SPARK-13165-replace-deprecated-synchronizedBuffer-in-streaming.
      159198ef
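
The behavioural difference noted for reviewers can be reproduced in plain Java. This is only an analogy to the Scala code in the patch: `snapshot` plays the role of an explicit `toSeq`, and the class and method names are made up for the example. A reference typed as `Iterable` keeps seeing new elements, while a copy does not.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ConcurrentLinkedQueue;

final class SnapshotVersusLiveView {
    // Copies the current contents; later additions to the source are not visible.
    static <T> List<T> snapshot(Iterable<T> source) {
        List<T> copy = new ArrayList<>();
        for (T item : source) {
            copy.add(item);
        }
        return copy;
    }

    // Counts elements by iterating, so it always reflects the current state.
    static <T> int count(Iterable<T> items) {
        int n = 0;
        for (T ignored : items) {
            n++;
        }
        return n;
    }

    public static void main(String[] args) {
        ConcurrentLinkedQueue<String> output = new ConcurrentLinkedQueue<>();
        output.add("batch-1");

        Iterable<String> liveView = output;     // same underlying queue
        List<String> copied = snapshot(output); // analogous to an explicit toSeq

        output.add("batch-2");

        System.out.println(count(liveView)); // prints 2
        System.out.println(copied.size());   // prints 1
    }
}
```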
  5. Feb 04, 2016
  6. Feb 03, 2016
  7. Feb 02, 2016
  8. Feb 01, 2016
    • Shixiong Zhu's avatar
      [SPARK-6847][CORE][STREAMING] Fix stack overflow issue when updateStateByKey is followed by a checkpointed dstream · 6075573a
      Shixiong Zhu authored
      Add a local property that indicates whether to checkpoint all RDDs that are marked with the checkpoint flag, and enable it in Streaming.
      
      Author: Shixiong Zhu <shixiong@databricks.com>
      
      Closes #10934 from zsxwing/recursive-checkpoint.
      6075573a
  9. Jan 30, 2016
    • Josh Rosen's avatar
      [SPARK-6363][BUILD] Make Scala 2.11 the default Scala version · 289373b2
      Josh Rosen authored
      This patch changes Spark's build to make Scala 2.11 the default Scala version. To be clear, this does not mean that Spark will stop supporting Scala 2.10: users will still be able to compile Spark for Scala 2.10 by following the instructions on the "Building Spark" page; however, it does mean that Scala 2.11 will be the default Scala version used by our CI builds (including pull request builds).
      
      The Scala 2.11 compiler is faster than 2.10, so I think we'll be able to look forward to a slight speedup in our CI builds (it looks like it's about 2X faster for the Maven compile-only builds, for instance).
      
      After this patch is merged, I'll update Jenkins to add new compile-only jobs to ensure that Scala 2.10 compilation doesn't break.
      
      Author: Josh Rosen <joshrosen@databricks.com>
      
      Closes #10608 from JoshRosen/SPARK-6363.
      289373b2
  10. Jan 26, 2016
    • Sean Owen's avatar
      [SPARK-3369][CORE][STREAMING] Java mapPartitions Iterator->Iterable is inconsistent with Scala's Iterator->Iterator · 649e9d0f
      Sean Owen authored
      Fix Java function API methods for flatMap and mapPartitions to require producing only an Iterator, not Iterable. Also fix DStream.flatMap to require a function producing TraversableOnce only, not Traversable.
      
      CC rxin pwendell for API change; tdas since it also touches streaming.
      
      Author: Sean Owen <sowen@cloudera.com>
      
      Closes #10413 from srowen/SPARK-3369.
      649e9d0f
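
To show what "producing only an Iterator" means for a caller, here is a minimal sketch that does not depend on Spark. `FlatMapFn` and `flatMap` are hypothetical stand-ins written for this example, not Spark's own interfaces; the callback returns an `Iterator`, so its results never have to be exposed as a re-iterable collection.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

final class IteratorFlatMapSketch {
    // Hypothetical stand-in for a Java flatMap function type whose callback
    // returns an Iterator rather than an Iterable.
    interface FlatMapFn<T, R> {
        Iterator<R> call(T input);
    }

    // Applies fn to each input and concatenates the results in order.
    static <T, R> List<R> flatMap(Iterator<T> inputs, FlatMapFn<T, R> fn) {
        List<R> out = new ArrayList<>();
        while (inputs.hasNext()) {
            Iterator<R> results = fn.call(inputs.next());
            while (results.hasNext()) {
                out.add(results.next());
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> lines = Arrays.asList("a b", "c");
        List<String> words = flatMap(lines.iterator(),
                line -> Arrays.asList(line.split(" ")).iterator());
        System.out.println(words); // prints [a, b, c]
    }
}
```

Callers that already hold a collection only need to append `.iterator()` to what their function returns.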
  11. Jan 23, 2016
  12. Jan 22, 2016
    • Shixiong Zhu's avatar
      [SPARK-7997][CORE] Remove Akka from Spark Core and Streaming · bc1babd6
      Shixiong Zhu authored
      - Remove Akka dependency from core. Note: the streaming-akka project still uses Akka.
      - Remove HttpFileServer
      - Remove Akka configs from SparkConf and SSLOptions
      - Rename `spark.akka.frameSize` to `spark.rpc.message.maxSize`. I think it's still worth keeping this config because the choice between `DirectTaskResult` and `IndirectTaskResult` depends on it.
      - Update comments and docs
      
      Author: Shixiong Zhu <shixiong@databricks.com>
      
      Closes #10854 from zsxwing/remove-akka.
      bc1babd6
  13. Jan 20, 2016
    • Shixiong Zhu's avatar
      [SPARK-7799][SPARK-12786][STREAMING] Add "streaming-akka" project · b7d74a60
      Shixiong Zhu authored
      Include the following changes:
      
      1. Add "streaming-akka" project and org.apache.spark.streaming.akka.AkkaUtils for creating an actorStream
      2. Remove "StreamingContext.actorStream" and "JavaStreamingContext.actorStream"
      3. Update the ActorWordCount example and add the JavaActorWordCount example
      4. Make "streaming-zeromq" depend on "streaming-akka" and update the code accordingly
      
      Author: Shixiong Zhu <shixiong@databricks.com>
      
      Closes #10744 from zsxwing/streaming-akka-2.
      b7d74a60
    • Shixiong Zhu's avatar
      [SPARK-12847][CORE][STREAMING] Remove StreamingListenerBus and post all Streaming events to the same thread as Spark events · 944fdadf
      Shixiong Zhu authored
      Including the following changes:
      
      1. Add StreamingListenerForwardingBus, which processes WrappedStreamingListenerEvent events in `onOtherEvent` and forwards them to StreamingListener
      2. Remove StreamingListenerBus
      3. Merge AsynchronousListenerBus and LiveListenerBus to the same class LiveListenerBus
      4. Add `logEvent` method to SparkListenerEvent so that EventLoggingListener can use it to ignore WrappedStreamingListenerEvents
      
      Author: Shixiong Zhu <shixiong@databricks.com>
      
      Closes #10779 from zsxwing/streaming-listener.
      944fdadf
  14. Jan 18, 2016
    • Josh Rosen's avatar
      [SPARK-10985][CORE] Avoid passing evicted blocks throughout BlockManager · b8cb548a
      Josh Rosen authored
      This patch refactors portions of the BlockManager and CacheManager in order to avoid having to pass `evictedBlocks` lists throughout the code. It appears that these lists were only consumed by `TaskContext.taskMetrics`, so the new code now directly updates the metrics from the lower-level BlockManager methods.
      
      Author: Josh Rosen <joshrosen@databricks.com>
      
      Closes #10776 from JoshRosen/SPARK-10985.
      b8cb548a
  15. Jan 12, 2016
  16. Jan 11, 2016
  17. Jan 10, 2016
    • Marcelo Vanzin's avatar
      [SPARK-3873][BUILD] Enable import ordering error checking. · 6439a825
      Marcelo Vanzin authored
      Turn import ordering violations into build errors, plus a few adjustments
      to account for how the checker behaves. I'm a little on the fence about
      whether the existing code is right, but it's easier to appease the checker
      than to discuss what's the more correct order here.
      
      Plus a few fixes to imports that crept in since my recent cleanups.
      
      Author: Marcelo Vanzin <vanzin@cloudera.com>
      
      Closes #10612 from vanzin/SPARK-3873-enable.
      6439a825
  18. Jan 08, 2016
  19. Jan 07, 2016
  20. Jan 06, 2016
    • Josh Rosen's avatar
      [SPARK-7689] Remove TTL-based metadata cleaning in Spark 2.0 · 8e19c766
      Josh Rosen authored
      This PR removes `spark.cleaner.ttl` and the associated TTL-based metadata cleaning code.
      
      Now that we have the `ContextCleaner` and a timer to trigger periodic GCs, I don't think that `spark.cleaner.ttl` is necessary anymore. The TTL-based cleaning isn't enabled by default, isn't included in our end-to-end tests, and has been a source of user confusion when it is misconfigured. If the TTL is set too low, data which is still being used may be evicted / deleted, leading to hard-to-diagnose bugs.
      
      For all of these reasons, I think that we should remove this functionality in Spark 2.0. Additional benefits of doing this include marginally reduced memory usage, since we no longer need to store timestamps in hashmaps, and a handful fewer threads.
      
      Author: Josh Rosen <joshrosen@databricks.com>
      
      Closes #10534 from JoshRosen/remove-ttl-based-cleaning.
      8e19c766
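
The hazard described in this commit message (a TTL set too low evicts data that is still in use) can be seen in a toy model. The sketch below is not Spark's cleaner; `TtlCacheSketch` and its methods are hypothetical, and the clock is passed in explicitly so the example stays deterministic.

```java
import java.util.HashMap;
import java.util.Map;

final class TtlCacheSketch {
    private final long ttlMillis;
    // Each entry carries an insertion timestamp: the extra per-entry
    // bookkeeping that TTL-based cleaning requires.
    private final Map<String, Long> insertedAt = new HashMap<>();
    private final Map<String, String> values = new HashMap<>();

    TtlCacheSketch(long ttlMillis) {
        this.ttlMillis = ttlMillis;
    }

    void put(String key, String value, long nowMillis) {
        values.put(key, value);
        insertedAt.put(key, nowMillis);
    }

    // Drops every entry older than the TTL, whether or not it is still in use.
    void clean(long nowMillis) {
        insertedAt.entrySet().removeIf(e -> nowMillis - e.getValue() > ttlMillis);
        values.keySet().retainAll(insertedAt.keySet());
    }

    String get(String key) {
        return values.get(key);
    }

    public static void main(String[] args) {
        TtlCacheSketch cache = new TtlCacheSketch(1_000);
        cache.put("rdd-1", "metadata", 0);
        cache.clean(5_000); // TTL elapsed; a long-running job may still need the entry
        System.out.println(cache.get("rdd-1")); // prints null
    }
}
```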
    • Sean Owen's avatar
      [SPARK-12604][CORE] Java count(ApproxDistinct)ByKey methods return Scala Long not Java · ac56cf60
      Sean Owen authored
      Change Java countByKey, countApproxDistinctByKey return types to use Java Long, not Scala; update similar methods for consistency on java.lang.Long.valueOf with no API change
      
      Author: Sean Owen <sowen@cloudera.com>
      
      Closes #10554 from srowen/SPARK-12604.
      ac56cf60
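
From the Java side, the benefit of a `java.lang.Long` return type is that callers can unbox the counts without casting. The sketch below is a standalone illustration with a hypothetical `countByKey` helper; it is not the Spark method, only an example of boxing explicitly via `java.lang.Long.valueOf`.

```java
import java.util.HashMap;
import java.util.Map;

final class BoxedCounts {
    // Counts occurrences of each key, boxing explicitly via Long.valueOf.
    static Map<String, Long> countByKey(String[] keys) {
        Map<String, Long> counts = new HashMap<>();
        for (String key : keys) {
            long previous = counts.getOrDefault(key, Long.valueOf(0L));
            counts.put(key, Long.valueOf(previous + 1L));
        }
        return counts;
    }

    public static void main(String[] args) {
        Map<String, Long> counts = countByKey(new String[] {"a", "b", "a"});
        long a = counts.get("a"); // unboxes directly, no cast from Object needed
        System.out.println(a);    // prints 2
    }
}
```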
    • Shixiong Zhu's avatar
      Revert "[SPARK-12672][STREAMING][UI] Use the uiRoot function instead of default root path to gain the streaming batch url." · cbaea959
      Shixiong Zhu authored
      This reverts commit 19e4e9fe. Will merge #10618 instead.
      cbaea959
    • huangzhaowei's avatar
      [SPARK-12672][STREAMING][UI] Use the uiRoot function instead of default root path to gain the streaming batch url. · 19e4e9fe
      huangzhaowei authored
      Author: huangzhaowei <carlmartinmax@gmail.com>
      
      Closes #10617 from SaintBacchus/SPARK-12672.
      19e4e9fe
  21. Jan 05, 2016
  22. Jan 04, 2016
    • Shixiong Zhu's avatar
      [SPARK-12608][STREAMING] Remove submitJobThreadPool since submitJob doesn't create a separate thread to wait for the job result · 43706bf8
      Shixiong Zhu authored
      Before #9264, submitJob would create a separate thread to wait for the job result. `submitJobThreadPool` was a workaround in `ReceiverTracker` to run these waiting-job-result threads. Now that #9264 has been merged to master and has resolved this blocking issue, `submitJobThreadPool` can be removed.
      
      Author: Shixiong Zhu <shixiong@databricks.com>
      
      Closes #10560 from zsxwing/remove-submitJobThreadPool.
      43706bf8
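
The general idea behind not needing a thread that waits for a job result can be sketched with a completion callback. This is a loose Java analogy rather than Spark's `submitJob`: `track` is a hypothetical helper, and the `CompletableFuture` stands in for whatever handle the job submission returns.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.atomic.AtomicReference;

final class CallbackInsteadOfWaitingThread {
    // Registers a callback on the job handle; no thread blocks while the job runs.
    static AtomicReference<String> track(CompletableFuture<String> job) {
        AtomicReference<String> outcome = new AtomicReference<>("pending");
        job.whenComplete((result, error) ->
                outcome.set(error == null
                        ? "finished: " + result
                        : "failed: " + error.getMessage()));
        return outcome;
    }

    public static void main(String[] args) {
        // Hypothetical job handle, completed by whoever runs the job.
        CompletableFuture<String> job = new CompletableFuture<>();
        AtomicReference<String> outcome = track(job);

        System.out.println(outcome.get()); // prints pending
        job.complete("receiver stopped");
        System.out.println(outcome.get()); // prints finished: receiver stopped
    }
}
```

With a callback in place, a dedicated pool whose only purpose is to park threads on job results has nothing left to do.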
    • guoxu1231's avatar
      [SPARK-12513][STREAMING] SocketReceiver hang in Netcat example · 962aac4d
      guoxu1231 authored
      Explicitly close the client-side socket connection before restarting the socket receiver.
      
      Author: guoxu1231 <guoxu1231@gmail.com>
      Author: Shawn Guo <guoxu1231@gmail.com>
      
      Closes #10464 from guoxu1231/SPARK-12513.
      962aac4d
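
A minimal sketch of "close before restart", using only `java.net`. `RestartableSocketClient` is a hypothetical class for illustration and is not Spark's receiver; it assumes a server is listening on the loopback interface, which the example sets up itself.

```java
import java.io.IOException;
import java.net.InetAddress;
import java.net.ServerSocket;
import java.net.Socket;

final class RestartableSocketClient {
    private final String host;
    private final int port;
    private Socket socket;

    RestartableSocketClient(String host, int port) {
        this.host = host;
        this.port = port;
    }

    void connect() throws IOException {
        socket = new Socket(host, port);
    }

    // Closes the current connection (if any) before opening a new one, so the
    // old socket is not left open across restarts. Returns the old socket.
    Socket restart() throws IOException {
        Socket previous = socket;
        if (previous != null) {
            previous.close();
        }
        connect();
        return previous;
    }

    Socket current() {
        return socket;
    }

    public static void main(String[] args) throws IOException {
        try (ServerSocket server = new ServerSocket(0, 50, InetAddress.getLoopbackAddress())) {
            RestartableSocketClient client = new RestartableSocketClient(
                    server.getInetAddress().getHostAddress(), server.getLocalPort());
            client.connect();
            Socket old = client.restart();
            System.out.println(old.isClosed());              // prints true
            System.out.println(client.current().isClosed()); // prints false
            client.current().close();
        }
    }
}
```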
  23. Jan 02, 2016
  24. Dec 31, 2015
  25. Dec 24, 2015
    • Kazuaki Ishizaki's avatar
      [SPARK-12311][CORE] Restore previous value of "os.arch" property in test suites after forcing to set specific value to "os.arch" property · 39204661
      Kazuaki Ishizaki authored
      Restore the original value of the os.arch property after each test.
      
      Since some of the tests force the os.arch property to a specific value, we need to restore the original value afterwards.
      
      Author: Kazuaki Ishizaki <ishizaki@jp.ibm.com>
      
      Closes #10289 from kiszk/SPARK-12311.
      39204661
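
The save-and-restore pattern for a system property might look like the following. This is a generic sketch, not the code from the patch: `withOsArch` is a hypothetical helper, and the `finally` block is what guarantees the original value comes back even if the test body throws.

```java
import java.util.Objects;

final class OsArchRestore {
    // Runs the body with os.arch forced to the given value, then restores
    // whatever was there before, even if the body throws.
    static void withOsArch(String forced, Runnable body) {
        String original = System.getProperty("os.arch");
        System.setProperty("os.arch", forced);
        try {
            body.run();
        } finally {
            if (original == null) {
                System.clearProperty("os.arch");
            } else {
                System.setProperty("os.arch", original);
            }
        }
    }

    public static void main(String[] args) {
        String before = System.getProperty("os.arch");
        withOsArch("amd64", () ->
                System.out.println(System.getProperty("os.arch"))); // prints amd64
        // The property is back to its original value after the call.
        System.out.println(Objects.equals(before, System.getProperty("os.arch"))); // prints true
    }
}
```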
  26. Dec 22, 2015
  27. Dec 19, 2015