[SPARK-1592][streaming] Automatically remove streaming input blocks
The raw input data is stored as blocks in BlockManagers. Earlier, these blocks were cleared by the cleaner TTL. Now that streaming no longer requires the cleaner TTL to be set, the blocks would not get cleared. This increases Spark's memory usage, and the increase is not accounted for or shown in the Spark storage UI. It may cause the data blocks to spill over to disk, which eventually slows down the receiving of data (persisting to memory becomes bottlenecked by writing to disk).

The solution in this PR is to automatically remove those blocks. The mechanism that keeps track of which BlockRDDs (which present the raw data blocks as an RDD) can be safely cleared already exists. This PR simply uses it to explicitly remove the blocks of those BlockRDDs.

Author: Tathagata Das <tathagata.das1565@gmail.com>

Closes #512 from tdas/block-rdd-unpersist and squashes the following commits:

d25e610 [Tathagata Das] Merge remote-tracking branch 'apache/master' into block-rdd-unpersist
5f46d69 [Tathagata Das] Merge remote-tracking branch 'apache/master' into block-rdd-unpersist
2c320cd [Tathagata Das] Updated configuration with spark.streaming.unpersist setting.
2d4b2fd [Tathagata Das] Automatically removed input blocks
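The commit list mentions a `spark.streaming.unpersist` setting. As a rough illustration only, the fragment below shows how an application might set that key explicitly through `SparkConf`. The value `"true"`, the master URL, and the application name are assumptions for the sketch; the commit message does not state the setting's default or its exact semantics, so `docs/configuration.md` from this change should be treated as the authority.

```scala
import org.apache.spark.SparkConf

object UnpersistConfigSketch {
  def main(args: Array[String]): Unit = {
    // Hypothetical application settings, chosen only for this sketch.
    val conf = new SparkConf()
      .setMaster("local[2]")
      .setAppName("unpersist-config-sketch")
      // Key named in the commit list; the value "true" is an assumption.
      .set("spark.streaming.unpersist", "true")

    // Read the value back to confirm what the application will see.
    println(conf.get("spark.streaming.unpersist"))
  }
}
```

The same `conf` would then be passed to the `StreamingContext` constructor, so the setting applies to every DStream created from that context.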
Changed files:
- core/src/main/scala/org/apache/spark/rdd/BlockRDD.scala (40 additions, 5 deletions)
- docs/configuration.md (5 additions, 2 deletions)
- streaming/src/main/scala/org/apache/spark/streaming/Time.scala (1 addition, 1 deletion)
- streaming/src/main/scala/org/apache/spark/streaming/dstream/DStream.scala (13 additions, 3 deletions)
- streaming/src/test/scala/org/apache/spark/streaming/BasicOperationsSuite.scala (75 additions, 1 deletion)
- streaming/src/test/scala/org/apache/spark/streaming/InputStreamsSuite.scala (0 additions, 13 deletions)
- streaming/src/test/scala/org/apache/spark/streaming/NetworkReceiverSuite.scala (1 addition, 0 deletions)