- Mar 17, 2012
-
-
Matei Zaharia authored
-
- May 24, 2011
-
- Apr 27, 2011
-
-
Mosharaf Chowdhury authored
-
- Jan 14, 2011
-
-
Mosharaf Chowdhury authored
After the last commit the bottleneck shifted from "requesting to tracker for mappers" (now done in batches) to "notifying tracker when threads leave" (done individually)
-
Mosharaf Chowdhury authored
Turned OFF timers in the reducers due to inconsistent behavior (sometimes they fire, sometimes they don't)
-
- Jan 13, 2011
-
-
Mosharaf Chowdhury authored
Implemented a tracker strategy that allows reducers to create concurrent connections proportional to their time remaining. Which connections to create is random though.
-
Mosharaf Chowdhury authored
Added a tracker strategy that selects random mappers for reducers. This can be used to measure tracker overhead.
-
- Jan 10, 2011
-
-
Mosharaf Chowdhury authored
-
- Jan 09, 2011
-
-
Mosharaf Chowdhury authored
-
- Jan 03, 2011
-
-
Mosharaf Chowdhury authored
- Implemented a new tracker strategy for shuffle where if a reducer is too fast its stalled until other catchup. Basic version is working, but more work is necessary.
-
- Dec 30, 2010
-
-
Mosharaf Chowdhury authored
-
Mosharaf Chowdhury authored
CustomBlockedInMemoryShuffle is receiving multiple blocks after connecting to a mapper instead of just one.
-
Mosharaf Chowdhury authored
Consumption is delayed until everything has been received. Otherwise it interferes with network performance.
-
Mosharaf Chowdhury authored
CustomBlockedLocalFileShuffle: reducers are reading multiple blocks per connections instead of just one. Sometimes ShuffleServer fails to start for small shuffle data with small block size in local VM. No problem with large block size.
-
- Dec 28, 2010
-
-
Mosharaf Chowdhury authored
Updating reception stats before consuming. Can create trouble if there is any exception during consumption (less likely,) but this frees up splits that threads can connect to instead of idling around.
-
Mosharaf Chowdhury authored
- Fixed several bugs. (Copy-paste is the bane of coding :|)
-
- Dec 27, 2010
-
-
Mosharaf Chowdhury authored
Bug fix: tracker (running in Spark master) wasn't initializing Shuffle object and was using inconsistent config values.
-
- Dec 26, 2010
-
-
Mosharaf Chowdhury authored
-
Mosharaf Chowdhury authored
Tracker framework is in place that supports pluggable tracker strategy. There are several bugs along with performance problems. - For larger data shuffle ShuffleServerThread gets "Broken Pipe" and ShuffleClient gets "Connection Reset" - There is a bug in the accounting counters of BalanceConnectionsShuffleTrackerStrategy. Some of them go below zero while decrementing which is not supposed to happen.
-
- Dec 24, 2010
-
-
Mosharaf Chowdhury authored
-
- Dec 22, 2010
-
-
Mosharaf Chowdhury authored
-
Mosharaf Chowdhury authored
CustomBlockedLocalFileShuffle has been added. This is essentially ManualBlockedLocalFileShuffle with our servers.
-
Mosharaf Chowdhury authored
There will be a new CustomBlockedLocalFileShuffle where 'Custom' will mean ManualBlockedLocalFileShuffle with custom server instead of jetty.
-
Mosharaf Chowdhury authored
Added an in-memory implementation of CustomParalleLFS. There is a serialization/deserialization bug in the implementation.
-
Mosharaf Chowdhury authored
Fixed an indexing bug in HttpBlockedLocalFileShuffle. It still doesn't work on EC2 with >5 nodes cluster.
-
Mosharaf Chowdhury authored
-
Mosharaf Chowdhury authored
-
Mosharaf Chowdhury authored
-
Mosharaf Chowdhury authored
HttpBlockedLocalFileShuffle has also been converted to have per-reducer consumption thread. Works in local mesos, but NOT on EC2 :|
-
- Dec 21, 2010
-
-
Mosharaf Chowdhury authored
Upside: No synchronized blocking on "combiners" variable. 3x faster :) Downside: Inefficient implementation. Requiring too much temporary data. Approx. 2x increase in memory requirement :( Should be fixed at some point.
-
Mosharaf Chowdhury authored
-
- Dec 19, 2010
-
-
Mosharaf Chowdhury authored
-
Mosharaf Chowdhury authored
-
Mosharaf Chowdhury authored
Renamed BlockedLocalFileShuffle to HttpBlockedLocalFileShuffle for merging with the mos-shuffle branch.
-
- Dec 16, 2010
-
-
Mosharaf Chowdhury authored
-
Mosharaf Chowdhury authored
-
Mosharaf Chowdhury authored
- Config option related to pipelining has been removed. - Summary: Basic -> Pipelining / Parallel -> NO pipelining
-
- Dec 15, 2010
-
-
Mosharaf Chowdhury authored
- Renamed some ParallelLocalFileShuffle config options for clarity.
-
- Dec 07, 2010
-
-
Mosharaf Chowdhury authored
- Changed DfsShuffle to default in RDD.scala.
-
- Dec 04, 2010
-
-
Mosharaf Chowdhury authored
-