- Jun 09, 2012
-
-
Matei Zaharia authored
RDD partitioning in more cases where it's possible, and use iterators instead of materializing collections when doing joins.
-
- Jun 06, 2012
-
-
Matei Zaharia authored
cluster instead of on the master. Fixes #110.
-
Matei Zaharia authored
SizeEstimator. Also added a test for SizeEstimator. Fixes #136.
-
Matei Zaharia authored
-
Matei Zaharia authored
-
- May 30, 2012
-
-
Reynold Xin authored
-
Reynold Xin authored
of a weird file dedicated to accessing this variable).
-
- May 26, 2012
-
-
Matei Zaharia authored
Little refactoring and unit tests for CacheTrackerActor
-
Matei Zaharia authored
BoundedMemoryCache.put should fail when estimated size of 'value' is larger than cache capacity
-
- May 22, 2012
-
-
Richard Benkovsky authored
-
Richard Benkovsky authored
-
Richard Benkovsky authored
-
Richard Benkovsky authored
-
Richard Benkovsky authored
-
- May 20, 2012
-
-
Richard Benkovsky authored
-
- May 19, 2012
-
-
Richard Benkovsky authored
-
- May 18, 2012
-
-
Matei Zaharia authored
Return size estimation, cache usage, and cache capacity from slave nodes to CacheTracker
-
- May 16, 2012
-
-
Reynold Xin authored
to be serialized in closure serialization. This can significantly reduce the task setup time in Shark when the query involves a large number of (Hive) partitions.
-
- May 15, 2012
-
-
Reynold Xin authored
it was pretty ugly that put() should return -1 for failures.
-
- May 14, 2012
-
-
Reynold Xin authored
tracker. This is essential for building a dashboard to see the status of caches on all slaves.
-
- May 06, 2012
-
-
Matei Zaharia authored
from the same dataset.
-
- Apr 24, 2012
-
-
Matei Zaharia authored
Force serialize/deserialize task results in local execution mode.
-
Matei Zaharia authored
End task instead of just exiting in LocalScheduler for tasks that throw exceptions
-
Reynold Xin authored
would throw an exception for local tasks).
-
Reynold Xin authored
-
- Apr 23, 2012
-
-
Antonio authored
-
- Apr 20, 2012
-
-
Antonio authored
-
Matei Zaharia authored
Added the ability to set environment variables in piped RDD.
-
- Apr 17, 2012
-
-
Reynold Xin authored
-
- Apr 12, 2012
-
-
Matei Zaharia authored
-
- Apr 10, 2012
-
-
Matei Zaharia authored
Added an option (spark.closure.serializer) to specify the serializer for closures.
-
Reynold Xin authored
tasks.
-
- Apr 09, 2012
-
-
Reynold Xin authored
closures. This enables using Kryo as the closure serializer.
-
Matei Zaharia authored
-
Matei Zaharia authored
-
- Apr 06, 2012
-
-
Matei Zaharia authored
Report entry dropping in BoundedMemoryCache
-
Ankur Dave authored
-
- Apr 05, 2012
-
-
Matei Zaharia authored
-
Matei Zaharia authored
-
- Mar 30, 2012
-
-
Matei Zaharia authored
BoundedMemoryCache, in case the JVM was not started with its initial heap size equaling its maximum one (-Xms == -Xmx).
-