Commits · fd0374b9de2e32d55fb14c371a98f0f39c30a17a · cs525-sp18-g07 / spark

Sep 29, 2012
- Comment · fd0374b9
  Matei Zaharia authored 12 years ago
  
  fd0374b9
- Removed Logging trait from CoalescedRDD since we don't log anything · 5718cef2
  Matei Zaharia authored 12 years ago
  
  5718cef2
- Merge pull request #228 from rxin/dev · 4a74e863
  Matei Zaharia authored 12 years ago
  
  Added mapPartitionsWithSplit to the programming guide.
  4a74e863
- Added a CoalescedRDD class for reducing the number of partitions in an RDD. · 143ef4f9
  Matei Zaharia authored 12 years ago
  
  143ef4f9
- Comment · c45758dd
  Matei Zaharia authored 12 years ago
  
  c45758dd
- Merge branch 'dev' of github.com:mesos/spark into dev · ebd52347
  Matei Zaharia authored 12 years ago
  
  ebd52347
- Made BlockManager unmap memory-mapped files when necessary to reduce the number of open files. Also optimized sending of disk-based blocks. · 9b326d01
  Matei Zaharia authored 12 years ago
  
  9b326d01
- Added mapPartitionsWithSplit to the programming guide. · f5812d03
  Reynold Xin authored 12 years ago
  
  f5812d03
- Merge pull request #227 from JoshRosen/fix/distinct_numsplits · 2f11e3c2
  Matei Zaharia authored 12 years ago
  
  Allow controlling number of splits in distinct().
  2f11e3c2
- Use null as dummy value in distinct(). · 8654165e
  Josh Rosen authored 12 years ago
  
  8654165e
- Allow controlling number of splits in distinct(). · 37c199bb
  Josh Rosen authored 12 years ago
  
  37c199bb
- Don't create a Cache in SparkEnv because we don't use it · 56dcad59
  Matei Zaharia authored 12 years ago
  
  56dcad59
- Logging tweaks · 1d44644f
  Matei Zaharia authored 12 years ago
  
  1d44644f
Sep 28, 2012
- Renamed subdirs option · 815d6bd6
  Matei Zaharia authored 12 years ago
  
  815d6bd6
- Made subdirs per local dir configurable, and reduced lock usage a bit · e54e1d70
  Matei Zaharia authored 12 years ago
  
  e54e1d70
- Made disk store use multiple directories, deleted ShuffleManager · ae8c7d6c
  Matei Zaharia authored 12 years ago
  
  ae8c7d6c
- Print and track user call sites in more places in Spark · 3d726799
  Matei Zaharia authored 12 years ago
  
  3d726799
- Merge pull request #225 from pwendell/dev · 9f6efbf0
  Matei Zaharia authored 12 years ago
  
  Log message which records RDD origin
  9f6efbf0
- Changed the way tasks' dependency files are sent to workers so that custom serializers or Kryo registrators can be loaded. · 0121a26b
  Matei Zaharia authored 12 years ago
  
  0121a26b
- Fixing some whitespace issues · 9fc78f8f
  Patrick Wendell authored 12 years ago
  
  9fc78f8f
- Changes based on Matei's comments · bc909c29
  Patrick Wendell authored 12 years ago
  
  bc909c29
- Log message which records RDD origin · c387e40f
  Patrick Wendell authored 12 years ago
  
  This adds tracking to determine the "origin" of an RDD. Origin is defined by the boundary between the user's code and the Spark code during an RDD's instantiation. It is meant to help users understand where a Spark RDD is coming from in their code. This patch also logs origin data when stages are submitted to the scheduler. Finally, it adds a new log message to fix an inconsistency in the way that dependent stages (those missing parents) and independent stages (those without) are logged during submission.
  c387e40f
- Fixed a bug where isLocal was set to false when using local[K] · 2a8bfbca
  Matei Zaharia authored 12 years ago
  
  2a8bfbca
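
The "origin" idea described in commit c387e40f (the boundary between user code and Spark code at RDD instantiation) can be illustrated with a small sketch. This is not the code from that patch: the `Frame` type, the `find_origin` function, the `spark.` package prefix, and every class name, file name, and line number in the sample stack are assumptions made up for illustration. The sketch walks a stack trace from the innermost call outward, remembers the last library method it saw, and stops at the first frame that belongs to user code.

```python
from typing import NamedTuple, Optional, Sequence


class Frame(NamedTuple):
    """One stack frame; a hypothetical stand-in for a JVM stack trace element."""
    class_name: str
    method: str
    file: str
    line: int


def find_origin(stack: Sequence[Frame], library_prefix: str = "spark.") -> Optional[str]:
    """Describe the first user frame as "<last library method> at <file>:<line>".

    `stack` is ordered innermost call first. Returns None when every frame
    belongs to the library, so no user/library boundary exists.
    """
    last_library_method = "<unknown>"
    for frame in stack:
        if frame.class_name.startswith(library_prefix):
            # Still inside library code: remember the most recent method seen.
            last_library_method = frame.method
        else:
            # First frame outside the library: this is the boundary.
            return f"{last_library_method} at {frame.file}:{frame.line}"
    return None


# Made-up stack for illustration: a user program calls map(), which builds an RDD.
stack = [
    Frame("spark.RDD", "<init>", "RDD.scala", 50),
    Frame("spark.MappedRDD", "<init>", "MappedRDD.scala", 10),
    Frame("spark.RDD", "map", "RDD.scala", 120),
    Frame("example.WordCount", "main", "WordCount.scala", 17),
    Frame("example.Launcher", "run", "Launcher.scala", 5),
]
print(find_origin(stack))  # map at WordCount.scala:17
```

A real stack trace would also contain runtime frames (for example, the call that captured the trace) that would need to be skipped before this scan; the sketch leaves that out.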
Sep 27, 2012
- Fix a bug in JAR fetcher that made it always fetch the JAR · 4a138403
  Matei Zaharia authored 12 years ago
  
  4a138403
- Added an option to compress blocks in the block store · 009b0e37
  Matei Zaharia authored 12 years ago
  
  009b0e37
- Renamed storage levels to something cleaner; fixes #223. · 7bcb08ce
  Matei Zaharia authored 12 years ago
  
  7bcb08ce
- Merge branch 'dev' of github.com:mesos/spark into dev · 0850d641
  Matei Zaharia authored 12 years ago
  
  0850d641
- Minor typos · bf18e099
  Matei Zaharia authored 12 years ago
  
  bf18e099
- Minor doc fixes · a4093f75
  Matei Zaharia authored 12 years ago
  
  a4093f75
- Merge pull request #222 from rxin/dev · 920fab23
  Matei Zaharia authored 12 years ago
  
  Added MapPartitionsWithSplitRDD.
  920fab23
- Updates to standalone cluster, web UI and deploy docs. · ea05fc13
  Matei Zaharia authored 12 years ago
  
  ea05fc13
Sep 26, 2012
- Allow controlling number of splits in sortByKey. · 1ef4f0fb
  Matei Zaharia authored 12 years ago
  
  1ef4f0fb
- More updates to docs, including tuning guide · 874a9fd4
  Matei Zaharia authored 12 years ago
  
  874a9fd4
- Added MapPartitionsWithSplitRDD. · 1ad1331a
  Reynold Xin authored 12 years ago
  
  1ad1331a
- Look for Kryo registrator using context class loader · ee71fa49
  Matei Zaharia authored 12 years ago
  
  ee71fa49
- Doc tweaks · 58eb44ac
  Matei Zaharia authored 12 years ago
  
  58eb44ac
- Fixed a test that was getting extremely lucky before, and increased the number of samples used for sorting · d71a358c
  Matei Zaharia authored 12 years ago
  
  d71a358c
- Doc fixes · d51d5e05
  Matei Zaharia authored 12 years ago
  
  d51d5e05
- Fixes to Java guide · c5754bb9
  Matei Zaharia authored 12 years ago
  
  c5754bb9
- Various enhancements to the programming guide and HTML/CSS · f1246cc7
  Matei Zaharia authored 12 years ago
  
  f1246cc7