  1. Dec 26, 2013
    • Merge pull request #295 from markhamstra/JobProgressListenerNPE · 5e69fc5b
      Matei Zaharia authored
      Avoid a lump of coal (NPE) in JobProgressListener's stocking.
    • Merge pull request #296 from witgo/master · e240bad0
      Matei Zaharia authored
      Renamed ClusterScheduler to TaskSchedulerImpl for yarn and new-yarn package
    • fix this import order · b662c88a
      liguoqiang authored
    • Merge pull request #283 from tmyklebu/master · c344ed04
      Matei Zaharia authored
      Python bindings for mllib
      
      This pull request contains Python bindings for the regression, clustering, classification, and recommendation tools in mllib.
      
      For each 'train' frontend exposed, there is a Scala stub in PythonMLLibAPI.scala and a Python stub in mllib.py.  The Python stub serialises the input RDD and any vector/matrix arguments into a mutually-understood format and calls the Scala stub.  The Scala stub deserialises the RDD and the vector/matrix arguments, calls the appropriate 'train' function, serialises the resulting model, and returns the serialised model.
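The round trip described above can be sketched in Python alone. This is a minimal illustration, not the wire format the pull request actually uses: the layout (an 8-byte little-endian length followed by little-endian float64 values) and the names `serialize_vector` and `deserialize_vector` are assumptions made for the example.

```python
import struct


def serialize_vector(values):
    """Pack floats as an int64 length followed by float64 values (little-endian).

    Hypothetical format, chosen only to illustrate the idea.
    """
    values = list(values)
    return struct.pack("<q%dd" % len(values), len(values), *values)


def deserialize_vector(data):
    """Inverse of serialize_vector; raises ValueError if the length does not match."""
    if len(data) < 8:
        raise ValueError("buffer too short for a length header")
    (n,) = struct.unpack_from("<q", data, 0)
    expected = 8 + 8 * n
    if n < 0 or len(data) != expected:
        raise ValueError("expected %d bytes, got %d" % (expected, len(data)))
    return list(struct.unpack_from("<%dd" % n, data, 8))
```

Both sides only need to agree on the header and the element encoding; validating the declared length on the receiving side catches truncated or mismatched buffers early.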
      
      ALSModel is slightly different since a MatrixFactorizationModel has RDDs inside.  The Scala stub returns a handle to a Scala MatrixFactorizationModel; prediction is done by calling the Scala predict method.
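The handle approach can be illustrated with a small wrapper that stores a reference to the remote model and forwards `predict` calls to it. Everything here is hypothetical: `BackendModelStub` merely stands in for the Scala-side model so the sketch runs without a JVM, and `ModelHandle` is not the class defined in the pull request.

```python
class BackendModelStub:
    """Stand-in for the JVM-side model (hypothetical; the real one holds RDDs)."""

    def __init__(self, user_factors, product_factors):
        self._user_factors = user_factors
        self._product_factors = product_factors

    def predict(self, user, product):
        # Predicted rating is the dot product of the two factor vectors.
        u = self._user_factors[user]
        p = self._product_factors[product]
        return sum(a * b for a, b in zip(u, p))


class ModelHandle:
    """Python-side wrapper: keeps only a reference and delegates prediction."""

    def __init__(self, backend):
        self._backend = backend

    def predict(self, user, product):
        return self._backend.predict(user, product)


handle = ModelHandle(BackendModelStub({1: [1.0, 2.0]}, {7: [3.0, 4.0]}))
rating = handle.predict(1, 7)
```

Because the model data never crosses the language boundary, nothing has to be serialised on the Python side; the cost is one cross-language call per prediction.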
      
      I have tested these bindings on an x86_64 machine running Linux. They may fail on a bi-endian (choose-your-own-endian) platform if Python's byte order differs from java.nio.ByteBuffer's idea of the native byte order.
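The byte-order risk can be made concrete with Python's `struct` module, where `=` selects the native byte order and `<` or `>` pin it explicitly. The sketch below is illustrative only and does not reflect how the bindings encode data; pinning one explicit order on both sides is one possible way to avoid the mismatch, offered here as an assumption rather than as the fix adopted upstream.

```python
import struct
import sys

value = 1.5

native = struct.pack("=d", value)  # whatever order this machine uses
little = struct.pack("<d", value)  # always little-endian
big = struct.pack(">d", value)     # always big-endian

# The two explicit encodings are byte-for-byte reversals of each other.
reversed_little = little[::-1]

# Decoding with the wrong order silently yields a different number
# rather than raising an error.
misread = struct.unpack(">d", little)[0]


def decode_little(data):
    """Decode one float64 that was written in little-endian order."""
    return struct.unpack("<d", data)[0]
```

On a little-endian machine `native` equals `little`; on a big-endian one it equals `big`, which is why an explicit order in the format string is safer than relying on the platform default.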
  2. Dec 25, 2013
  3. Dec 24, 2013
  4. Dec 23, 2013
  5. Dec 22, 2013
  6. Dec 21, 2013
  7. Dec 20, 2013