Skip to content
Snippets Groups Projects
  1. Sep 08, 2015
    • Reynold Xin's avatar
      [RELEASE] Add more contributors & only show names in release notes. · ae74c3fa
      Reynold Xin authored
      Author: Reynold Xin <rxin@databricks.com>
      
      Closes #8660 from rxin/contrib.
      ae74c3fa
    • Michael Armbrust's avatar
      [HOTFIX] Fix build break caused by #8494 · 2143d592
      Michael Armbrust authored
      Author: Michael Armbrust <michael@databricks.com>
      
      Closes #8659 from marmbrus/testBuildBreak.
      2143d592
    • Cheng Hao's avatar
      [SPARK-10327] [SQL] Cache Table is not working while subquery has alias in its project list · d637a666
      Cheng Hao authored
      ```scala
          import org.apache.spark.sql.hive.execution.HiveTableScan
          sql("select key, value, key + 1 from src").registerTempTable("abc")
          cacheTable("abc")
      
          val sparkPlan = sql(
            """select a.key, b.key, c.key from
              |abc a join abc b on a.key=b.key
              |join abc c on a.key=c.key""".stripMargin).queryExecution.sparkPlan
      
          assert(sparkPlan.collect { case e: InMemoryColumnarTableScan => e }.size === 3) // failed
          assert(sparkPlan.collect { case e: HiveTableScan => e }.size === 0) // failed
      ```
      
      The actual plan is:
      
      ```
      == Parsed Logical Plan ==
      'Project [unresolvedalias('a.key),unresolvedalias('b.key),unresolvedalias('c.key)]
       'Join Inner, Some(('a.key = 'c.key))
        'Join Inner, Some(('a.key = 'b.key))
         'UnresolvedRelation [abc], Some(a)
         'UnresolvedRelation [abc], Some(b)
        'UnresolvedRelation [abc], Some(c)
      
      == Analyzed Logical Plan ==
      key: int, key: int, key: int
      Project [key#14,key#61,key#66]
       Join Inner, Some((key#14 = key#66))
        Join Inner, Some((key#14 = key#61))
         Subquery a
          Subquery abc
           Project [key#14,value#15,(key#14 + 1) AS _c2#16]
            MetastoreRelation default, src, None
         Subquery b
          Subquery abc
           Project [key#61,value#62,(key#61 + 1) AS _c2#58]
            MetastoreRelation default, src, None
        Subquery c
         Subquery abc
          Project [key#66,value#67,(key#66 + 1) AS _c2#63]
           MetastoreRelation default, src, None
      
      == Optimized Logical Plan ==
      Project [key#14,key#61,key#66]
       Join Inner, Some((key#14 = key#66))
        Project [key#14,key#61]
         Join Inner, Some((key#14 = key#61))
          Project [key#14]
           InMemoryRelation [key#14,value#15,_c2#16], true, 10000, StorageLevel(true, true, false, true, 1), (Project [key#14,value#15,(key#14 + 1) AS _c2#16]), Some(abc)
          Project [key#61]
           MetastoreRelation default, src, None
        Project [key#66]
         MetastoreRelation default, src, None
      
      == Physical Plan ==
      TungstenProject [key#14,key#61,key#66]
       BroadcastHashJoin [key#14], [key#66], BuildRight
        TungstenProject [key#14,key#61]
         BroadcastHashJoin [key#14], [key#61], BuildRight
          ConvertToUnsafe
           InMemoryColumnarTableScan [key#14], (InMemoryRelation [key#14,value#15,_c2#16], true, 10000, StorageLevel(true, true, false, true, 1), (Project [key#14,value#15,(key#14 + 1) AS _c2#16]), Some(abc))
          ConvertToUnsafe
           HiveTableScan [key#61], (MetastoreRelation default, src, None)
        ConvertToUnsafe
         HiveTableScan [key#66], (MetastoreRelation default, src, None)
      ```
      
      Author: Cheng Hao <hao.cheng@intel.com>
      
      Closes #8494 from chenghao-intel/weird_cache.
      d637a666
    • Tathagata Das's avatar
      [SPARK-10492] [STREAMING] [DOCUMENTATION] Update Streaming documentation about... · 52b24a60
      Tathagata Das authored
      [SPARK-10492] [STREAMING] [DOCUMENTATION] Update Streaming documentation about rate limiting and backpressure
      
      Author: Tathagata Das <tathagata.das1565@gmail.com>
      
      Closes #8656 from tdas/SPARK-10492 and squashes the following commits:
      
      986cdd6 [Tathagata Das] Added information on backpressure
      52b24a60
    • Vinod K C's avatar
      [SPARK-10468] [ MLLIB ] Verify schema before Dataframe select API call · e6f8d368
      Vinod K C authored
      Loader.checkSchema was called to verify the schema after dataframe.select(...).
      Schema verification should be done before dataframe.select(...)
      
      Author: Vinod K C <vinod.kc@huawei.com>
      
      Closes #8636 from vinodkc/fix_GaussianMixtureModel_load_verification.
      e6f8d368
    • Yin Huai's avatar
      [SPARK-10441] [SQL] Save data correctly to json. · 7a9dcbc9
      Yin Huai authored
      https://issues.apache.org/jira/browse/SPARK-10441
      
      Author: Yin Huai <yhuai@databricks.com>
      
      Closes #8597 from yhuai/timestampJson.
      7a9dcbc9
    • Yanbo Liang's avatar
      [SPARK-10470] [ML] ml.IsotonicRegressionModel.copy should set parent · f7b55dbf
      Yanbo Liang authored
      Copied model must have the same parent, but ml.IsotonicRegressionModel.copy did not set parent.
      Here fix it and add test case.
      
      Author: Yanbo Liang <ybliang8@gmail.com>
      
      Closes #8637 from yanboliang/spark-10470.
      f7b55dbf
    • Wenchen Fan's avatar
      [SPARK-10316] [SQL] respect nondeterministic expressions in PhysicalOperation · 5fd57955
      Wenchen Fan authored
      We did a lot of special handling for non-deterministic expressions in `Optimizer`. However, `PhysicalOperation` just collects all Projects and Filters and mess it up. We should respect the operators order caused by non-deterministic expressions in `PhysicalOperation`.
      
      Author: Wenchen Fan <cloud0fan@outlook.com>
      
      Closes #8486 from cloud-fan/fix.
      5fd57955
    • Yanbo Liang's avatar
      [SPARK-10480] [ML] Fix ML.LinearRegressionModel.copy() · 5b2192e8
      Yanbo Liang authored
      This PR fix two model ```copy()``` related issues:
      [SPARK-10480](https://issues.apache.org/jira/browse/SPARK-10480)
      ```ML.LinearRegressionModel.copy()``` ignored argument ```extra```, it will not take effect when users setting this parameter.
      [SPARK-10479](https://issues.apache.org/jira/browse/SPARK-10479)
      ```ML.LogisticRegressionModel.copy()``` should copy model summary if available.
      
      Author: Yanbo Liang <ybliang8@gmail.com>
      
      Closes #8641 from yanboliang/linear-regression-copy.
      5b2192e8
    • Liang-Chi Hsieh's avatar
      [SPARK-9170] [SQL] Use OrcStructInspector to be case preserving when writing ORC files · 990c9f79
      Liang-Chi Hsieh authored
      JIRA: https://issues.apache.org/jira/browse/SPARK-9170
      
      `StandardStructObjectInspector` will implicitly lowercase column names. But I think Orc format doesn't have such requirement. In fact, there is a `OrcStructInspector` specified for Orc format. We should use it when serialize rows to Orc file. It can be case preserving when writing ORC files.
      
      Author: Liang-Chi Hsieh <viirya@appier.com>
      
      Closes #7520 from viirya/use_orcstruct.
      990c9f79
    • Jacek Laskowski's avatar
      Docs small fixes · 6ceed852
      Jacek Laskowski authored
      Author: Jacek Laskowski <jacek@japila.pl>
      
      Closes #8629 from jaceklaskowski/docs-fixes.
      6ceed852
    • Stephen Hopper's avatar
      [DOC] Added R to the list of languages with "high-level API" support in the… · 9d8e838d
      Stephen Hopper authored
      … main README.
      
      Author: Stephen Hopper <shopper@shopper-osx.local>
      
      Closes #8646 from enragedginger/master.
      9d8e838d
  2. Sep 07, 2015
    • Reynold Xin's avatar
      [SPARK-9767] Remove ConnectionManager. · 5ffe752b
      Reynold Xin authored
      We introduced the Netty network module for shuffle in Spark 1.2, and has turned it on by default for 3 releases. The old ConnectionManager is difficult to maintain. If we merge the patch now, by the time it is released, it would be 1 yr for which ConnectionManager is off by default. It's time to remove it.
      
      Author: Reynold Xin <rxin@databricks.com>
      
      Closes #8161 from rxin/SPARK-9767.
      5ffe752b
  3. Sep 05, 2015
  4. Sep 04, 2015
    • Yin Huai's avatar
      [SPARK-9925] [SQL] [TESTS] Set SQLConf.SHUFFLE_PARTITIONS.key correctly for tests · 47058ca5
      Yin Huai authored
      This PR fix the failed test and conflict for #8155
      
      https://issues.apache.org/jira/browse/SPARK-9925
      
      Closes #8155
      
      Author: Yin Huai <yhuai@databricks.com>
      Author: Davies Liu <davies@databricks.com>
      
      Closes #8602 from davies/shuffle_partitions.
      47058ca5
    • Holden Karau's avatar
      [SPARK-10402] [DOCS] [ML] Add defaults to the scaladoc for params in ml/ · 22eab706
      Holden Karau authored
      We should make sure the scaladoc for params includes their default values through the models in ml/
      
      Author: Holden Karau <holden@pigscanfly.ca>
      
      Closes #8591 from holdenk/SPARK-10402-add-scaladoc-for-default-values-of-params-in-ml.
      22eab706
    • xutingjun's avatar
      [SPARK-10311] [STREAMING] Reload appId and attemptId when app starts with... · eafe3723
      xutingjun authored
      [SPARK-10311] [STREAMING] Reload appId and attemptId when app starts with checkpoint file in cluster mode
      
      Author: xutingjun <xutingjun@huawei.com>
      
      Closes #8477 from XuTingjun/streaming-attempt.
      eafe3723
    • robbins's avatar
      [SPARK-10454] [SPARK CORE] wait for empty event queue · 2e1c1755
      robbins authored
      Author: robbins <robbins@uk.ibm.com>
      
      Closes #8605 from robbinspg/DAGSchedulerSuite-fix.
      2e1c1755
    • Timothy Chen's avatar
      [SPARK-9669] [MESOS] Support PySpark on Mesos cluster mode. · b087d23e
      Timothy Chen authored
      Support running pyspark with cluster mode on Mesos!
      This doesn't upload any scripts, so if running in a remote Mesos requires the user to specify the script from a available URI.
      
      Author: Timothy Chen <tnachen@gmail.com>
      
      Closes #8349 from tnachen/mesos_python.
      b087d23e
    • Andrew Or's avatar
      [SPARK-10450] [SQL] Minor improvements to readability / style / typos etc. · 3339e6f6
      Andrew Or authored
      Author: Andrew Or <andrew@databricks.com>
      
      Closes #8603 from andrewor14/minor-sql-changes.
      3339e6f6
    • Wenchen Fan's avatar
      [SPARK-10176] [SQL] Show partially analyzed plans when checkAnswer fails to analyze · c3c0e431
      Wenchen Fan authored
      This PR takes over https://github.com/apache/spark/pull/8389.
      
      This PR improves `checkAnswer` to print the partially analyzed plan in addition to the user friendly error message, in order to aid debugging failing tests.
      
      In doing so, I ran into a conflict with the various ways that we bring a SQLContext into the tests. Depending on the trait we refer to the current context as `sqlContext`, `_sqlContext`, `ctx` or `hiveContext` with access modifiers `public`, `protected` and `private` depending on the defining class.
      
      I propose we refactor as follows:
      
      1. All tests should only refer to a `protected sqlContext` when testing general features, and `protected hiveContext` when it is a method that only exists on a `HiveContext`.
      2. All tests should only import `testImplicits._` (i.e., don't import `TestHive.implicits._`)
      
      Author: Wenchen Fan <cloud0fan@outlook.com>
      
      Closes #8584 from cloud-fan/cleanupTests.
      c3c0e431
    • Michael Armbrust's avatar
      MAINTENANCE: Automated closing of pull requests. · 804a0126
      Michael Armbrust authored
      This commit exists to close the following pull requests on Github:
      
      Closes #1890 (requested by andrewor14, JoshRosen)
      Closes #3558 (requested by JoshRosen, marmbrus)
      Closes #3890 (requested by marmbrus)
      Closes #3895 (requested by andrewor14, marmbrus)
      Closes #4055 (requested by andrewor14)
      Closes #4105 (requested by andrewor14)
      Closes #4812 (requested by marmbrus)
      Closes #5109 (requested by andrewor14)
      Closes #5178 (requested by andrewor14)
      Closes #5298 (requested by marmbrus)
      Closes #5393 (requested by marmbrus)
      Closes #5449 (requested by andrewor14)
      Closes #5468 (requested by marmbrus)
      Closes #5715 (requested by marmbrus)
      Closes #6192 (requested by marmbrus)
      Closes #6319 (requested by marmbrus)
      Closes #6326 (requested by marmbrus)
      Closes #6349 (requested by marmbrus)
      Closes #6380 (requested by andrewor14)
      Closes #6554 (requested by marmbrus)
      Closes #6696 (requested by marmbrus)
      Closes #6868 (requested by marmbrus)
      Closes #6951 (requested by marmbrus)
      Closes #7129 (requested by marmbrus)
      Closes #7188 (requested by marmbrus)
      Closes #7358 (requested by marmbrus)
      Closes #7379 (requested by marmbrus)
      Closes #7628 (requested by marmbrus)
      Closes #7715 (requested by marmbrus)
      Closes #7782 (requested by marmbrus)
      Closes #7914 (requested by andrewor14)
      Closes #8051 (requested by andrewor14)
      Closes #8269 (requested by andrewor14)
      Closes #8448 (requested by andrewor14)
      Closes #8576 (requested by andrewor14)
      804a0126
    • Shivaram Venkataraman's avatar
      [MINOR] Minor style fix in SparkR · 143e521d
      Shivaram Venkataraman authored
      `dev/lintr-r` passes on my machine now
      
      Author: Shivaram Venkataraman <shivaram@cs.berkeley.edu>
      
      Closes #8601 from shivaram/sparkr-style-fix.
      143e521d
  5. Sep 03, 2015
Loading