  1. May 30, 2015
  2. May 29, 2015
    • [SPARK-6806] [SPARKR] [DOCS] Add a new SparkR programming guide · 5f48e5c3
      Shivaram Venkataraman authored
      This PR adds a new top-level SparkR programming guide. It will be useful for R users because our APIs don't directly match the Scala/Python APIs and because we need to explain SparkR without using RDDs as examples.
      
      cc rxin davies pwendell
      
      cc cafreeman -- It would be great if you could also take a look at this!
      
      Author: Shivaram Venkataraman <shivaram@cs.berkeley.edu>
      
      Closes #6490 from shivaram/sparkr-guide and squashes the following commits:
      
      d5ff360 [Shivaram Venkataraman] Add a section on HiveContext, HQL queries
      408dce5 [Shivaram Venkataraman] Fix link
      dbb86e3 [Shivaram Venkataraman] Fix minor typo
      9aff5e0 [Shivaram Venkataraman] Address comments, use dplyr-like syntax in example
      d09703c [Shivaram Venkataraman] Fix default argument in read.df
      ea816a1 [Shivaram Venkataraman] Add a new SparkR programming guide Also update write.df, read.df to handle defaults better
  3. May 28, 2015
    • [DOCS] Fix typo in documentation for Java UDF registration · 35410614
      Matt Wise authored
      This contribution is my original work and I license the work to the project under the project's open source license
      
      Author: Matt Wise <mwise@quixey.com>
      
      Closes #6447 from wisematthew/fix-typo-in-java-udf-registration-doc and squashes the following commits:
      
      e7ef5f7 [Matt Wise] Fix typo in documentation for Java UDF registration
  4. May 23, 2015
    • [SPARK-6806] [SPARKR] [DOCS] Fill in SparkR examples in programming guide · 7af3818c
      Davies Liu authored
      sqlCtx -> sqlContext
      
      You can check the docs by running:
      
      ```
      $ cd docs
      $ SKIP_SCALADOC=1 jekyll serve
      ```
      cc shivaram
      
      Author: Davies Liu <davies@databricks.com>
      
      Closes #5442 from davies/r_docs and squashes the following commits:
      
      7a12ec6 [Davies Liu] remove rdd in R docs
      8496b26 [Davies Liu] remove the docs related to RDD
      e23b9d6 [Davies Liu] delete R docs for RDD API
      222e4ff [Davies Liu] Merge branch 'master' into r_docs
      89684ce [Davies Liu] Merge branch 'r_docs' of github.com:davies/spark into r_docs
      f0a10e1 [Davies Liu] address comments from @shivaram
      f61de71 [Davies Liu] Update pairRDD.R
      3ef7cf3 [Davies Liu] use + instead of function(a,b) a+b
      2f10a77 [Davies Liu] address comments from @cafreeman
      9c2a062 [Davies Liu] mention R api together with Python API
      23f751a [Davies Liu] Fill in SparkR examples in programming guide
  5. May 12, 2015
  6. May 11, 2015
  7. May 07, 2015
  8. Apr 24, 2015
    • [SPARK-7136][Docs] Spark SQL and DataFrame Guide fix example file and paths · 59b7cfc4
      Deborah Siegel authored
      Changes the example file for Generic Load/Save Functions to users.parquet rather than people.parquet, which doesn't exist unless a later example has already been executed. Also adds file paths.
      
      Author: Deborah Siegel <deborah.siegel@gmail.com>
      Author: DEBORAH SIEGEL <deborahsiegel@d-140-142-0-49.dhcp4.washington.edu>
      Author: DEBORAH SIEGEL <deborahsiegel@DEBORAHs-MacBook-Pro.local>
      Author: DEBORAH SIEGEL <deborahsiegel@d-69-91-154-197.dhcp4.washington.edu>
      
      Closes #5693 from d3borah/master and squashes the following commits:
      
      4d5e43b [Deborah Siegel] sparkSQL doc change
      b15a497 [Deborah Siegel] Revert "sparkSQL doc change"
      5a2863c [DEBORAH SIEGEL] Merge remote-tracking branch 'upstream/master'
      91972fc [DEBORAH SIEGEL] sparkSQL doc change
      f000e59 [DEBORAH SIEGEL] Merge remote-tracking branch 'upstream/master'
      db54173 [DEBORAH SIEGEL] fixed aggregateMessages example in graphX doc
  9. Apr 23, 2015
    • Update sql-programming-guide.md · 67bccbda
      Ken Geis authored
      fix typo
      
      Author: Ken Geis <geis.ken@gmail.com>
      
      Closes #5674 from kgeis/patch-1 and squashes the following commits:
      
      5ae67de [Ken Geis] Update sql-programming-guide.md
  10. Apr 18, 2015
    • SPARK-6992 : Fix documentation example for Spark SQL on StructType · 5f095d56
      Olivier Girardot authored
      
      This patch fixes the Java examples for Spark SQL that programmatically
      define a schema and map Rows.
      
      Author: Olivier Girardot <o.girardot@lateral-thoughts.com>
      
      Closes #5569 from ogirardot/branch-1.3 and squashes the following commits:
      
      c29e58d [Olivier Girardot] SPARK-6992 : Fix documentation example for Spark SQL on StructType
      
      (cherry picked from commit c9b1ba4b16a7afe93d45bf75b128cc0dd287ded0)
      Signed-off-by: Reynold Xin <rxin@databricks.com>
  11. Apr 17, 2015
    • SPARK-6988 : Fix documentation regarding DataFrames using the Java API · d305e686
      Olivier Girardot authored
      
      This patch includes:
       * adding how to use map after a SQL query using javaRDD
       * fixing the first few Java examples that were written in Scala
      
      Thank you for your time,
      
      Olivier.
      
      Author: Olivier Girardot <o.girardot@lateral-thoughts.com>
      
      Closes #5564 from ogirardot/branch-1.3 and squashes the following commits:
      
      9f8d60e [Olivier Girardot] SPARK-6988 : Fix documentation regarding DataFrames using the Java API
      
      (cherry picked from commit 6b528dc139da594ef2e651d84bd91fe0f738a39d)
      Signed-off-by: Reynold Xin <rxin@databricks.com>
  12. Apr 15, 2015
  13. Apr 11, 2015
  14. Apr 08, 2015
    • [SPARK-6781] [SQL] use sqlContext in python shell · 6ada4f6f
      Davies Liu authored
      Use `sqlContext` in the PySpark shell to make it consistent with the SQL programming guide. `sqlCtx` is also kept for compatibility.
      
      Author: Davies Liu <davies@databricks.com>
      
      Closes #5425 from davies/sqlCtx and squashes the following commits:
      
      af67340 [Davies Liu] sqlCtx -> sqlContext
      15a278f [Davies Liu] use sqlContext in python shell
  15. Mar 26, 2015
    • [DOCS][SQL] Fix JDBC example · aad00322
      Michael Armbrust authored
      Author: Michael Armbrust <michael@databricks.com>
      
      Closes #5192 from marmbrus/fixJDBCDocs and squashes the following commits:
      
      b48a33d [Michael Armbrust] [DOCS][SQL] Fix JDBC example
  16. Mar 25, 2015
    • [DOCUMENTATION]Fixed Missing Type Import in Documentation · c5cc4146
      Bill Chambers authored
      Needed to import the types specifically, not the more general pyspark.sql
      
      Author: Bill Chambers <wchambers@ischool.berkeley.edu>
      Author: anabranch <wac.chambers@gmail.com>
      
      Closes #5179 from anabranch/master and squashes the following commits:
      
      8fa67bf [anabranch] Corrected SqlContext Import
      603b080 [Bill Chambers] [DOCUMENTATION]Fixed Missing Type Import in Documentation
  17. Mar 22, 2015
    • [SPARK-6337][Documentation, SQL]Spark 1.3 doc fixes · 2bf40c58
      vinodkc authored
      Author: vinodkc <vinod.kc.in@gmail.com>
      
      Closes #5112 from vinodkc/spark_1.3_doc_fixes and squashes the following commits:
      
      2c6aee6 [vinodkc] Spark 1.3 doc fixes
    • SPARK-6454 [DOCS] Fix links to pyspark api · 6ef48632
      Kamil Smuga authored
      Author: Kamil Smuga <smugakamil@gmail.com>
      Author: stderr <smugakamil@gmail.com>
      
      Closes #5120 from kamilsmuga/master and squashes the following commits:
      
      fee3281 [Kamil Smuga] more python api links fixed for docs
      13240cb [Kamil Smuga] resolved merge conflicts with upstream/master
      6649b3b [Kamil Smuga] fix broken docs links to Python API
      92f03d7 [stderr] Fix links to pyspark api
  18. Mar 17, 2015
    • [SPARK-6383][SQL]Fixed compiler and errors in Dataframe examples · a012e086
      Tijo Thomas authored
      Author: Tijo Thomas <tijoparacka@gmail.com>
      
      Closes #5068 from tijoparacka/fix_sql_dataframe_example and squashes the following commits:
      
      6953ac1 [Tijo Thomas] Handled Java and Python example sections
      0751a74 [Tijo Thomas] Fixed compiler and errors in Dataframe examples
  19. Mar 13, 2015
  20. Mar 12, 2015
  21. Mar 10, 2015
    • [SPARK-5183][SQL] Update SQL Docs with JDBC and Migration Guide · 26723741
      Michael Armbrust authored
      Author: Michael Armbrust <michael@databricks.com>
      
      Closes #4958 from marmbrus/sqlDocs and squashes the following commits:
      
      9351dbc [Michael Armbrust] fix parquet example
      6877e13 [Michael Armbrust] add sql examples
      d81b7e7 [Michael Armbrust] rxins comments
      e393528 [Michael Armbrust] fix order
      19c2735 [Michael Armbrust] more on data source load/store
      00d5914 [Michael Armbrust] Update SQL Docs with JDBC and Migration Guide
  22. Mar 09, 2015
  23. Feb 17, 2015
    • [Minor] fix typo in SQL document · 31efb39c
      CodingCat authored
      Author: CodingCat <zhunansjtu@gmail.com>
      
      Closes #4656 from CodingCat/fix_typo and squashes the following commits:
      
      b41d15c [CodingCat] recover
      689fe46 [CodingCat] fix typo
  24. Feb 12, 2015
    • [SQL][DOCS] Update sql documentation · 6a1be026
      Antonio Navarro Perez authored
      Updated examples to use the new API and added the DataFrame concept.
      
      Author: Antonio Navarro Perez <ajnavarro@users.noreply.github.com>
      
      Closes #4560 from ajnavarro/ajnavarro-doc-sql-update and squashes the following commits:
      
      82ebcf3 [Antonio Navarro Perez] Changed a missing JavaSQLContext to SQLContext.
      8d5376a [Antonio Navarro Perez] fixed typo
      8196b6b [Antonio Navarro Perez] [SQL][DOCS] Update sql documentation
  25. Feb 10, 2015
    • [SPARK-5704] [SQL] [PySpark] createDataFrame from RDD with columns · ea602840
      Davies Liu authored
      Deprecate inferSchema() and applySchema() in favor of createDataFrame(), which takes an optional `schema` to create a DataFrame from an RDD. The `schema` can be a StructType or a list of column names.
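
The two schema forms mentioned above can be sketched roughly as follows. This is an illustration, not code from the commit: the helper names `df_from_column_names` and `df_from_struct_type`, the column names, and the assumption that the caller already has a PySpark `SQLContext` and an RDD of `(name, age)` tuples are all hypothetical.

```python
def df_from_column_names(sql_context, rdd):
    """Build a DataFrame by passing the schema as a list of column names."""
    # Assumes each RDD element is a (name, age) tuple; column types are inferred.
    return sql_context.createDataFrame(rdd, ["name", "age"])


def df_from_struct_type(sql_context, rdd):
    """Build a DataFrame by passing the schema as an explicit StructType."""
    # Imported lazily so this file still loads where PySpark is not installed.
    from pyspark.sql.types import IntegerType, StringType, StructField, StructType

    schema = StructType([
        StructField("name", StringType(), True),
        StructField("age", IntegerType(), True),
    ])
    return sql_context.createDataFrame(rdd, schema)
```

In a PySpark shell this might be called as `df_from_column_names(sqlContext, sc.parallelize([("Alice", 1)]))`, assuming the shell's usual `sc` and `sqlContext` variables.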
      
      Author: Davies Liu <davies@databricks.com>
      
      Closes #4498 from davies/create and squashes the following commits:
      
      08469c1 [Davies Liu] remove Scala/Java API for now
      c80a7a9 [Davies Liu] fix hive test
      d1bd8f2 [Davies Liu] cleanup applySchema
      9526e97 [Davies Liu] createDataFrame from RDD with columns
  26. Feb 05, 2015
    • [Branch-1.3] [DOC] doc fix for date · 6fa4ac1b
      Daoyuan Wang authored
      Trivial fix.
      
      Author: Daoyuan Wang <daoyuan.wang@intel.com>
      
      Closes #4400 from adrian-wang/docdate and squashes the following commits:
      
      31bbe40 [Daoyuan Wang] doc fix for date
    • [SPARK-5608] Improve SEO of Spark documentation pages · 4d74f060
      Matei Zaharia authored
      - Add meta description tags on some of the most important doc pages
      - Shorten the titles of some pages to have more relevant keywords; for
        example there's no reason to have "Spark SQL Programming Guide - Spark
        1.2.0 documentation", we can just say "Spark SQL - Spark 1.2.0
        documentation".
      
      Author: Matei Zaharia <matei@databricks.com>
      
      Closes #4381 from mateiz/docs-seo and squashes the following commits:
      
      4940563 [Matei Zaharia] [SPARK-5608] Improve SEO of Spark documentation pages
  27. Feb 03, 2015
    • [SPARK-4987] [SQL] parquet timestamp type support · 0c20ce69
      Daoyuan Wang authored
      Author: Daoyuan Wang <daoyuan.wang@intel.com>
      
      Closes #3820 from adrian-wang/parquettimestamp and squashes the following commits:
      
      b1e2a0d [Daoyuan Wang] fix for nanos
      4dadef1 [Daoyuan Wang] fix wrong read
      93f438d [Daoyuan Wang] parquet timestamp support
  28. Jan 18, 2015
  29. Dec 30, 2014
    • [SPARK-4930][SQL][DOCS]Update SQL programming guide, CACHE TABLE is eager · 2deac748
      luogankun authored
      `CACHE TABLE tbl` is now __eager__ by default, not __lazy__.
      
      Author: luogankun <luogankun@gmail.com>
      
      Closes #3773 from luogankun/SPARK-4930 and squashes the following commits:
      
      cc17b7d [luogankun] [SPARK-4930][SQL][DOCS]Update SQL programming guide, add CACHE [LAZY] TABLE [AS SELECT] ...
      bffe0e8 [luogankun] [SPARK-4930][SQL][DOCS]Update SQL programming guide, CACHE TABLE tbl is eager
    • [SPARK-4916][SQL][DOCS]Update SQL programming guide about cache section · f7a41a0e
      luogankun authored
      `SchemaRDD.cache()` now uses in-memory columnar storage.
      
      Author: luogankun <luogankun@gmail.com>
      
      Closes #3759 from luogankun/SPARK-4916 and squashes the following commits:
      
      7b39864 [luogankun] [SPARK-4916]Update SQL programming guide
      6018122 [luogankun] Merge branch 'master' of https://github.com/apache/spark into SPARK-4916
      0b93785 [luogankun] [SPARK-4916]Update SQL programming guide
      99b2336 [luogankun] [SPARK-4916]Update SQL programming guide
  30. Dec 16, 2014
    • [DOCS][SQL] Add a Note on jsonFile having separate JSON objects per line · 1a9e35e5
      Peter Vandenabeele authored
      * This commit hopes to avoid the confusion I faced when trying
        to submit a regular, valid multi-line JSON file. See also:

        http://apache-spark-user-list.1001560.n3.nabble.com/Loading-JSON-Dataset-fails-with-com-fasterxml-jackson-databind-JsonMappingException-td20041.html
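
The confusion described above can be illustrated with a minimal sketch in plain Python. This is not Spark code: it only assumes, as the commit title says, that each line is expected to hold one complete JSON object. The sample records and the helper `parse_per_line` are hypothetical.

```python
import json

# One self-contained JSON object per line (the layout the note asks for).
line_delimited = '{"name": "Michael"}\n{"name": "Andy", "age": 30}\n'

# A valid JSON document, but pretty-printed across several lines.
pretty_printed = '{\n  "name": "Andy",\n  "age": 30\n}\n'


def parse_per_line(text):
    """Parse every non-empty line as its own JSON document."""
    return [json.loads(line) for line in text.splitlines() if line.strip()]


# Each line parses on its own, giving one record per line.
records = parse_per_line(line_delimited)

# The pretty-printed document fails: its first line, "{", is not valid JSON alone.
try:
    parse_per_line(pretty_printed)
    multi_line_ok = True
except ValueError:  # json.JSONDecodeError is a subclass of ValueError
    multi_line_ok = False
```

A file that is valid JSON as a whole can therefore still be rejected by a reader that treats each line as a separate record.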
      
      Author: Peter Vandenabeele <peter@vandenabeele.com>
      
      Closes #3517 from petervandenabeele/pv-docs-note-on-jsonFile-format/01 and squashes the following commits:
      
      1f98e52 [Peter Vandenabeele] Revert to people.json and simple Note text
      6b6e062 [Peter Vandenabeele] Change the "JSON" connotation to "txt"
      fca7dfb [Peter Vandenabeele] Add a Note on jsonFile having separate JSON objects per line
    • [SQL] SPARK-4700: Add HTTP protocol spark thrift server · 17688d14
      Judy Nash authored
      Add HTTP protocol support and test cases to the Spark thrift server, so users can deploy the thrift server in either TCP or HTTP mode.
      
      Author: Judy Nash <judynash@microsoft.com>
      Author: judynash <judynash@microsoft.com>
      
      Closes #3672 from judynash/master and squashes the following commits:
      
      526315d [Judy Nash] correct spacing on startThriftServer method
      31a6520 [Judy Nash] fix code style issues and update sql programming guide format issue
      47bf87e [Judy Nash] modify withJdbcStatement method definition to meet less than 100 line length
      2e9c11c [Judy Nash] add thrift server in http mode documentation on sql programming guide
      1cbd305 [Judy Nash] Merge remote-tracking branch 'upstream/master'
      2b1d312 [Judy Nash] updated http thrift server support based on feedback
      377532c [judynash] add HTTP protocol spark thrift server
  31. Dec 04, 2014
    • Fix typo in Spark SQL docs. · 15cf3b01
      Andy Konwinski authored
      Author: Andy Konwinski <andykonwinski@gmail.com>
      
      Closes #3611 from andyk/patch-3 and squashes the following commits:
      
      7bab333 [Andy Konwinski] Fix typo in Spark SQL docs.
  32. Dec 01, 2014