Skip to content
Snippets Groups Projects
  1. Apr 03, 2017
    • hyukjinkwon's avatar
      [MINOR][DOCS] Replace non-breaking space to normal spaces that breaks rendering markdown · 364b0db7
      hyukjinkwon authored
      # What changes were proposed in this pull request?
      
      It seems there are several non-breaking spaces were inserted into several `.md`s and they look breaking rendering markdown files.
      
      These are different. For example, this can be checked via `python` as below:
      
      ```python
      >>> " "
      '\xc2\xa0'
      >>> " "
      ' '
      ```
      
      _Note that it seems this PR description automatically replaces non-breaking spaces into normal spaces. Please open a `vi` and copy and paste it into `python` to verify this (do not copy the characters here)._
      
      I checked the output below in  Sapari and Chrome on Mac OS and, Internal Explorer on Windows 10.
      
      **Before**
      
      ![2017-04-03 12 37 17](https://cloud.githubusercontent.com/assets/6477701/24594655/50aaba02-186a-11e7-80bb-d34b17a3398a.png)
      ![2017-04-03 12 36 57](https://cloud.githubusercontent.com/assets/6477701/24594654/50a855e6-186a-11e7-94e2-661e56544b0f.png)
      
      **After**
      
      ![2017-04-03 12 36 46](https://cloud.githubusercontent.com/assets/6477701/24594657/53c2545c-186a-11e7-9a73-00529afbfd75.png)
      ![2017-04-03 12 36 31](https://cloud.githubusercontent.com/assets/6477701/24594658/53c286c0-186a-11e7-99c9-e66b1f510fe7.png)
      
      ## How was this patch tested?
      
      Manually checking.
      
      These instances were found via
      
      ```
      grep --include=*.scala --include=*.python --include=*.java --include=*.r --include=*.R --include=*.md --include=*.r -r -I " " .
      ```
      
      in Mac OS.
      
      It seems there are several instances more as below:
      
      ```
      ./docs/sql-programming-guide.md:        │   ├── ...
      ./docs/sql-programming-guide.md:        │   │
      ./docs/sql-programming-guide.md:        │   ├── country=US
      ./docs/sql-programming-guide.md:        │   │   └── data.parquet
      ./docs/sql-programming-guide.md:        │   ├── country=CN
      ./docs/sql-programming-guide.md:        │   │   └── data.parquet
      ./docs/sql-programming-guide.md:        │   └── ...
      ./docs/sql-programming-guide.md:            ├── ...
      ./docs/sql-programming-guide.md:            │
      ./docs/sql-programming-guide.md:            ├── country=US
      ./docs/sql-programming-guide.md:            │   └── data.parquet
      ./docs/sql-programming-guide.md:            ├── country=CN
      ./docs/sql-programming-guide.md:            │   └── data.parquet
      ./docs/sql-programming-guide.md:            └── ...
      ./sql/core/src/test/README.md:│   ├── *.avdl                  # Testing Avro IDL(s)
      ./sql/core/src/test/README.md:│   └── *.avpr                  # !! NO TOUCH !! Protocol files generated from Avro IDL(s)
      ./sql/core/src/test/README.md:│   ├── gen-avro.sh             # Script used to generate Java code for Avro
      ./sql/core/src/test/README.md:│   └── gen-thrift.sh           # Script used to generate Java code for Thrift
      ```
      
      These seems generated via `tree` command which inserts non-breaking spaces. They do not look causing any problem for rendering within code blocks and I did not fix it to reduce the overhead to manually replace it when it is overwritten via `tree` command in the future.
      
      Author: hyukjinkwon <gurwls223@gmail.com>
      
      Closes #17517 from HyukjinKwon/non-breaking-space.
      364b0db7
  2. Dec 10, 2016
  3. Dec 03, 2016
  4. Nov 23, 2016
    • Sean Owen's avatar
      [SPARK-18073][DOCS][WIP] Migrate wiki to spark.apache.org web site · 7e0cd1d9
      Sean Owen authored
      ## What changes were proposed in this pull request?
      
      Updates links to the wiki to links to the new location of content on spark.apache.org.
      
      ## How was this patch tested?
      
      Doc builds
      
      Author: Sean Owen <sowen@cloudera.com>
      
      Closes #15967 from srowen/SPARK-18073.1.
      7e0cd1d9
  5. Oct 12, 2016
    • Sean Owen's avatar
      [SPARK-17840][DOCS] Add some pointers for wiki/CONTRIBUTING.md in README.md... · f8062b63
      Sean Owen authored
      [SPARK-17840][DOCS] Add some pointers for wiki/CONTRIBUTING.md in README.md and some warnings in PULL_REQUEST_TEMPLATE
      
      ## What changes were proposed in this pull request?
      
      Link to contributing wiki in PR template, README.md
      
      ## How was this patch tested?
      
      Doc-only change, tested by Jekyll
      
      Author: Sean Owen <sowen@cloudera.com>
      
      Closes #15429 from srowen/SPARK-17840.
      f8062b63
  6. Jun 14, 2016
    • Adam Roberts's avatar
      [SPARK-15821][DOCS] Include parallel build info · a431e3f1
      Adam Roberts authored
      ## What changes were proposed in this pull request?
      
      We should mention that users can build Spark using multiple threads to decrease build times; either here or in "Building Spark"
      
      ## How was this patch tested?
      
      Built on machines with between one core to 192 cores using mvn -T 1C and observed faster build times with no loss in stability
      
      In response to the question here https://issues.apache.org/jira/browse/SPARK-15821 I think we should suggest this option as we know it works for Spark and can result in faster builds
      
      Author: Adam Roberts <aroberts@uk.ibm.com>
      
      Closes #13562 from a-roberts/patch-3.
      a431e3f1
  7. Dec 04, 2015
    • kaklakariada's avatar
      Add links howto to setup IDEs for developing spark · 17e4e021
      kaklakariada authored
      These links make it easier for new developers to work with Spark in their IDE.
      
      Author: kaklakariada <kaklakariada@users.noreply.github.com>
      
      Closes #10104 from kaklakariada/readme-developing-ide-gettting-started.
      17e4e021
  8. Nov 01, 2015
  9. Oct 04, 2015
  10. Sep 08, 2015
  11. Jun 01, 2015
    • Reynold Xin's avatar
      Update README to include DataFrames and zinc. · 3c015689
      Reynold Xin authored
      Also cut trailing whitespaces.
      
      Author: Reynold Xin <rxin@databricks.com>
      
      Closes #6548 from rxin/readme and squashes the following commits:
      
      630efc3 [Reynold Xin] Update README to include DataFrames and zinc.
      3c015689
  12. May 11, 2015
  13. Mar 12, 2015
  14. Feb 02, 2015
    • Nicholas Chammas's avatar
      [Docs] Fix Building Spark link text · 3f941b68
      Nicholas Chammas authored
      Author: Nicholas Chammas <nicholas.chammas@gmail.com>
      
      Closes #4312 from nchammas/patch-2 and squashes the following commits:
      
      9d943aa [Nicholas Chammas] [Docs] Fix Building Spark link text
      3f941b68
  15. Dec 25, 2014
  16. Nov 09, 2014
    • Sean Owen's avatar
      SPARK-971 [DOCS] Link to Confluence wiki from project website / documentation · 8c99a47a
      Sean Owen authored
      This is a trivial change to add links to the wiki from `README.md` and the main docs page. It is already linked to from spark.apache.org.
      
      Author: Sean Owen <sowen@cloudera.com>
      
      Closes #3169 from srowen/SPARK-971 and squashes the following commits:
      
      dcb84d0 [Sean Owen] Add link to wiki from README, docs home page
      8c99a47a
  17. Oct 28, 2014
    • Ryan Williams's avatar
      fix broken links in README.md · 4ceb048b
      Ryan Williams authored
      seems like `building-spark.html` was renamed to `building-with-maven.html`?
      
      Is Maven the blessed build tool these days, or SBT? I couldn't find a building-with-sbt page so I went with the Maven one here.
      
      Author: Ryan Williams <ryan.blake.williams@gmail.com>
      
      Closes #2859 from ryan-williams/broken-links-readme and squashes the following commits:
      
      7692253 [Ryan Williams] fix broken links in README.md
      4ceb048b
  18. Oct 20, 2014
  19. Sep 16, 2014
    • Nicholas Chammas's avatar
      [Docs] minor punctuation fix · df90e81f
      Nicholas Chammas authored
      Author: Nicholas Chammas <nicholas.chammas@gmail.com>
      
      Closes #2414 from nchammas/patch-1 and squashes the following commits:
      
      14664bf [Nicholas Chammas] [Docs] minor punctuation fix
      df90e81f
    • Sean Owen's avatar
      SPARK-3069 [DOCS] Build instructions in README are outdated · 61e21fe7
      Sean Owen authored
      Here's my crack at Bertrand's suggestion. The Github `README.md` contains build info that's outdated. It should just point to the current online docs, and reflect that Maven is the primary build now.
      
      (Incidentally, the stanza at the end about contributions of original work should go in https://cwiki.apache.org/confluence/display/SPARK/Contributing+to+Spark too. It won't hurt to be crystal clear about the agreement to license, given that ICLAs are not required of anyone here.)
      
      Author: Sean Owen <sowen@cloudera.com>
      
      Closes #2014 from srowen/SPARK-3069 and squashes the following commits:
      
      501507e [Sean Owen] Note that Zinc is for Maven builds too
      db2bd97 [Sean Owen] sbt -> sbt/sbt and add note about zinc
      be82027 [Sean Owen] Fix additional occurrences of building-with-maven -> building-spark
      91c921f [Sean Owen] Move building-with-maven to building-spark and create a redirect. Update doc links to building-spark.html Add jekyll-redirect-from plugin and make associated config changes (including fixing pygments deprecation). Add example of SBT to README.md
      999544e [Sean Owen] Change "Building Spark with Maven" title to "Building Spark"; reinstate tl;dr info about dev/run-tests in README.md; add brief note about building with SBT
      c18d140 [Sean Owen] Optionally, remove the copy of contributing text from main README.md
      8e83934 [Sean Owen] Add CONTRIBUTING.md to trigger notice on new pull request page
      b1c04a1 [Sean Owen] Refer to current online documentation for building, and remove slightly outdated copy in README.md
      61e21fe7
  20. Sep 05, 2014
    • Nicholas Chammas's avatar
      [Docs] fix minor MLlib case typo · 6a37ed83
      Nicholas Chammas authored
      Also make the list of features consistent in style.
      
      Author: Nicholas Chammas <nicholas.chammas@gmail.com>
      
      Closes #2278 from nchammas/patch-1 and squashes the following commits:
      
      56df319 [Nicholas Chammas] [Docs] fix minor MLlib case typo
      6a37ed83
  21. Aug 26, 2014
  22. Aug 23, 2014
    • Kousuke Saruta's avatar
      [SPARK-2963] REGRESSION - The description about how to build for using CLI and... · 323cd92b
      Kousuke Saruta authored
      [SPARK-2963] REGRESSION - The description about how to build for using CLI and Thrift JDBC server is absent in proper document  -
      
      The most important things I mentioned in #1885 is as follows.
      
      * People who build Spark is not always programmer.
      * If a person who build Spark is not a programmer, he/she won't read programmer's guide before building.
      
      So, how to build for using CLI and JDBC server is not only in programmer's guide.
      
      Author: Kousuke Saruta <sarutak@oss.nttdata.co.jp>
      
      Closes #2080 from sarutak/SPARK-2963 and squashes the following commits:
      
      ee07c76 [Kousuke Saruta] Modified regression of the description about building for using Thrift JDBC server and CLI
      ed53329 [Kousuke Saruta] Modified description and notaton of proper noun
      07c59fc [Kousuke Saruta] Added a description about how to build to use HiveServer and CLI for SparkSQL to building-with-maven.md
      6e6645a [Kousuke Saruta] Merge branch 'master' of git://git.apache.org/spark into SPARK-2963
      c88fa93 [Kousuke Saruta] Added a description about building to use HiveServer and CLI for SparkSQL
      323cd92b
  23. Aug 22, 2014
  24. Aug 20, 2014
    • Patrick Wendell's avatar
      SPARK-3092 [SQL]: Always include the thriftserver when -Phive is enabled. · f2f26c2a
      Patrick Wendell authored
      Currently we have a separate profile called hive-thriftserver. I originally suggested this in case users did not want to bundle the thriftserver, but it's ultimately lead to a lot of confusion. Since the thriftserver is only a few classes, I don't see a really good reason to isolate it from the rest of Hive. So let's go ahead and just include it in the same profile to simplify things.
      
      This has been suggested in the past by liancheng.
      
      Author: Patrick Wendell <pwendell@gmail.com>
      
      Closes #2006 from pwendell/hiveserver and squashes the following commits:
      
      742ea40 [Patrick Wendell] Merge remote-tracking branch 'apache/master' into hiveserver
      034ad47 [Patrick Wendell] SPARK-3092: Always include the thriftserver when -Phive is enabled.
      f2f26c2a
  25. Aug 13, 2014
  26. Jul 15, 2014
  27. Jul 11, 2014
    • Kousuke Saruta's avatar
      [SPARK-2457] Inconsistent description in README about build option · cbff1877
      Kousuke Saruta authored
      Now, we should use -Pyarn instead of SPARK_YARN when building but README says as follows.
      
          For Apache Hadoop 2.2.X, 2.1.X, 2.0.X, 0.23.x, Cloudera CDH MRv2, and other Hadoop versions
          with YARN, also set `SPARK_YARN=true`:
      
            # Apache Hadoop 2.0.5-alpha
            $ sbt/sbt -Dhadoop.version=2.0.5-alpha -Pyarn assembly
      
            # Cloudera CDH 4.2.0 with MapReduce v2
            $ sbt/sbt -Dhadoop.version=2.0.0-cdh4.2.0 -Pyarn assembly
      
            # Apache Hadoop 2.2.X and newer
            $ sbt/sbt -Dhadoop.version=2.2.0 -Pyarn assembly
      
      Author: Kousuke Saruta <sarutak@oss.nttdata.co.jp>
      
      Closes #1382 from sarutak/SPARK-2457 and squashes the following commits:
      
      e7b2d64 [Kousuke Saruta] Replaced "SPARK_YARN=true" with "-Pyarn" in README
      cbff1877
  28. Jul 10, 2014
  29. May 19, 2014
    • Matei Zaharia's avatar
      [SPARK-1876] Windows fixes to deal with latest distribution layout changes · 7b70a707
      Matei Zaharia authored
      - Look for JARs in the right place
      - Launch examples the same way as on Unix
      - Load datanucleus JARs if they exist
      - Don't attempt to parse local paths as URIs in SparkSubmit, since paths with C:\ are not valid URIs
      - Also fixed POM exclusion rules for datanucleus (it wasn't properly excluding it, whereas SBT was)
      
      Author: Matei Zaharia <matei@databricks.com>
      
      Closes #819 from mateiz/win-fixes and squashes the following commits:
      
      d558f96 [Matei Zaharia] Fix comment
      228577b [Matei Zaharia] Review comments
      d3b71c7 [Matei Zaharia] Properly exclude datanucleus files in Maven assembly
      144af84 [Matei Zaharia] Update Windows scripts to match latest binary package layout
      7b70a707
  30. May 09, 2014
    • Patrick Wendell's avatar
      SPARK-1565 (Addendum): Replace `run-example` with `spark-submit`. · 06b15baa
      Patrick Wendell authored
      Gives a nicely formatted message to the user when `run-example` is run to
      tell them to use `spark-submit`.
      
      Author: Patrick Wendell <pwendell@gmail.com>
      
      Closes #704 from pwendell/examples and squashes the following commits:
      
      1996ee8 [Patrick Wendell] Feedback form Andrew
      3eb7803 [Patrick Wendell] Suggestions from TD
      2474668 [Patrick Wendell] SPARK-1565 (Addendum): Replace `run-example` with `spark-submit`.
      06b15baa
  31. Apr 19, 2014
    • Reynold Xin's avatar
      README update · 28238c81
      Reynold Xin authored
      Author: Reynold Xin <rxin@apache.org>
      
      Closes #443 from rxin/readme and squashes the following commits:
      
      16853de [Reynold Xin] Updated SBT and Scala instructions.
      3ac3ceb [Reynold Xin] README update
      28238c81
  32. Feb 26, 2014
  33. Jan 09, 2014
  34. Jan 08, 2014
  35. Jan 06, 2014
  36. Jan 04, 2014
  37. Jan 03, 2014
Loading