-
- Downloads
[SPARK-14103][SQL] Parse unescaped quotes in CSV data source.
## What changes were proposed in this pull request? This PR resolves the problem during parsing unescaped quotes in input data. For example, currently the data below: ``` "a"b,ccc,ddd e,f,g ``` produces a data below: - **Before** ```bash ["a"b,ccc,ddd[\n]e,f,g] <- as a value. ``` - **After** ```bash ["a"b], [ccc], [ddd] [e], [f], [g] ``` This PR bumps up the Univocity parser's version. This was fixed in `2.0.2`, https://github.com/uniVocity/univocity-parsers/issues/60. ## How was this patch tested? Unit tests in `CSVSuite` and `sbt/sbt scalastyle`. Author: hyukjinkwon <gurwls223@gmail.com> Closes #12226 from HyukjinKwon/SPARK-14103-quote.
Showing
- dev/deps/spark-deps-hadoop-2.2 1 addition, 1 deletiondev/deps/spark-deps-hadoop-2.2
- dev/deps/spark-deps-hadoop-2.3 1 addition, 1 deletiondev/deps/spark-deps-hadoop-2.3
- dev/deps/spark-deps-hadoop-2.4 1 addition, 1 deletiondev/deps/spark-deps-hadoop-2.4
- dev/deps/spark-deps-hadoop-2.6 1 addition, 1 deletiondev/deps/spark-deps-hadoop-2.6
- dev/deps/spark-deps-hadoop-2.7 1 addition, 1 deletiondev/deps/spark-deps-hadoop-2.7
- sql/core/pom.xml 1 addition, 1 deletionsql/core/pom.xml
- sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVParser.scala 1 addition, 0 deletions...pache/spark/sql/execution/datasources/csv/CSVParser.scala
- sql/core/src/test/resources/unescaped-quotes.csv 2 additions, 0 deletionssql/core/src/test/resources/unescaped-quotes.csv
- sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/csv/CSVSuite.scala 12 additions, 0 deletions...apache/spark/sql/execution/datasources/csv/CSVSuite.scala
Please register or sign in to comment