-
- Downloads
[SPARK-18042][SQL] OutputWriter should expose file path written
## What changes were proposed in this pull request? This patch adds a new "path" method on OutputWriter that returns the path of the file written by the OutputWriter. This is part of the necessary work to consolidate structured streaming and batch write paths. The batch write path has a nice feature that each data source can define the extension of the files, and allow Spark to specify the staging directory and the prefix for the files. However, in the streaming path we need to collect the list of files written, and there is no interface right now to do that. ## How was this patch tested? N/A - there is no behavior change and this should be covered by existing tests. Author: Reynold Xin <rxin@databricks.com> Closes #15580 from rxin/SPARK-18042.
Showing
- mllib/src/main/scala/org/apache/spark/ml/source/libsvm/LibSVMRelation.scala 7 additions, 1 deletion...la/org/apache/spark/ml/source/libsvm/LibSVMRelation.scala
- sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/OutputWriter.scala 10 additions, 7 deletions...apache/spark/sql/execution/datasources/OutputWriter.scala
- sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVRelation.scala 7 additions, 1 deletion...che/spark/sql/execution/datasources/csv/CSVRelation.scala
- sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/json/JsonFileFormat.scala 7 additions, 1 deletion...spark/sql/execution/datasources/json/JsonFileFormat.scala
- sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala 1 addition, 1 deletion...sql/execution/datasources/parquet/ParquetFileFormat.scala
- sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetOptions.scala 1 addition, 1 deletion...rk/sql/execution/datasources/parquet/ParquetOptions.scala
- sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetOutputWriter.scala 14 additions, 10 deletions...l/execution/datasources/parquet/ParquetOutputWriter.scala
- sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/text/TextFileFormat.scala 23 additions, 2 deletions...spark/sql/execution/datasources/text/TextFileFormat.scala
- sql/hive/src/main/scala/org/apache/spark/sql/hive/orc/OrcFileFormat.scala 14 additions, 15 deletions...n/scala/org/apache/spark/sql/hive/orc/OrcFileFormat.scala
- sql/hive/src/test/scala/org/apache/spark/sql/sources/CommitFailureTestSource.scala 3 additions, 0 deletions...rg/apache/spark/sql/sources/CommitFailureTestSource.scala
- sql/hive/src/test/scala/org/apache/spark/sql/sources/SimpleTextRelation.scala 3 additions, 0 deletions...ala/org/apache/spark/sql/sources/SimpleTextRelation.scala
Loading
Please register or sign in to comment