-
- Downloads
[SPARK-17599] Prevent ListingFileCatalog from failing if path doesn't exist
## What changes were proposed in this pull request? The `ListingFileCatalog` lists files given a set of resolved paths. If a folder is deleted at any time between the paths were resolved and the file catalog can check for the folder, the Spark job fails. This may abruptly stop long running StructuredStreaming jobs for example. Folders may be deleted by users or automatically by retention policies. These cases should not prevent jobs from successfully completing. ## How was this patch tested? Unit test in `FileCatalogSuite` Author: Burak Yavuz <brkyvz@gmail.com> Closes #15153 from brkyvz/SPARK-17599.
Showing
- sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/ListingFileCatalog.scala 10 additions, 2 deletions.../spark/sql/execution/datasources/ListingFileCatalog.scala
- sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/FileCatalogSuite.scala 11 additions, 0 deletions...he/spark/sql/execution/datasources/FileCatalogSuite.scala
Please register or sign in to comment