Skip to content
  • Dongjoon Hyun's avatar
    83c00876
    [SPARK-23456][SPARK-21783] Turn on `native` ORC impl and PPD by default · 83c00876
    Dongjoon Hyun authored
    ## What changes were proposed in this pull request?
    
    Apache Spark 2.3 introduced `native` ORC supports with vectorization and many fixes. However, it's shipped as a not-default option. This PR enables `native` ORC implementation and predicate-pushdown by default for Apache Spark 2.4. We will improve and stabilize ORC data source before Apache Spark 2.4. And, eventually, Apache Spark will drop old Hive-based ORC code.
    
    ## How was this patch tested?
    
    Pass the Jenkins with existing tests.
    
    Author: Dongjoon Hyun <dongjoon@apache.org>
    
    Closes #20634 from dongjoon-hyun/SPARK-23456.
    83c00876
    [SPARK-23456][SPARK-21783] Turn on `native` ORC impl and PPD by default
    Dongjoon Hyun authored
    ## What changes were proposed in this pull request?
    
    Apache Spark 2.3 introduced `native` ORC supports with vectorization and many fixes. However, it's shipped as a not-default option. This PR enables `native` ORC implementation and predicate-pushdown by default for Apache Spark 2.4. We will improve and stabilize ORC data source before Apache Spark 2.4. And, eventually, Apache Spark will drop old Hive-based ORC code.
    
    ## How was this patch tested?
    
    Pass the Jenkins with existing tests.
    
    Author: Dongjoon Hyun <dongjoon@apache.org>
    
    Closes #20634 from dongjoon-hyun/SPARK-23456.
Loading