    [SPARK-4396] allow lookup by index in Python's Rating · b54c6ab3
    Xiangrui Meng authored
    In PySpark, ALS can take an RDD of (user, product, rating) tuples as input. However, model.predict outputs an RDD of Rating. So on the input side, users can use r[0], r[1], r[2], while on the output side, users have to use r.user, r.product, r.rating. We should allow lookup by index in Rating by making Rating a namedtuple.
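
    The idea can be illustrated with a minimal sketch in plain Python. It assumes only the three field names given above (user, product, rating); the class below is a hypothetical stand-in and not necessarily the exact code merged into pyspark.mllib.recommendation.

```python
from collections import namedtuple


# Hypothetical stand-in for PySpark's Rating, built on namedtuple so that
# fields are reachable both by name and by position.
class Rating(namedtuple("Rating", ["user", "product", "rating"])):
    """A (user, product, rating) record that behaves like a plain tuple."""
    __slots__ = ()  # keep instances as lightweight as ordinary tuples


r = Rating(1, 2, 5.0)

# Attribute access, as on the output side of model.predict.
by_name = (r.user, r.product, r.rating)

# Index access, as with the plain tuples passed in as ALS input.
by_index = (r[0], r[1], r[2])

# Because Rating is a tuple subclass, unpacking also works.
user, product, rating = r
```

    With this shape, code written against plain (user, product, rating) tuples and code written against Rating objects can index records the same way.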
    
    davies
    
    Author: Xiangrui Meng <meng@databricks.com>
    
    Closes #3261 from mengxr/SPARK-4396 and squashes the following commits:
    
    543aef0 [Xiangrui Meng] use named tuple to implement ALS
    0b61bae [Xiangrui Meng] Merge remote-tracking branch 'apache/master' into SPARK-4396
    d3bd7d4 [Xiangrui Meng] allow lookup by index in Python's Rating