diff --git a/docs/programming-guide.md b/docs/programming-guide.md index 7a16ee8742dc0752ca8922eb0a2583976636cc32..5e0d5c15d7069f50e3b9b98a13ece690ee8b031c 100644 --- a/docs/programming-guide.md +++ b/docs/programming-guide.md @@ -934,6 +934,12 @@ for details. <td> Reshuffle the data in the RDD randomly to create either more or fewer partitions and balance it across them. This always shuffles all data over the network. </td> </tr> +<tr> + <td> <b>repartitionAndSortWithinPartitions</b>(<i>partitioner</i>) </td> + <td> Repartition the RDD according to the given partitioner and, within each resulting partition, + sort records by their keys. This is more efficient than calling <code>repartition</code> and then sorting within + each partition because it can push the sorting down into the shuffle machinery. </td> +</tr> </table> ### Actions