Skip to content

Commit

Permalink
Documentation: add description for repartitionAndSortWithinPartitions
Browse files Browse the repository at this point in the history
  • Loading branch information
msiddalingaiah committed Nov 20, 2014
1 parent 15cacc8 commit 0fc12d7
Showing 1 changed file with 6 additions and 0 deletions.
6 changes: 6 additions & 0 deletions docs/programming-guide.md
Original file line number Diff line number Diff line change
Expand Up @@ -934,6 +934,12 @@ for details.
<td> Reshuffle the data in the RDD randomly to create either more or fewer partitions and balance it across them.
This always shuffles all data over the network. </td>
</tr>
<tr>
<td> <b>repartitionAndSortWithinPartitions</b>(<i>partitioner</i>) </td>
<td> Repartition the RDD according to the given partitioner and, within each resulting partition,
sort records by their keys. This is more efficient than calling <b>repartition</b> and then sorting within
each partition because it can push the sorting down into the shuffle machinery. </td>
</tr>
</table>

### Actions
Expand Down

0 comments on commit 0fc12d7

Please sign in to comment.