Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
reduce cpu host overhead when using moe (#5578)
The operation `.to('cpu') `is not necessary for exp_counts, and it will cause device to host synchronization which damage performance. Co-authored-by: Olatunji Ruwase <[email protected]>
- Loading branch information