BMTrain v0.2.2
What's Changed
- Undo a deletion of detach in previous version by @Achazwl in #69
- avoid empty state when justify scale by @Achazwl in #68
- fix run bmtrain with one gpu without torchrun by @Achazwl in #70
- fix inspector grad when tensor is not recorded in some layer by @Achazwl in #90
- fix: make load stream wait default stream after init_parameters by @Achazwl in #78
- temparary fix of bmtrain+opendelta load state dict by @Achazwl in #77
- support multiple input-output in transformerblocklist by @Achazwl in #92 #91
Full Changelog: 0.2.1...0.2.2