InternEvo-v0.4.1dev20240510
sunpengsdu
released this
10 May 08:26
·
76 commits
to develop
since this release
What's Changed
- feat(moe): support topk gating (k>2) by @blankde in #171
- fix(ci): check job status by @kkscilife in #148
- fix get_accelerator error by @sallyjunjun in #179
- feat(internlm): remove use_cuda_flash_attn by @SolenoidWGT in #175
- fix(dipu): fix dipu import rotary by @SolenoidWGT in #183
- fix(model/utils.py): fix unpack data inference squeeze dim and cuda linear wgrad by @huangting4201 in #184
- fix(mlp): fix mlp ckpt save/load by @SolenoidWGT in #181
- Feat(logger): add real tgs computing and logging by @li126com in #174
- Fix(QA): fix some QA code for new version by @li126com in #189
- Fix(CI): fix little bug in yaml by @li126com in #190
- adapt for the newest deeplinkect by @SolenoidWGT in #186
- fix(logger): add filehandler by @JiaoPL in #180
- Fix(CI): fix little bug in yaml once more by @li126com in #191
- feat(multimodal): support train llava with dummy dataset by @Khoray in #91
- fix(logger.py): fix logger that print info twice by @huangting4201 in #192
- fix(logger): no log files by @JiaoPL in #193
- fix test model error by @sallyjunjun in #185
- fix(ci): rm parameter 'update_panel' by @JiaoPL in #194
- fix(solver): fix gpu fused adamw condition by @SolenoidWGT in #196
- fix(multimodal): handle the case when 'input_ids' is None by @JiaoPL in #197
- fix(train/utils.py): fix moe and fp32 param group split when model dtype is fp32 by @huangting4201 in #198
- fix(utils/common.py): assert PYTORCH_CUDA_ALLOC_CONF is None and fix loss test ckpt load failed by @huangting4201 in #201
- Fix(QA): fix monthly test by @li126com in #202
New Contributors
Full Changelog: v0.4.0dev20240403...v0.4.1dev20240510