-
Notifications
You must be signed in to change notification settings - Fork 264
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
support deepspeed tp load int4 checkpoint (#3328)
* tp int4 checkpoint * small changes. * update * update lm_head * unnecessary change * some change on tp * tp update * simplify the code. * modify run_accuracy_with_deepspeed. * modify according to comment. * Support low precision checkpoint with TP in llm.optimize * Revert some changes in llm.optimize * fix bug for gpt-j. * remove unnecessary change. * remove unnecessary change. * fix bug. * support mixtral. * support mixtral. * flake8 format. * fix bug. --------- Co-authored-by: Tao, Ran <[email protected]> Co-authored-by: Xia, Weiwen <[email protected]>
- Loading branch information
1 parent
4679764
commit ae09c58
Showing
7 changed files
with
494 additions
and
64 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.