[LLM] fix lora target modules on llama #8372
Conversation
Thanks for your contribution!
Codecov Report: All modified and coverable lines are covered by tests ✅

@@            Coverage Diff            @@
##           develop    #8372   +/-   ##
=========================================
  Coverage    55.36%   55.36%
=========================================
  Files          614      614
  Lines        96016    96016
=========================================
  Hits         53164    53164
  Misses       42852    42852

☔ View full report in Codecov by Sentry.
llm/utils.py (outdated diff)
@@ -125,9 +125,10 @@ def get_lora_target_modules(model):
    ".*v_proj.*",
    ".*k_proj.*",
    ".*o_proj.*",
    ".*gate_proj.*",
    ".*qkv_proj.*" ".*gate_proj.*",
Is there a comma missing here?
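The question flags a real Python pitfall: adjacent string literals are implicitly concatenated, so without the comma the two regex patterns merge into one and the list silently loses an entry. A minimal standalone illustration:

```python
# Adjacent string literals concatenate in Python, so the missing comma
# merges two patterns into one and silently drops a list entry.
target_modules = [
    ".*o_proj.*",
    ".*qkv_proj.*" ".*gate_proj.*",  # intended to be two separate entries
]
print(len(target_modules))  # 2, not 3
print(target_modules[-1])   # '.*qkv_proj.*.*gate_proj.*'
```

The merged pattern would require both `qkv_proj` and `gate_proj` to appear in a single layer name, which never happens, so neither module would receive a LoRA adapter.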
Force-pushed from 252fdc4 to eafceb6 (Compare)
LGTM
Merged commits:

* [XPU] llama add xpu support (#8282)
  * [XPU] llama add xpu support
  * fix
  * use try import
  * fix
  * refine
  * refine
  * refine
  * refine
* update (#8399)
* [LLM] Support fuse attention q, k, v weights (#8202)
  1. add use-interface & fuse action
  1.1. modify 1., code order
  2. switch to name_mapping
  3. solve tp branch
  3.2 follow hui, handle qkv separately
  3.3 handle pdparams
  3.4 from torch
  3.5 abandon low_cpu_mem_usage
  3.6 solve shard branch
  * 3.6.1 solve shard branch after rebase develop
  * code clean
  * remove debug comment
  * Redefine fuse and split functions
  * Redefine fuse and split functions
  * comment and fix
  * update method
  * update QKV fuse and split
  * support fuse weights in multi-files
  * add precision compare
  * simplify function call
  * support use_fast_ffn
  * clean modeling and configuration
  * add test for gpt and opt
  * fix tp_actions get
  * add fast_ffn test
  * add Qwen2Moe
  * Revert "add Qwen2Moe" (this reverts commit 113b883)
  * add test for split
  * update doc
  * update filter_dict_keys
  Co-authored-by: Zii <[email protected]>
* [LLM] Fix fuse or split with same key (#8378)
  * fix fuse or split with same key
  * fix
  * fix eps
  * update format
* [LLM] add decay steps option for finetuning (#8251)
* [LLM] add memory stats to logger of trainer (#8269)
* [Distributed] fix lora (#8325)
* [LLM] fix lora target modules on llama (#8372)
* [Distributed] metric calculation supports tp logits (#8370)
  * Update model_utils.py
  * Update model_utils.py
  * Update model_utils.py

Co-authored-by: Jianbang Yang <[email protected]>
Co-authored-by: DrownFish19 <[email protected]>
Co-authored-by: Zii <[email protected]>
Co-authored-by: Tian <[email protected]>
PR types
Bug fixes
PR changes
Others
Description
Fix the LoRA target modules on llama when fuse_ffn & fuse_qkv are enabled (these fused paths are critical for memory usage).
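A minimal sketch of the intent, assuming flag names fuse_attention_qkv / fuse_attention_ffn and the fused layer name gate_up_fused_proj (these names are illustrative, not necessarily the exact code merged here): when fusion is enabled, the separate q_proj/k_proj/v_proj and gate_proj/up_proj layers no longer exist, so the target-module regexes must switch to the fused names or LoRA attaches to nothing.

```python
# Illustrative sketch only; the flag and layer names below are assumptions,
# not necessarily the exact code merged by this PR.
def get_lora_target_modules(config):
    """Return LoRA target-module regexes matching the layers that actually
    exist, depending on which weight fusions are enabled."""
    if getattr(config, "fuse_attention_qkv", False):
        attention = [".*qkv_proj.*"]  # fused q/k/v projection
    else:
        attention = [".*q_proj.*", ".*k_proj.*", ".*v_proj.*"]
    attention.append(".*o_proj.*")

    if getattr(config, "fuse_attention_ffn", False):
        ffn = [".*gate_up_fused_proj.*"]  # fused gate/up projection
    else:
        ffn = [".*gate_proj.*", ".*up_proj.*"]
    ffn.append(".*down_proj.*")

    return attention + ffn
```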