Revised PoSE #8822
Conversation
Thanks for your contribution!
llm/run_finetune.py
Outdated
model_config.long_sequence_init_args = {
    "dim": int(model_config.hidden_size / model_config.num_attention_heads),
    "max_position_embeddings": data_args.scaled_max_length,  # extended context window
    "base": 10000,
Suggest reading `base` from the model config instead of hard-coding it.
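A minimal sketch of the suggested fix, assuming the config exposes a rope_theta attribute (the usual name in Llama-style configs; the attribute name is an assumption, with 10000 kept as the fallback):

# Read the RoPE base from the model config instead of hard-coding it;
# `rope_theta` is an assumed attribute name, 10000 remains the default.
rope_base = getattr(model_config, "rope_theta", 10000)
model_config.long_sequence_init_args = {
    "dim": int(model_config.hidden_size / model_config.num_attention_heads),
    "max_position_embeddings": data_args.scaled_max_length,  # extended context window
    "base": rope_base,
}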
llm/run_finetune.py
Outdated
    if ptq_ds is not None
    else None
)
train_ds = train_ds.map(partial(trans_func)) if train_ds is not None else None
Please restore this block so the original code's logic is left unchanged.
llm/utils/data.py
Outdated
    return features


def test_preprocess_function(example, tokenizer, inference_length):
If this function is unused, delete it.
Codecov Report
Attention: Patch coverage is
Additional details and impacted files

@@            Coverage Diff             @@
##           develop    #8822      +/-   ##
===========================================
+ Coverage    52.81%   52.91%    +0.10%
===========================================
  Files          677      679        +2
  Lines       107943   108403      +460
===========================================
+ Hits         57010    57365      +355
- Misses       50933    51038      +105

☔ View full report in Codecov by Sentry.
llm/run_finetune.py
Outdated
from utils.data import convert_example_common

    trans_func = partial(convert_example_common, tokenizer=tokenizer, data_args=data_args)
else:
    trans_func = partial(get_convert_example(model), tokenizer=tokenizer, data_args=data_args)

if data_args.zero_padding:
Delete this block.
llm/run_finetune.py
Outdated
    if dev_ds is not None
    else None
)
dev_ds = dev_ds.map(partial(trans_func)) if dev_ds is not None else None
Why was this block changed?
rope_scaling_type: str = field(default=None, metadata={"help": "Rope extension strategy"})
rope_scaling_factor: float = field(default=None, metadata={"help": "Rope extension scaling factor"})
strategy_type: str = field(default=None, metadata={"help": "Long sequence strategy type"})
strategy_name: str = field(default=None, metadata={"help": "Long sequence strategy name"})
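For context, a standalone sketch of how these arguments might sit in an argument dataclass and be populated; the container name and example values below are hypothetical, and Optional typing is used to match the None defaults:

from dataclasses import dataclass, field
from typing import Optional

@dataclass
class LongSequenceArguments:  # hypothetical container name
    rope_scaling_type: Optional[str] = field(default=None, metadata={"help": "Rope extension strategy"})
    rope_scaling_factor: Optional[float] = field(default=None, metadata={"help": "Rope extension scaling factor"})
    strategy_type: Optional[str] = field(default=None, metadata={"help": "Long sequence strategy type"})
    strategy_name: Optional[str] = field(default=None, metadata={"help": "Long sequence strategy name"})

# Example values only; the actual strategy identifiers depend on
# PaddleNLP's long-sequence strategies module.
args = LongSequenceArguments(
    strategy_type="embedding_strategies",
    strategy_name="YaRNScalingRotaryEmbedding",
    rope_scaling_factor=4.0,
)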
done
llm/utils/data.py
Outdated
def get_example_pose(example, tokenizer, data_args):
    if "src" in example:
        source = example["src"]
Please coordinate with the author of the LongLoRA PR (https://github.com/PaddlePaddle/PaddleNLP/pull/8798/files) on whether your data format field should be named src or text. Also, get_example_pose could become a branch inside the tokenize_autogressive function, e.g. if pose: return get_example_pose() — see the sketch below.
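A minimal sketch of that refactor, assuming a boolean use_pose flag on data_args (the flag name is an assumption, not necessarily the PR's actual field):

def tokenize_autogressive(example, tokenizer, data_args):
    # PoSE preprocessing becomes one branch of the shared autoregressive
    # tokenization entry point, per the review suggestion above.
    if getattr(data_args, "use_pose", False):
        return get_example_pose(example, tokenizer, data_args)
    # ... fall through to the existing autoregressive tokenization ...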
done
LGTM
PR types
PR changes
Description
PoSE + YaRN algorithm
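For readers unfamiliar with the two techniques this PR combines: PoSE (Positional Skip-wisE) trains on short chunks whose position ids are skip-shifted to cover an extended window, and YaRN interpolates RoPE frequencies per dimension. The sketch below illustrates both ideas under stated assumptions (the 2-chunk PoSE variant; beta defaults from the YaRN paper); it is not the PR's actual implementation.

import numpy as np

def pose_position_ids(seq_len, scaled_max_length, rng=np.random):
    # PoSE: split a short training sequence into two chunks and shift the
    # second chunk's position ids by a random skip, so the model sees
    # relative distances up to scaled_max_length within a short window.
    # The 2-chunk scheme is one common variant, assumed here.
    split = seq_len // 2
    skip = rng.randint(0, scaled_max_length - seq_len + 1)
    return np.concatenate([np.arange(split), np.arange(split, seq_len) + skip])

def yarn_inv_freq(dim, base=10000.0, scale=4.0, orig_max_pos=2048,
                  beta_fast=32.0, beta_slow=1.0):
    # YaRN "NTK-by-parts": interpolate low-frequency RoPE dimensions by the
    # scale factor, leave high-frequency ones untouched, and blend linearly
    # in between. The beta defaults follow the YaRN paper.
    inv_freq = 1.0 / (base ** (np.arange(0, dim, 2) / dim))
    rotations = orig_max_pos * inv_freq / (2 * np.pi)  # full turns per dim
    interp = np.clip((beta_fast - rotations) / (beta_fast - beta_slow), 0.0, 1.0)
    return inv_freq / scale * interp + inv_freq * (1.0 - interp)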